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This  volume  contains  selected  papers  from  the  Eighth  South  Asian 
Languages  Analysis  Roundtable  which  was  held  at  the  University  of  Illinois  in 
May  1986.   The  local  organizing  committee  consisted  of  the  following  members, 
all  South  Asian  linguists  at  the  University  of  Illinois:   Jean  D'Souza,  Hans 
Henrich  Hock  (Chair),  Braj  B.  Kachru   Yamuna  Kachru, . Baber  Khan,  Mithilesh  K. 
Mishra,  Rajeshwari  Pandharipande,  Nalini  Rau,  Sarah  Tsiang,  and  Tamara 
Valentine.   The  national  committee  consisted  of  the  local  committee  plus  Tej 
K.  Bhatia  (University  of  Syracuse),  Edgar  C.  Polom6  (University  of  Texas),  and 
S.N.  Sridhar  (SUNY,  Stony  BrooJc)  . 

The  tradition  of  annual  Roundtables  dedicated  to  the  languages, 
linguistics,  and  literatures  of  South  Asia  was  started  at  the  University  of 
Illinois  in  1979  and  has  been  hosted  altogether  four  times  by  the  University 
of  Illinois .   It  was  inspired  by  a  very  successful  Conference  on  South  Asian 
Languages  and  Linguistics,  also  held  at  the  University  of  Illinois,  as  part  of 
the  1978  Linguistic  Institute,  and  organized  by  Hans  Henrich  Hock  and  Braj  B. 
Kachru. 

The  1986  South  Asian  Languages  Analysis  Roundtable  was  held  in 
conjunction  with  the  'Festival  of  India'  and  honored  Edward  C.  Dimock,  Jr.  for 
his  outstanding  services  and  achievements  in  the  support  and  promotion  of 
South  Asian  studies.   It  was  the  longest  Roundtable  so  far:   Including  a 
presession/colloquium  on  'Modernization  of  Indian  Languages  and  Literatures' 
(organized  by  Braj  B.  Kachru)  and  a  postsession,  the  'Second  Symposiiim  on 
Language  Maintenance  and  Language  Shift  in  South  Asia'  (organized  by  Tej  K. 
Bhatia  and  S.N.  Sridhar),  it  extended  from  28  May  to  1  June.   It  attracted 
more  than  eighty  active  participants,  with  more  than  seventy-five  papers  and 
presentations . 

With  the  exception  of  two  papers,  all  of  the  contributions  to  this 
volume  were  presented  in  general  sessions  of  the  1986  Roundtable.   Papers  from 
the  special  panels  are  expected  to  be  published  separately.   At  this  point,  we 
have  specific  information  on  two  volumes:   The  proceedings  of  the  'Symposium 
on  Sanskrit  Syntax'  are  being  edited  by  Hans  Henrich  Hock,  to  be  combined  with 
papers  from  the  1987  'Second  Symposium  on  Sanskrit  Syntax'  and  to  be  submitted 
to  an  Indian  publishing  house.   A  volume  of  papers  dealing  with  Government  & 
Binding  approaches  to  the  syntax  of  South  Asian  languages  is  being  prepared  by 
Alice  Davison  (University  of  Wisconsin,  as  of  1987/88)  and  Kashi  Wali 
(Universities  of  Cornell  and  Syracuse) ;  it  is  planned  to  appear  as  a  special 
issue  of  the  Cornell  University  Working  Papers  in  Linguistics.   Other  special 
volumes  that  are  in  the  planning  stage  include  'Language  and  Modernization, ' 
edited  by  Braj  B.  Kachru,  and  'Language  of  Religion, '  edited  by  Rajeshwari 
Pandharipande . 

Last  but  not  least,  I  would  like  to  acknowledge  the  help  of  my 
colleagues,  Braj  B.  Kachru,  Yamuna  Kachru,  and  Rajeshwari  Pandharipande,  in 
editing  this  volume. 

Hans  Henrich  Hock 
Editor 
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ASPECTUAL  ELEMENTS  OF  SIMULTANEITY  AND  ITERATION  IN  INDIAN  LANGUAGES: 
A  CASE  FOR  AN  AREAL  UNIVERSAL 

Anvita  Abbi 
Jawaharlal  Nehru  University 

Mithilesh  Kumar  Mishra 
University  of  Illinois 


Reduplicated  structures  in  the  South  Asian  languages  are  used  as 
markers  of  various  aspectual  distinctions.     In  the  present  paper,  we 
concentrate  on  only  two  of  those  -  aspects  of  simultaneity  and  iteration. 
In  our  discussion,  we  have  relied  on  data  from  30  Indian  languages  of  the 
Indo-Aryan,  Dravidian.  Austro-Asiatic  and  Tibeto-Burman  families.    In 
our  description  of  the  aspect  of  simultaneity  using  reduplicated 
structures,  we  propose  three  subtypes  of  simultaneity  and  their 
interactions  with  lexical  and  grammatical  aspects.   The  overall  pattern 
that  emerges  is  that  within  Dravidian,  all  sub-types  of  reduplicated 
structures  with  simultaneity  function  occur  quite  rarely  in  the  southern 
literary  languages  and  that  their  frequency  increases  with  greater 
vicinity  to  the  other  Indo-Aryan  and  Munda  languages  in  which  the 
occurrence  of  these  structures  is  more  or  less  at  100%.   Reduplicated 
structures  with  aspectual  value  of  iteration  occur  in  almost  all  the 
languages  under  investigation  and  therefore  seem  to  be  the  most  pan- 
Indian  feature.   By  showing  the  large  presence  of  reduplicated  structures 
with  function  of  simultaneity  and  iterative  aspects  in  genetically 
unrelated  Indian  languages,  we  propose  reduplicated  structures  as  an 
areal  universal. 


1.  In  recent  years,  the  topic  of  aspect  has  been  investigated  In  a  number  of  publications, 
Including  Comrie  1976,  Friedrich  1974,  Hopper  1979,  1982a&b,  and  Timberlake 
1982.  There  are  as  many  definitions  and  interpretations  of  the  notion  of  'aspect'  as 
there  are  authors  and  schools.  The  Slavic  School  has  always  maintained  a  distinction 
between  aspect  and  the  category  of  Aktionsart  ('kind  of  action').   However,  it  is  not 
always  clear  where  the  boundary  between  aspect  and  Aktionsart  is  to  be  drawn  and 
perhaps  that  is  the  reason  why  many  authors,  including  Comrie,  eschew  the  distinction 
altogether.   Western  linguists  working  on  non-Slavic  languages  have  conveniently 
defined  aspect  as  all  those  phenomena  which  are  not  tense  or  modality.  Authors  like 
Timberlake  (1982)  regard  aspect  at  the  sentence  or  word/phrase  level,  while  Hopper 
(1982a)  restricts  the  term  aspect  to  discourse  level  pragmatics  and  'characterizable  as 
completed  event  in  the  discourse'. 

In  the  present  paper  we  propose  to  look  at  the  aspectual  value  of  'Reduplicated 
Structures'  (henceforth  RS)  in  Hindi  and  other  South  Asian  languages.  For  the  purposes 
of  our  discussion,  we  make  a  fundamental  distinction  between(at  least)  three  types  of 
aspect:  (a)  Lexical  aspect,  i.e.  the  inherent,  semantically  determined  aspect  of  verbs. 
Compare  the  Hindi  distinction  between,  say,  t2filIlL.'speak'  (imperfective/continuative) 


and  ijima  'fall'  (perfective/completlve/telic).    For  convenience,  we  limit  the 
terminology  to  'imperfective'  vs.'perfectivel 

(b)  The  combined  category  tense/aspect,  i.e.  the  grammatically  determined  aspectual 
reading  of  different  tense-oriented  verbal  constructions,  such  as  in  Hindi 

(1)   us  ne  l^nsb  parhi 

'He  read  a  book'  (perfective) 
vs. 

vo  kItSb  parh  rahi  thi 

'He  was  reading  a  bool<'  (imperfective) 

Again,  we  will  limit  our  terminology  to  'perfective'  vs.  'imperfective'. 

Note  that  lexical  aspect  and  tense-aspect  ordinarily  are  independent  of  each  other,  so 
that  both  lexically  perfective  and  imperfective  verbs  can  occur  in  both  perfective  and 
imperfective  tenses: 


(2)  lexically  imperfective  lexically  perfective 
perfective  tense                    vo  boia  vo  gIrSL 

('He  spoke')  ('He  fell  down') 

imperfective  tense      vo  bol  rah3  thS  vo  gir  rahS  th3 

CHe  was  speaking*)  ('He  was  falling') 

(c)       The  aspectual  meaning  of  reduplicated  structures  (RS)  of  the  type  (3) 
which  will  form  the  main  topic  of  our  discussion. 

(3)  vo  pSn  becte  becte  bolS 

He  betel  sell-imp.  sell-imp.  spoke 
'He  spoke  white  selling  betels.' 


Here,  becte  becte  being  verbal  in  nature  and  adverbial  in  function  will  be  termed 
Reduplicated  Yerbal  Adverbs  (RVAOV);  for  detailed  discussion  see  Abbi  (1975,  1980). 
As  will  be  seen,  lexical  aspect  and  grammatical  tense-aspect  interact  in  a  very 
interesting  manner  in  these  reduplicated  structures. 

2.  The  Indian  languages  have  different  morphological  devices  to  indicate  various 

aspectual  distinctions.  One  of  the  most  important  and  pan-Indian  devices  has  been 

reduplicating  verbal  adverbs  of  the  type  (3).    Reduplicated  verbal  adverbs  (RVADV)  in 

conjunction  with  the  appropriate  main  verbs  (MV)  indicate  four  significant  aspectual 

categories.  They  are: 

(i)        Simultaneity 

(ii)        Antithetical  ('non-precipitative'  action) 

(iii)      Continuation-Duration 

(iv)      Iteration 

In  the  present  paper  we  examine  and  concentrate  on  only  two  of  these  aspects  — 
simultaneity  and  iteration. 


3.  Simultaneity:  what  might  be  called  an  aspect  of  simultaneity  can  be  expressed  in 
Indian  languages  by  temporal  phrases  or  clauses  introduced  by  expressions  like  'while.  . 
.',  'at  the  time  of. . .',  and  'when. . .'.  However,  these  language,  also  offer  variant 
structures  involving  reduplicated  structures  (RS).    It  is  not  easy  to  determine  for  all 
languages  whether  RS  are  simpe  alternatives  to  temporal  phrases  or  clauses,  or  whether 
they  are  preferred  under  certain  conditions  or  disfavored  under  other  conditions. 
Preference  tests  were  found  to  be  too  difficult  to  administer  and  they  tend  to  be  highly 
subjective  .  However,  for  some  languages  we  have  responses  from  informants  to  the 
extent  that  RS  is  preferred  in  certain  discourse  contexts  or  is  even  the  only  acceptable 
structure.   We  leave  this  point  open  for  further  investigation  and  return  to  the  notion  of 
simultaneity  as  expressed  by  RS. 

Simultaneity  identifies  the  synchronization  of  two  events  or  actions  (non-states)  at 
a  particular  point  of  time  which  in  turn  may  signal: 

(A)  the  completion  of  both  events  or  actions  together;  or 

(B)  protraction  of  one  of  the  events  or  actions;  or 

(C)  continuity  of  two  events  or  actions  for  a  certain  period  of  time. 

The  two  events  or  actions  discussed  here  are  represented  by  RVADV  and  MV.  In 
unmarked  order  RVADV  precedes  MV.  except  In  non-verb-final  languages  like  Kashmiri 
and  Khasi. 

The  semantic  nature  of  the  two  events  or  actions  entering  into  the  relationship  of 
simultaneity  determines  which  one  of  the  three  readings  above  will  obtain. 

3.1   The  diagrammatic  representation  of  the  aspectual  reading  of  the  type  (A)  would  be 
as  In  figure  1 . 


Figure  1.    Simultaneity,  type  (A) 

Here,  as  in  the  other  figures  which  foih)w.  the  arrow  marked  X  represents  RVADV 
and  the  arrow  Y  represents  MV.  The  constraint  on  the  relationship  of  this  type  between 
RVADV  and  MV.  at  least  in  Hindi,  Is  that  both  have  to  be  lexically  and.  grammatically 
(tense/aspect)  perfective.   Consider  the  following  examples. 

(4)  vo  baithte  baithte  gIrS/glr  gayS 
'He  fell  while  sitting  down.' 


(5)  *  vo  baUhte  bai^hte  glirtS  hai 

'He  falls  while  sitting  down.' 

(6)  *  vo  bai^hte  bai^hte  gir  rahS  thS 

'He  was  falling  while  sitting  down. 


(4)  Is  grammatical  because  bal\h  'sif  and jaJL 'fall*  are  both  lexically  perfective  and 
the  tense/aspect  of  the  total  structure  is  also  perfective.   Sentences  (5)  and  (6)  are 
ungrammatical  because  though  RVADV  and  MV  are  lexically  perfective,  grammatically 
(tense/aspect)  the  total  structure  is  imperfective. 

We  have  not  made  a  detailed  investigation  of  the  other  South  Asian  languages 
concerning  this  interrelationship  between  lexical  and  grammatical  aspect  on  one  hand 
and  the  aspectual  reading  of  structures  of  the  type  (A).  However,  our  data  contain 
structures  like  the  one  in  (7),  in  which  a  Hindi  type  (A)  structure  has  at  least  surface 
parallels  in  the  other  languages.  What  needs  to  be  further  studied  is  whether  these 
parallels  are  subject  to  the  same  restrictions  as  in  Hindi.    (In  this  and  other  later 
examples,  the  RVADV  is  set  off  by  underlining). 


(7)   'X  divided  the  property  at  the  time  of  leaving  the  country' 

HINDI:  (X  ne)  des  flhoj-ta  fthoj'tft  dhan  ka  b&Kylri.  klyS 

ASSAMESE:  dekh  erunte  erunte   hsmptir  bhSg  korile 

BENGALI:  (Rim)  des  charlte  charlte  sab  sampati  bete  dlye  che 

DQGRI:  dei  cho(jde  cho^de  (one)  dhanS  dlbSd  kit! 

GUJARAT!:  des  chodta  chodts  (emne)  palso  wSto  dido 

KQbJl^^M:  des  sopo  sopo  (tenni)  dUdda  watleti 

MARATHJ:  des  SQf  ta  SQf ta  (taini)  sampatti  wStii 

MAITHILI:  des  chofajt  ctioralt  (0)  dhanak  batbSrJ  kelak 

ORIYA:  des3  chap  chaj-u  (se  mane)  dhano  bStire 

PUNJABI:  mUlk  ehadde  cha^e  than  da  batvSri  kitta 

SINDHI:  mUlk  chadande  chaij/ande  (hUnan)  paise  ja  hissi  kalyS 

SADRI:  des  cho|-te  choj-te  dhan  ke  bait  deb 

KABUl:  kamai  gatou  dudu  na  sae 


BIRIJIA:  (htiku^  das  bSe  bSa  jo-joma  hatlii  laku 

CB£Ctt  des  8mte  Smte  urme  dhannan  xoUiyas  Ikccas 

KHARIA:  hoTom  melay  melay  kon  hokki  ba7a  kamu  kimS 


These  examples  employ  two  lexically  perfective  verb  forms  •  one  as  RVADV  and  the 
other  as  MV-,  as  in  Hindi,  chops  "leave'  and  ba^vSra  kamS  'divide',  to  show  the 
synchronization  of  two  events  or  actions  which  get  completed  at  the  same  time. 

What  is  interesting  is  that  the  Dravidlan  languages  •  Tamil,  Malayalam,  Kannada, 
and  (marginally)  Telugu  do  not  show  this  kind  of  simultaneity  by  RS.   Oraon  (also  known 
as  Kurukh)  is  the  single  exception. 

Hindi  and  other  Indo-Aryan  languages  offer  another  interesting  example  of  this  kind 
of  simultaneity  between  the  verb  (Hindi)  ba|na    'strike  (of  a  clock)'  and  the  verb 
(Hindi)  pahOcnS  'reach',  both  perfective  verbs.  (Note  that  although  pahtJcnS  can 
occasionally  be  used  imperfectively,  in  (8)  it  can  only  be  used  perfectively.)   The  Hindi 
construction  is  shown  In  (8)  along  with  the  parallels  from  other  South  Asian  languages. 
This  construction  is  not  available  in  many  of  the  Dravidlan  languages,  including  Kannada, 
Kodagu,  Malayalam,  and  Telugu.  Tamil  and  Oraon  are  the  only  Dravidlan  languages  to 
have  this  construction.    Kashmiri  in  Indo-Aryan  and  Meitei  in  the  Tibeto-Burman 
family  also  do  not  exhibit  this  constructk}n. 


(8)  The  train  reached  Allahabad  by  the  time  It  was  10  o'clock. 


HINDI: 

BENGAU: 


MARATHI 


KQNKANI 


PUNJABI 

SINDHI: 

MAITHILI 

QRIYA: 


rSt  ke  das  balte  baite  oSrl  IlShSbld  pahUd 
nikhav  do  balante  balante  tren  UShSbSd  goi  pile 
ratter  daata  bSjte  bSjte  ^ren  Ilih3b3d  p6cllo 

rSti  das  bajde  bajde  \ren  ilSbSd  pujjl  gayl 

ritri  daha  vasts  vasta  tren  allhSbSd  la  pahudi 

ratti  dah  ghante  watte  watta  ralla  II3.hSb3da  wasun  pawie 

rata  de  das  baldTya  baidlya  gaddi  IlShSbSda  pujjl 

ratja  cTtowaiande  waiande  gl(f  i  IlShSbSd  pahuti 

rait  ke  das  bajait  bajait  tren  UShSbSd  pahucal 

dasati  heyu  heyu  tren  ti  eiihSbSdre  pahud  gola 

OR  ' 

pr^  dastS  pSkhS  pSkhi  trenti  elShSbSdre  pahud  gola 


SADARI:  rat  ke  das  bajte  baite  gari  llShSbid  pahulc  galak 

TAMIL:  mani  patta  adikka  adikka  vandi     IlShSbSd  ai  orendao 

QRACN:  maxlte  das  baite  baite/xantanum    xarxanum  rel  II3h3b3d  arsIyS 

BIRIJIA:  nind  das  bale  bale  te  relgSri  ]l3h3b3d  sodsrienJ 

KHARIA:  idiba^a  u?upha  ba|ena  baiena  rel  HihSlbSd  dam  go^ki 

MllNDARI:  das  hafaiana  batelana  relll5h5bSd  hiiu  tana 

KABUI:  khwan  lii  lii  khou  llengJri  II3.hSb3d  khou  gantane 

PAITE:  daksom  aglnijTn  lain  Il3hSbSd  ka  tunta 


3.2.  The  simultaneity  of  the  type  (B)  can  be  diagrammatically  represented  as 
follows: 


Y 


Figure  2.    Simultaneity,  type  (B). 


Figure  2  identifies  those  situations  in  which  RVADV  is  lexically  imperfective  and  MV 
is  lexically  perfective.   As  in  type  (A)  ,  the  grammatical  aspect  of  the  entire 
construction  is  Hindi  can  only  be  perfective.   Compare  the  examples  in  (9)  and  (10). 


(9)  vo  dDfte  done  mar  gay5    (perfective) 

'He  died  while  running'. 

(10)  *vo  6or\e  done  mar  rahS  thJ  (imperfective) 

'He  was  dying  while  running.' 


In  situation  of  this  type  '9).  the  event  or  action  denoted  by  RS  may  or  may  not  be 
continued  beyond  the  completion  of  the  event  or  action  manifested  by  the  main  verb 
(MV).   For  example,  the  Punjabi  sentence  (11)  below  may  mean  that  the  action  caldl 
cald^  'walking  walking'  was  prolonged  even  after  its  coincidence  with  the  action  of  s^ 
BfiyI  'fell  down'.   If  this  were  not  so,  the  sentence  in  (12)  would  be  ungrammatical, 
which  it  evidently  is  not: 


(11)  ocal£i2.caid5  dlgpeyl 

he  walking  walking  fell  down 
'He  fell  down  while  walking.' 

(12)  o  cslda  caldS  dig  peya  si  tavi  caldS  reya 

'He  fell  down  while  walking  even  then/still  he  kept  walking.' 

However,  the  same  type  of  construction  will  be  ungrammatical  in  type  (A) 
constructions,  where  the  RVADV  is  lexically  perfective.   Consider  the  example  In  (13): 

(13)  *o  tren  ca  cards  card5  dig  peya  hor  b5d  vi  cardS  reya 

'He  fell  while  boarding  the  train  and  continued  to  board  the  train  thereafter.' 

The  possibility  of  continuation  of  one  of  the  events/actions  after  being  simultaneous 
with  the  other  for  some  time  is  illustrated  by  the  examples  in  (14). 


HM21: 

larki  calte  caltfl  gir  papi 

BENSAU: 

chele  cQlte  CQlte/iete  ieta  pofe  gelo 

ASSAMESE: 

lora  jane  \Sq\e  laijte  pori  gol 

OR 

lortu  khuj  karhi  Karhl  pori  gol 

MAITHIU: 

chaura  calalt  calalt  khals  paral 

ORIYA: 

puscaiu    calupurigali 

SADARI: 

chaUiiaiajaiaglrgelak 

mundacald3L2alda<3l3peya 

CQSBl: 

OR 

Jagat  sara  ^url  juri  tei  payi 

KASHMIRI: 

ladkt  pyav  cak^njjalian  vas' 

GUJARAT): 

chokro  caltQ  ClItQ  P^di  gayo 

MARATHI: 

mulqa  calt5  c^|tS  padl5 

KONKANI: 

callo  dhaUn  dhalJn  oadlo 

KANNADA:       hudugs  narits  nafits  sustsds 

OR 

huduga  noldedu  noTdedu  sustidS 
QB£Cti:  kukkos  Tknnm  Iknum/ekte  ekte  khattaras  keras 

BIRIJIA:  sengo  sene  sene  te  thaka  yana 

MUNDARI:         kora  hon  sen  sen  te  LHUjane 
KHARIA:  kongher  cona  cong  qur  golki 

MEITEI:  ahari  adu  cat-na  cat-na  coktha-ram-i 

PAITE:  naupafiBaLcalaketa 

KABUI:  ganmaina  ta?lafi  ta?lan-na  bunkai  kathe 

GANGfTE:  naopan  pa  6u  ci?  kom    ksm  Inapukt'ai 


Another  example  of  this  type  was  found  in  all  the  Indo-Aryan,  Tibeto-Bumian,  and 
Munda  languages,  while  the  Dravidian  group  showed  some  variations:  Malayalam, 
Kannada  (with  some  exceptions),  and  Telugu  speakers  emphatically  denied  the 
availability  of  such  constructions  in  their  languages,  while  Oraon.  Tamil,  and  Kodagu 
speakers  supplied  the  forms  without  hesitation.   Compare  the  data  in  (15). 


(15)    While  talking  tears  came  into  his/her  eyes. 

HINDI:  bSt  karte  karle  uski  IkhO  me  §su  S  gaye 

KODAGLh  takka  parindanda  parindanda  ippake  kannle  nir  bate 

QBfiCfcJ:  kasana  khirte  khirte  khannati  I  o  rkhatre 

TAMIL:  pesa  pesa  avan  kannil  llrunda  nir  varindada 

IHADQ:  aho  pum  pumin  amlkhi  ap d  tt5 

KHARIA:  kayamni  kayamna  hokra?  mon7nte  ramo"?!!  da?  abhore  gotki 


It  is  difficult  to  explain  why  Tamil  allows  such  construction,  while  Malayalam, 
Telugu,  and  Kannada  do  not.  At  this  point,  all  we  can  say  is  that  the  simultaneity  of  the 
type  (B)  (discussed  above)  is  not  fully  represented  in  the  Dravidian  languages. 


3.3.    Continuity  of  two  non-perfective  events  or  actions  for  a  certain  period  of  time  can 
be  diagrammaticaily  represented  as  in  figure  3. 

X 
> 


Y 

Figure  3.    Simultaneity,  type  (C). 


In  this  type  of  simultaneity,  both  the  RVADV  and  MV  are  lexically  imperfective  and  the 
tense/aspect  of  the  total  structure  can  be  either  perfective  or  imperfective.  Compare 
the  Hindi  examples  in  (16)  and  (17). 


(16)  vo  daurte  daujie  bol5  (perfective  tense/aspect) 

'He  spoke  while  running.' 

(17)  vo  daurtQ  daurte  bol  rahS  thS    (Imperfective  tense/aspect) 

'He  was  speaking  while  running.' 

This  structure  is  found  in  large  variety  of  South  Asian  languages;  cf.  the  examples  in 
(18)  and  (19). 

(18)  HINDI:  Kani  ealte  calte  gSnS  g3  rahi  thi 

'Kani  was  singing  a  song  while  walking.' 

BENGALI:  o  duray  duray  gSn  gSchilQ 
'He  was  Singing  a  song  while  running.' 

(19)  'He  spoke  while  ninning  (away)' 
ASSAMESE:  teu  daurl  dauri  ko  le 
CQSBl:  0  nasde  nasde  boiya 
GUJARAT):  o  darto  dsrto  bolyo 
KQNKANJ:  to  dauto  dauto  sanglo 
MARATHJ:  to  dhauta  dhauta  manSIS 
MAITHILI:  o  bhapait  bhagalt  bJjal 
QRIYA:  se  pappafs  ko  hlla 
PUNJABI:  odPfdl  dofda  bolya 
SADARI:  u  bhSate  hhSqtfl  bollak 


10 


KANNADA: 

avanu  vorta  vorta  herdS 

KQDAGU: 

CRfiCN: 

as  bhufignum  bhuhgnum  bscas 

MM2AB1: 

nir  nir  bakhari  tanal 

KHARIA: 

hokkar  yarta  varta  gamo 

ISABLU: 

kamaicaiiLQal&Z  naslc 

PAITE: 

amataitaiiailnapau 

Interestingly,  the  Dravidian  languages  in  general,  and  Tamil  in  particular,  do  not  exhibit 
R.S.  of  this  type.  Barring  examples  of  the  type  (15)  above,  Tamil  speakers  do  not  use 
RS  for  simultaneity  aspect. 

3.4.  In  fact,  from  the  examples  examined  so  far,  as  well  as  from  many  others  in  our  data 
collection,  it  becomes  clear  that  in  Dravidian,  all  subtypes  of  RSs  with  simultaneity 
function  occur  quite  rarely  in  the  southern  literary  languages  and  that  their  frequency 
increases  with  greater  vicinity  to  the  other  Indo-Aryan  and  Munda  languages  in  which 
the  occurrence  of  these  structures  is  more  or  less  at  100%.  Compare  the  percentages 
given  in  Table  1 . 


Oraon  -  100% 

Kodagu  •  100% 

Telugu  -  60% 

Kannada  •  60% 
Malayalam    33% 

Tamil-  10% 

Table  1. 


The  Indo-Aryan,  Munda  and  the  Tibeto-Burman  languages  show  more  or  less  100% 
occurrence  of  the  simultaneity  aspect  using  RS. 

3.5.   Up  to  this  point,  we  have  examined  the  aspect  of  simultaneity  in  construction  in 
which  the  agent  of  the  RVADV  and  that  of  the  MV  have  been  identical.  Now,  we  will  show 
cases  where  the  simultaneity  of  two  events  or  actions  involves  two  different  agents.  As 
far  as  we  can  tell,  the  aspectual  relationship  between  RVADV  and  MV  can  be  of  the  same 
nature  as  for  the  types  (A),  (B),  and  (C)  above. 
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The  sentences  In  (20)  illustrate  the  situation.   Most  of  the  languages  assign  genitive 
marking  to  the  agent  noun  of  the  RVAOV. 


(20)   'He  drowned  In  my  presence.'  (Literal:   He  drowned  with  my  looking  looking') 

UMii'  mere  dekhte  dekhte  vo  dub  gayS 

BSNGAU:  Smir  dekhlte  dekhlte  s'e  dube  gek> 

QRIYA:  mu  dekhu  dekhu  se  buri  gola 

ASSAMESE:  Not  available 

SiHDHl.:  mUhtje  dlsande  (jlsande  hu  bud:  vyo 

GUJARATI:  mhara  johta  iohta  te  dubi  gayo 

MARATHi:  mi  catll5_Catlli  to  buria 

PUNJABI:  mere  wekhdeya  wekhdeya  o  dub  gyi 

S£Q£B1:  mor  dekhte  deKhte  u  dub  galak 

MAITHIU:  hamra  dekhlte  dekhlte  o  duib  gel 

CQSBl:  mere  dikhde  dikhde  o  dubbi  gaySl 

KANNADA:  nlnu  noritta  noritta  avanu  mulugi  hodana 

£B&Cti''  eiihae   erte  erte  Ss  murkh  lyas  karas 

KQQAGtJ:  nan  notlanda  notianda  ippalCaav'S  mufugl  pod 

MUNDARl!  lei  lei  te  hun  ui  iqjans 

BIRIJIA:  Not  available 

KHARIA:  i]iya7a  yona  yong  hokkar  dube  gotki 

METEI:  ai-na  una  una  mahak  irakna  rammi 

PAITE:  ka?  et  et  lai  Inatumta 

KABUI:  S  hou  tan  tanna  kamai  phlankai  gane 

LAHULI:  oiu  khanda  khanria  dii  riiiha  gi  iti 
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For  Tamil  we  could  not  get  an  example  of  the  type  (20).  But  note  the  acceptability  of 
(21). 

(21)    TAMIL:    raman  pustakam  paddike  paddike  nan  unnava  tavaritten 

'I  cooked  the  whole  dinner  while  Ram  was  reading  the  book.' 
(Literally:   Ram  book  reading  reading  I  cooked  the  whole  dinner) 

When  we  examine  our  data  for  the  simultaneity  of  two  actions  or  events  performed 
by  two  non-identical  agents  (as  illustraed  in  (20))  the  following  pattern  emerges: 
Tamil  (with  few  exceptions),  Telugu,  Malayalam  and  Kannada  (with  few  exceptions)  in 
the  Dravidian  group,  and  Thado  in  Tibeto-Burman  do  not  exhibit  this  type  of 
construction.  Assamese  in  the  Indo-Aryan  branch  also  does  not  offer  simultaneity  of  two 
events  or  actions  with  two  non-identical  agents. 

4.  Iteration:  What  may  be  called  an  aspect  of  iteraton,  i.e.  of  the  repeated  occurrence  of 

an  event  or  action  at  short  intervals  of  time  is  manifested  in  the  following  ways: 

(i)     Reduplicated  verbs  with  imperfective  ending,  and 

(ii)    Reduplicated  verb  root  followed  by  conjunctive  participle  (CP)  marker. 

These  two  possibilities  can  be  exemplified  by  the  Hindi  examples  (22)  and  (23) 
respectively. 


(22)  VQ  aana  sunte  sunte    thak  gayS 

He  song  listen  listen  tire  went 
'He  got  tired  listening  to  the  song.' 

(23)  vo  gana  smLSlULkat  thak  gaya 

He  song  listen  listen  CP  marker  tire  went. 
'He  got  tired  having  listened  to  the  song.' 

The  repetition  of  an  action  or  an  event  represented  by  the  RS  might  have  a  resultant 
effect  on  the  action  of  the  main  verb  (causal  adverbs),  or  it  might  simply  indicate  the 
manner  -  one  of  the  three  types  of  simultaneity  discussed  above  -  in  which  the  action 
manifested  in  the  main  verb  is  represented.  Thus,  (24)  below  is  ambiguous  between 
(i)  He  died  because  of  eating  (too  much)  food  (causality) 
(ii)  He  died  while  eating  food  (simultaneity). 


(24)  vo  khana  hhSte  KhSte  mar  gayl 
He  food     eat     eat  die   went 


(24)  is  an  extreme  case  in  the  sense  that  context  alone  will  provide  the  disambiguating 
cues.  However,  sentences  of  the  type  (24)  are  not  always  ambiguous  because  the 
semantic  relationship  between  the  main  verb  and  the  reduplicated  iterative  verb  often 
clearly  narrows  down  one  of  the  two  possible  readings.  (For  detailed  discussion,  see 
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Abbj  1975,  1977)  consider  the  examples  in  (25)  and  (26)  for  iteration  manifested 
RVADVs. 


(25)   'Clothes  got  torn  by  repeated  washing.' 

Literal:  'clothes  got  torn  (having  been)  washed  washed' 

THADQ:  pon  Fsaplsap  leh  alho  po] 

KABUI:  phai  sourang  sourang  khouroi  SOkathe 

G^NgTE:  pun  3  sDosap  a  asapsete 


(26)   'He  got 

tired  listening  to  songs' 

HINDI: 

vo  gan§  sun   sun   Kar  thak  gayi 

BgNGALI: 

ORIYA: 

se  gita  suni  sunl  thaki  gol.5 

ASSAMESE: 

hi  aan  huni  huni  ohaaorl  oorll 

SINDHI: 

gSyan  sambhari  sSmbhSfl  ne  thaki  gayo 

MARATHI: 

PUNJABI: 

0  a3na  sundS  sundS  thak  av3 

SADARI: 

MAITHILI: 

I2QSBI: 

KASHMIRI: 

bath  bozan  tx>zan  thok  su 

TAMIL: 

patai  Ketta  Ketta  avanaka  aluttu  vittada 

TELUQU: 

Itanu  pSta  vini  vini  alasipoyadu 

KANNADA: 

avanu  h5ru  kelli  kejM  sustada 

MALAYALAM: 

avan  patta  Ketlit  Kettit  vesamiccu 

CB&m 

5s  dandi  mente  mente  ansaras  keras 

KQOAnih 

ava  patahattaJsatta  cot  pod 
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MUNQ^Bi:  sni  duro  h  aim  Ketfl  aim  keta  lagaln 

BIRIJIA:  uni  slrifi  ^um  ^um  thakayena 

KHARIA:  hokkar  iJoh  ondor  ondor  thakke  gotki 

MEITEI:  mahak  Isai  ta-na  ta-na  coktha  ramml 

PAITE;  ami  la  nai  K]il2_ht^Ih  apukta 

LAHULI:  du  gura  rewS  rew5   chajilyS 

5.  Conclusion:  As  noted  In  3.  and  4.  above  simultaneity  constructions  using  RS  are 
fairly  restricted  in  Dravidian.   On  the  other  hand,  M  the  languages  under  consideration 
showed  iterative  construction  (using  RS).  The  iterative  value  therefore  seems  to  be  the 
most  pan-Indian  feature  of  reduplicated  structures. 

Moreover,  the  large  presence  of  RS  showing  simultaneity  and  the  near-total  spread 
of  Iterative  reduplicated  structures  in  genetically  unrelated  Indian  languages  adds  to  the 
already  available  evidence  for  South  Asia  as  a  convergence  area.  Further  work  Is 
required  in  neighboring  languages  such  as  those  of  South-east  Asia  and  Iran  to  determine 
to  what  degree  the  phenomenon  may  extend  beyond  South  Asia. 
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A  LEXICAL  PHONOLOGY  APPROACH  TO  HINDI  SCHWA  DELETION* 
Camllle  Bundrick 


In  this  paper  I  claim  that  D'Souza's  (1985)  non-linear  account  of  Hindi  schwa 
deletion  can  be  improved  upon  by  three  insights  that  lexical  phonology  has  to  offer. 
First,  the  need  for  D'Souza's  rule  of  initial  accent  is  eliminated  by  establishing 
prefixes  in  Hindi  as  phonological  words  with  regard  to  the  operation  of  cyclic 
phonological  mles.  Secondly,  speaker  variability  on  deletion  of  suffixal  schwas,  also 
accounted  for  in  D'Souza  by  the  rule  of  initial  accent,  is  explained  by  whether 
syllabification  and  foot  construction  apply  on  each  cycle  or  post-cyclically.  Finally,  by 
considering  schwa  reinstatement  as  a  post-cyclic  njle,  an  explanation  can  be  given 
for  D'Souza's  need  for  schwa  reinstatement  in  the  cases  where  speakers  exhibit 
variation  in  schwa  deletion  in  three-consonant  clusters. 


0.  Introduction 

Schwa  deletion  in  spoken  Hindi  is  a  complex  phenomenon  that  has  intrigued  linguists  for  a 
number  of  years.  One  reason  that  it  continues  to  do  so  is  that  the  phenomenon  is  not  clear  cut  ~  there 
is  often  variation  among  native  speakers  about  which  schwas  delete  and  which  ones  donl. 
Unfortunately,  past  analyses,  such  as  Ohala  (1983)  and  D'Souza  (1985),  have  run  into  the  difficulty  of 
adequately  delimiting  the  potential  deletion  sites  while  at  the  same  time  accounting  for  speaker 
variation  with  a  straightforward  phonological  rule,  without  resorting  to  a  list  of  exceptions.  The  problem, 
I  feel,  is  one  not  so  much  with  the  rule  as  with  the  phonological  framework  in  which  the  mle  is  framed. 

What  I  woukJ  like  to  propose  in  this  paper  is  that  schwa  deletion  in  Hindi  can  best  be  accounted  for 
in  a  framework  of  lexical  phonology  like  that  proposed  by  Kiparsky  (1982).  Mohanan  (1981)  and 
Rubach  (1984).  That  does  not  mean  that  the  previous  approaches  to  the  schwa  deletion  problem 
should  be  thrown  out.  Rather,  the  insights  into  the  functioning  of  phonological  rules  made  possible  by 
the  framework  of  lexical  phonology  can  be  successfully  combined  with  the  insights  already  made  in  the 
area  of  Hindi  schwa  deletion  to  eliminate  what  in  the  other  approaches  seem  to  be  exceptions. 

The  paper  will  be  structured  as  follows.  First  I  will  give  an  overview  of  the  schwa  deletion  facts  in 
Hindi.  This  will  be  followed  by  a  brief  description  of  lexical  phonology.  Then  I  will  explain  D'Souza's 
(1985)  non-linear  approach  to  Hindi  schwa  deletion.  Finally,  I  will  show  how  a  lexical  phonology 
treatment  of  schwa  deletion  handles  both  the  deletion  facts  and  what  are  treated  as  exceptions. 

1.  Overview  of  schwa  deletion  facts  in  Hindi 

Schwa  deletion  nwst  commonly  occurs  on  verb  and  noun  stems  when  a  vowel-initial  suffix  is 
added.  The  resulting  a  ~  0  alternation  can  be  seen  in  the  following  examples: 
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(1) 

Verb; 

YBibssai 

cast 

nkal 

niOfa: 

"come  out' 

pakar 

pakr+a: 

•catch" 

pifak 

pid<4a: 

'squeeze' 

(2) 

Noun: 

oblique  pi 

kamar 

kamr+C: 

•waist' 

ssbak 

sabk+«: 

'lesson- 

ke:s3r 

ke:sr+«: 

•saffron' 

Vowel-initial  suffixes,  however,  do  not  always,  trigger  schwa  deletion  in  the  stem.  The  schwa 
usually  is  not  deleted  if  the  deletion  woukJ  result  in  a  three-consonant  cluster.  That  is,  it  does  not  apply 
if  the  schwa  in  question  is  either  preceded  or  foltowed  by  a  consonant  cluster,  as  shown  betow: 

(3)  Schwa  preceded  t?Y  cluster: 


ncm.  sing. 

pustak  pustak+«:  'book' 

ki:rtan  ki:rtan+6:  'song' 

aksar  aksar-fO:  'letter* 

(4)  Schwa  followed  bv  cluster: 

nom.  sing.  oblique  pi. 

palaqg  palar)g-f«:  "bed' 

tilasm  tilasm-f<):  'magic' 

daraxt  daraxt+6:  Iree" 

Some  three  consonant  clusters,  however,  are  acceptable,  and  schwa  deletion  can  optionally  apply 
in  the  speech  of  some  native  speakers  in  spite  of  a  preceding  or  following  cluster.    For  exannple, 
alternations  as  in  (5)  exist  in  spoken  Hindi: 

(5)  jungal+i:  <->  jungl+i:  'wikJ' 

do:ng3r+i:  <-->  do:ngr+i:  'pertaining  to  hill  people' 

Another  fact  atx>ut  schwa  deletk)n  is  that,  even  though  it  does  apply  to  a  suffixed  stem,  it  does 
not  apply  to  prefixed  stems,  in  spite  of  the  fact  that  no  three  consonant  clusters  would  ensue: 


(6) 

be+par^+a:   ->  bepar^'a: 

(*beprha:) 

•unread^ 

a+samay     ~>  asamay 

(*asmay) 

•inopportune' 

a+mar+an  ->  amaran 

ramran) 

'until  death' 

In  addition,  schwas  tocated  in  suffixes  are  usually  exempt  from  deletion.  In  (7)  below,  the  schwa  in 
questbn  is  in  the  second  ntorpheme,  and  both  the  second  and  third  morphemes  are  considered 
suffixes: 
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(7)  k3la:+w3t+i:  — >  kalawati:  "name  for  a  girf 
kari:+g3r+i:  — >  kari:gari:  'craftsmanship' 
ekaki:+pan+a:  — >  ekaki:pana:  'lonesomeness' 

There  is,  however,  some  speaker  variation  on  deletion  of  schwas  in  suffixes.  Ohala  (1983)  cites 
the  following  example,  which  will  be  dealt  with  in  greater  detail  in  section  4. 

(8)  /bana:+wat+i:/   — >  banawti:  '\^  banawati: 
"artificial"   ("make"  +  nom.  suff .  +  i:) 

To  sum  up,  there  are  three  facts  about  Hindi  schwa  deletion  that  are  important  for  a  lexical 
phonology  treatment.  First,  there  is  speaker  variation  as  to  which  three-consonant  clusters  are  an 
acceptable  result  of  schwa  deletion.  Secondly,  schwa  is  deleted  only  in  a  suffixed  stem,  but  never  in 
a  prefixed  stem  Finally,  schwas  located  in  suffixes  are  usually  exempt  from  deletion,  except  for  cases 
such  as  (8)  above,  in  which  speaker  variation  exists.  After  giving  a  brief  background  of  lexical 
phonology  in  section  2,  I  will  discuss  D'Souza's  non-linear  approach  to  schwa  deletion  in  section  3.  In 
section  4,  I  will  return  to  these  three  facts  about  schwa  deletion,  drawing  on  D'Souza's  and  Ohala's 
insights  for  a  lexical  phonology  treatment  of  this  phenomenon. 

2.  Lexical  Dhonoloov:  a  brief  background 

Although  lexical  phonologists  differ  in  the  actual  nuts-and-bolts  description  of  the  theory,  several 
aspects  remain  constant  across  the  different  approaches  to  the  theories.  The  first  of  these  is  that  both 
morphological  and  lexical  boundaries  are  relevant  to  the  application  of  phonological  rules.  Each 
addition  of  a  nnorpheme  to  a  root  creates  a  cycle,  at  which  phonological  rules  apply  to  the  part  of  the 
word  that  has  been  formed  on  that  cycle.  These  phonological  rules  operate  either  alongside  word 
formation,  i.e.  after  each  morpheme  is  added,  or  after  word  formation  is  complete,  with  the  order  of  the 
addition  of  each  morpheme  made  available  through  bracketing.  The  result  of  the  application  of  the 
cyclic  rules  on  all  cycles  is  phonological  words.  After  phonological  words  have  been  obtained, 
postcyclic  rules  apply,  resulting  in  the  phonetic  representation.  Also  cnjcial  to  lexical  phonology  is  the 
notion  of  strict  cyclicity.  According  to  Rubach  (1984:12),  the  strict  cyclicity  principle  "requires  that  part 
of  the  structural  description  of  the  rule  must  be  present  in  the  domain  of  another  cycle,  i.e.  no  cyclic 
rule  may  apply  to  structures  internal  to  a  single  cycle."  The  implication  of  this  principle  is  that  cyclic  rules 
cannot  apply  in  morpheme-internal  positions. 

3.  D"Souza"s  non-linear  account  of  Hindi  schwa  deletion 

D'Souza  (1985)  gives  an  excellent  account  of  Hindi  schwa  deletion  within  the  framework  of 
non-linear  phonology  in  terms  of  syllable  structure  and  metrical  feet.  Her  account  utilizes  a  core 
skeleton  consisting  of  unlabelled  X  slots  to  which  vowels  and  consonants  are  linked,  as  shown  below: 

/l/i/A 

(9)  XXXXXXX 


ii: 


3ak6:  'lessons'  Ka:nta: 


sabak6:  'lessons'  ka:nta:  "wife* 

Vowels  fonn  the  syllable  head  (denoted  with  a  solid  line)  to  which  consonants  are  incorporated 
(shown  by  broken  lines)  in  the  following  order: 

I  '1  i  f 

(10)  First  onset:      XX  — >  XX,    then  coda        XX  ->  XX 
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Long  vowels  are  represented  by  two  X  slots  and  indicated  thus: 


i 


The  foot  constmction  rule  is:  'Build  left  dominant  quantity  sensitive  feet  beginning  at  the  left  edge 
of  the  word.'  That  means  that  the  first  syllable  of  a  word  automatically  receives  a  strong  branch  of  a  foot, 
represented  by  ?  .  In  addition,  any  heavy  syllable,  meaning  a  syllable  consisting  of  a  long  vowel  or  a 
vowel  plus  consonant,  will  be  marked  as  strong.  All  other  syllables  will  then  be  on  a  weak  branch.  Using 
the  same  examples  as  above,  I  will  illustrate  foot  construction: 

XX>0OO(X 

Idirta: 
t    t 

In  sabakO:.  the  first  syllable,  £3  automatically  receives  a  strong  foot  (f )  via  the  left  dominant  foot 
constmction  rule.  The  second  syllable  ba,  since  it  does  not  contain  a  long  vowel  or  end  in  a 
consonant,  receives  a  weak  foot.  The  third  syllable  Jsfi:  has  a  long  vowel  and  is  therefore  on  a  strong 
foot.  The  word  ka:nta:  has  only  bng  vowels,  and  so  no  weak  feet  are  permitted.  On  the  basis  of  this 
metrical  analysis,  D'Souza  formulates  the  following  rule  to  account  for  schwa  deletion: 


1        ¥ 

(12)   schwa  desyjiabification      x  — >   x  / 


a 


I   V 


What  this  rule  states  is  that,  if  a  schwa  is  located  on  the  weak  branch  of  a  foot,  then  the  association 
line  marking  it  as  a  syllable  head  is  disassociated.  The  way  this  rule  deletes  a  schwa  in  the  phonetic 
representation  is  to  assume  that  any  element  not  in  a  syllable  at  surface  structure  is  not  phonetically 
realized  (i.e.  not  pronounced). 

The  folbwing  derivation  illustrates  these  rules: 

.•II  ■■A 

(13)     XXXXXXX  Since  the  second  schwa  is  on  a  weak  branch, 

my 

sabakd:  schwa  desyllabification  applies,  yielding  (14): 


vt 
LA 


(14)     XXXXXXX  The  association  line  for  the  second  schwa  is  dis- 

sabakO:  associated,  leaving  the  consonant  prevk)usly  asso- 


V 


associated  with  it  (i.e.  (b])  stranded. 


Syllabification  rules  then  reapply,  and  the  stranded  consonant  [b]  is  linked  to  the  preceding 
syllable.  The  schwa  remains  unsyllabified  and  therefore  is  not  pronounced: 


(15)     XXXXXXX 
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-    sabkS: 

VT 

This  theory  also  nicely  handles  the  cases  where  a  schwa  is  followed  by  a  single  consonant,  as 
shown  in  the  examples  in  (16): 

re  <"  r 

/]   •[•  /I  /h 

(16)     XXXXX  XXXXX 

wwi         win 

nikal  picak 

ft  ft 

Within  a  quantity-sensitive  framework  as  proposed  by  D'Souza,  such  a  schwa  would  automatically 
be  on  a  strong  foot  (long  vowels  and  closed  syllables  are  on  strong  feet),  and  thereby  never  be  input  to 
the  schwa  desyllabification  rule. 

Two  problems  for  D'Souza's  mie  of  schwa  desyllabification  arise  when  schwa  deletion  would  result 
in  a  three-consonant  cluster  (See  examples  (3),  (4),  and  (5)  above.)  First,  one  exception  to  D'Souza's 
schwa  desyllabification  rule  exists  when  the  deletion  of  a  schwa  preceded  by  a  consonant  cluster  is 
subject  to  speaker  variation,  as  in  (5)  (repeated  below),  and  the  speaker  does  not  delete  the  schwa: 
r    <-    <-  r    r    r 

(5)      xxxxxxxx  xxxxxxxxx 

\llllll-'  11-7/// F 

jur)gal+i:  do:r}gar+i: 

V  I  V  t 

Oungali:]  ~    [jungli:]  (do:ngari.]  ^  [do:rigri:] 

D'Souza's  rule  predicts  that  these  schwas  will  always  delete,  because  they  are  on  a  weak  foot. 
Thus,  the  speakers  who  do  not  delete  the  schwa  must  b)e  accounted  for.  A  second  problem  arises 
when  a  schwa  is  preceded  by  a  consonant  cluster  and  followed  by  a  single  consonant  (eg.  [pustakO;]) 
According  to  the  rule  of  foot  building,  these  schwas  (preceded  by  a  cluster  and  foltowed  by  a  single 
consonant)  are  on  a  weak  foot,  if  the  following  consonant  fomns  a  syllable  with  a  folkjwing  vowel.    But, 
even  though  it  is  on  a  weak  foot,  a  schwa  in  this  environment  does  not  delete.  To  account  for  the 
unacceptable  cases  of  schwa  deletion  resulting  in  a  three-consonant  clusters,  D'Souza  devises  a 
schwa  reinstatement  rule,  which  resyllabifies  the  schwa  for  those  speakers  who  don1  delete  it.  This 
rule  is: 


(17)  schwa  reinstatement  x->x/x 

\       1 

a        a 
(where  x'  denotes  an  unsyllabified  stot) 
The  derivation  for  pust3k->-6:.  using  the  schwa  reinstatement  mie,  would  be: 

.••T-./l/A  :'\\     /A 

(18)  XXXXXXXX  schwa  desylabificatbn       XXXXXXXX 

Vt  Vi 
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Schwa  resyllabification  then  applies,  yielding 

:\\    \:h  .'T'./l/A 

(19)  XXXXXXXX  core  syllabification  XXXXXXXX 

m    ->      w 

Another  case  where  D'Souza's  analysis  predicts  schwa  deletion,  but  where  it  does  not  apply  is 
across  a  morpheme  boundary  to  the  left,  as  in: 
<■  f    i- 

(20)  XXXXXXX 


V  t 
r./i'/r'. 

XXXXXX 

IIWW 

a+mar+an 

V  1 


'until  death' 


In  spite  of  the  appropriate  foot  structure,  where  the  schwas  are  predicted  to  be  on  a  weak  branch, 
the  schwas  do  not  delete.  For  these  examples  D'Souza  posits  a  rule  of  Initial  accent,  which  states 
that  'Every  morpheme  initial  syllable  is  the  head  of  a  foot.'  (p.  21)  Thus,  instead  of  the  foot  structures 
given  in  (20),  the  foot  structure  for  these  examples  would  be: 


(21)      XXXXXXX  XXXXXX 


.'l/l/f'. 


am.     iM\ 

\  \   \         \  \  \ 

The  reason  for  postulating  such  a  rule  is  to  prevent  root-initial  schwas  from  ever  being  in  the  right 
environment  for  the  application  of  schwa  desyllabification,  thus  preventing  the  application  of  the  rule  in 
these  cases. 

4.  A  lexical  phonology  analysis  of  Hindi  schwa  deletion 

An  analysis  of  Hindi  schwa  deletion  seems  ripe  for  a  lexical  phonology  approach  for  two  reasons. 
First,  when  accounting  for  the  facts  of  schwa  deletion,  crucial  reference  must  be  made  to  morpheme 
boundaries,  in  that  certain  morpheme  boundaries  in  Hindi  prevent  the  application  of  the  schwa 
deletion  rule.  Secondly,  the  process  of  schwa  reinstatement  discussed  in  D'Souza  can  be  viewed  as  a 
natural  postcyclic  process  in  a  lexical  phonology  framework.  I  will  first  discuss  the  involvement  of 
morpheme  boundaries  in  the  njle  of  Hindi  schwa  deletion  before  returning  to  the  question  of  schwa 
reinstatement. 

As  mentioned  earlier,  one  morpheme  boundary  that  is  always  taken  into  account  by  speakers  of 
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Hindi  is  the  one  separating  a  prefix  from  tfie  root.  Following  Rubach  (1984)  and  his  analysis  of  Polish 
prefixes,  I  will  claim  that  prefixes  in  Hindi  are  'phonological  words,'  and  therefore  do  not  take  part  in  the 
cyclic  phonological  rules  resulting  from  suffixation.  Recall  that,  although  prefixes  in  Hindi  create  the 
proper  environment  for  schwa  deletion  in  roots,  the  schwa  does  not  delete.  To  illustrate: 

(22) 


UR: 

parh 

'read'   present  tense 

affix: 

pat^+a: 

'read'  past  tense 

affix: 

[be]lparh+a:] 

'unread' 

When  affixed  in  this  way,  prefixes  can  never  trigger  deletion,  whether  syllabification  and  foot 
building  apply  lexically,  i.e.  after  each  affixation,  or  post-lexically.  Because  the  prefix  is  considered  to 
already  be  a  separate  phonological  "word,"  it  is  therefore  not  subject  to  the  cyclic  rules  of  the  word 
formation  process.  A  similar  process  occurs  in  English,  accounting  for  the  differences  in  the 
phonological  behavior  of  the  prefixes  iih  and  uTh-  Although  the  [n]  in  the  prefix  irt  assimilates  to  a 
following  liquid  or  stop  (illegal,  irreplaceable,  impolite,  etc.),  the  [n]  in  uiL  never  assimilates  (unlavyful. 
unpopular).  Thus  it  has  been  claimed  that  uiL  is  a  phonological  word,  and  as  such,  is  prevented  from 
undergoing  the  cyclic  assimilation  process  that  ioi  undergoes. 

Another  area  where  schwa  deletion  in  Hindi  is  sensitive  to  morpheme  Ijoundaries  concerns 
schwas  in  suffixes.  D'Souza's  account  handles  these  seeming  exceptions  with  the  mle  of  initial  accent, 
which  specifies  that  every  morpheme-initial  syllable  be  the  head  of  a  foot.  Thus,  the  schwa  in  a  word 
which  exhibits  speaker  variation,  such  as  /bana:+w3t+i7  (see  (8)  above  or  (23)  and  (24)  below  for  the 
variation),  could  never  delete,  since  it  is  not  on  a  weak  foot.  Schwas  in  suffixes  which  do  delete  thus 
raise  problems  for  D'Souza's  approach.  Apparently,  the  rule  of  initial  accent  is  not  adequate  to  account 
for  these  data.  M.  Ohala  (1983),  in  her  very  extensive  linear  phonology  account  of  Hindi  schwa 
deletion  deals  with  this  type  of  variation  in  a  different  way.  Ohala's  explanation  for  the  difference  in 
pronunciation  is  that  some  speakers  'know  about"  the  morpheme  boundaries  and  others  do  not  (p. 
133).  That  is,  those  who  know  about  the  morpheme  boundary  would  say  [banawati:],  whereas  those 
unaware  of  the  boundary  would  say  [banaiwti:].  If  we  assume  that  Ohala  is  con-ect,  then  an  analysis 
incorporating  D'Souza's  account  into  a  lexical  phonology  framework  is  possible.  For  the  speaker 
unaware  of  the  morpheme  boundary  (Speaker  A  below),  the  pronunciation  [bana:wti:]  could  be 
derived  by  first  applying  word  formation  mies  and  then  doing  syllabification  and  building  feet.  But  for 
speakers  aware  of  the  morpheme  boundary  (Speaker  B  below),  the  pronunciation  [bana:w8ti:l  could 
only  be  obtained  by  first  syllabifying,  building  feet,  and  then  doing  word  fomnation.  In  other  words,  for 
Speaker  A,  schwa  deletion  is  a  post-lexical  rule,  whereas  for  Speaker  B,  it  is  a  lexical  rule.   A  derivation 
of  the  two  pronunciations  of  /b3na:-i-w3t-i-i:/  will  illustrate  this  point: 


(23)     Speaker  A    (unaware  of  morpheme  boundary) 
UR:  bana: 

affix.:  bana:+wat 


affix: 
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syHslruc.  .4.1  /\;1 

&foot  str.  banaiwat'i: 

t  V  I 


A.'h  .'1 

a<Jesyll.  banaiwati: 

Sresyll.  f  '\/  ] 

PR:  [banaAwti:] 

(24)        Speaker  B     (aware  of  rrwrpheme  boundary) 

UR:  bana: 

affix.:  [bana:]wat 

'^  r    <■ 
syllab.  &  ;'!  ;'|    /f; 

foot  str.  [[baria:]wat] 

\   \     \ 
affix.:  [[[bana:]wat]i:] 


syll.struc.  ;1  .'|    /f.  | 

&  foot  str.  [[[bana:]wai]i:] 

T  t    1  1 

PR:  [bana:wati:] 

By  allowing  the  syllable  structure  and  foot  building  mles  to  apply  either  on  every  cycle  or  after  the 
cyclic  word  formation  mles  have  applied,  we  can  account  for  the  two  different  pronunciations  rendered 
by  native  speakers  of  Hindi. 

As  a  final  point,  recall  from  the  earlier  discussion  that  a  schwa  surrounded  by  three  consonants 
sometimes  deletes  and  sometimes  doesnl,  depending  upon  whether  the  speaker  finds  the  resulting 
three-consonant  cluster  acceptable.  (See  (5)  at)ove.)    D'Souza's  approach  tries  to  account  for  the 
differing  pronunciations  given  by  different  speakers  by  proposing  a  schwa  reinstatement  rule  for 
speakers  who  do  not  accept  the  triple  cluster.  Since  the  X  slot  linked  to  schwa  in  her  account  "holds"  the 
place  where  the  schwa  was  disassociated,  schwa  reinstatement  requires  only  a  simple  rule  of 
resyllabification. 

Utilizing  aspects  of  D'Souza's  theory  then,  I  would  like  to  suggest  that  schwa  reinstatement  is  a 
postlexical  rule.  In  that  way,  the  schwa  can  be  disassociated  by  the  rules  of  syllabification  and  foot 
building  during  the  cyclic  word  formation  process.  Then,  since  the  X  slot  still  holds  the  timing  slot  for 
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the  disassociated  schwa  at  the  postcyclic  level,  the  schwa  can  be  reinstated  to  break  up  unacceptable 
clusters  on  an  individual  basis.  Since  the  native  speakers  of  Hindi  that  Ohala  cited  differed  on  the 
clusters  each  felt  were  acceptable,  this  seems  to  be  the  only  tenable  approach. 

5.  Conclusion 

To  sum  up,  there  are  three  main  improvement^^;  that  a  lexical  phonology  approach  offers  to 
D'Souza's  non-linear  account  of  Hindi  schwa  deletion.  First,  by  establishing  prefixes  in  Hindi  as 
phonological  words  as  far  as  cyclic  phonological  mles  are  concemed,  the  need  for  the  application  of 
D'Souza's  rule  of  initial  accent  to  account  for  why  the  schwas  in  prefixed  stems  do  not  delete  is 
eliminated.  Secondly,  the  variability  of  speakers  deleting  suffixal  schwas  is  explained  by  whether 
syllabification  and  foot  constaiction  apply  on  each  cycle  or  post-cyclically.  Finally,  by  considering  schwa 
reinstatement  as  a  post-cyclic  rule,  an  explanation  can  be  given  for  the  speaker  variation  in  schwa 
deletion  in  three-consonant  clusters. 


NOTE 

*l  would  like  to  thank  Laura  Downing,  Chuck  Kisseberth.  and  Hyang-Sook  Sohn  for  their  valuable 
comments  on  earlier  drafts  of  this  paper.  Of  course,  I  accept  responsibility  for  all  errors  and  omissions. 
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mi-MOVEMENT   IN   HINDI-URDU   RELATIVE   CLAUSES* 

Alice  Davison 


Relative  clauses  in  Hindi-Urdu  do  not  have  obligatory  nx>vement  of  the  relative 
NP  to  pre-sentence  COMP  position  in  surface  structure,  unlike  in  English  and  other 
languages.  Yet  relative  NPs  in  situ  may  have  semantic  scope  over  more  than  the 
clause  containing  the  relative  NP,  just  as  though  it  was  moved  to  a  c-commanding 
COMP  position  determining  the  semantic  interpretation  of  the  relative  and  other 
clauses.  Further,  the  possibility  of  having  scope  over  more  than  one  clause  is  limited 
by  syntactic  factors,  like  the  Bounding  restrictions  (or  subjacency  requirement) 
limiting  syntactic  movement.  The  restrictions  in  Hindi-Urdu  are  based  on  the  relation 
of  the  internal  clause  to  a  governor.  Hindi-Urdu  is  a  head-final  language,  with 
government  to  the  left.  Relative  NPs  may  have  multiple-clause  scope  over  left- 
branching  clauses.  It  is  proposed  that  in  Hindi-Urdu,  the  anatogue  of  ^-  movement 
takes  place  in  the  derivation  of  Logical  Form,  rather  than  S-Structure,  and  is  limited 
by  a  government-based  restriction  rather  than  Subjacency. 


Introduction:  The  main  purpose  of  this  paper  is  to  contrast  the  syntactic  stmctures  of 
languages  like  English,  with  obligatory  ^-  movement,  with  languages  like  Hindi-Urdu  which 
altow  in  situ  relative  phrases.  As  we  will  see  shortly,  though  English  and  Hindi-Urdu  differ  in 
surface  syntactic  structure,  the  relative  phrases  have  in  common  the  same  semantic  scope 
relations  and  are  subject  to  analogous  constraints  on  possible  scope  relations.  The  principal 
issue  in  this  paper  will  be  the  syntactic  representation  of  semantic  scope  relations,  and  how 
conditions  on  possible  scope  relations  are  represented  in  English  and  Hindi-Urdu. 

In  this  paper,  I  will  be  discussing  relative  clauses  of  the  form  shown  in  (1)  t>elow: 

(1)  [laRkee-nee  |oo  kitaabi  curaaii]  (voo  usee;  vaapas  kar  deegaa) 
boy-erg.  which  book  steal-pert,  he  it-DAT  return  give-fut. 
The  boy  will  retum  the  book  [whichj  he  stole  e;].' 

The  square  brackets  enclose  the  relative  clause,  which  will  be  the  main  focus  of  the 
paper.  Within  the  relative  clause,  the  boldface  relative  NP  joo  kitaab  "which  book'  is 
surrounded  by  the  other  constituents  of  the  relative  clause.  It  is  not  moved  obligatorily  to  the 
left-nrx}st  position  in  the  clause  (although  there  is  a  tendency  for  relative  NPs  to  occur  in  first  or 
second  positk>n  in  the  clause).  In  the  sentence  in  (1),  the  relative  NP  occurs  in  the  same 
preverbal  positbn  that  it  woukj  occupy  in  a  non-relative  clause  by  virtue  of  being  a  direct 
object. 

In  this  respect,  the  relative  NPs  of  Hindi-Urdu  contrast  sharply  with  English.relative 
operators  like  which,  as  illustrated  in  (2)  and  (3).  The  English  sentence  in  (2)  is  ill-formed 
because  the  relative  phrase  which  occurs  in  situ,  in  the  direct  object  position  which  the 
con-esponding  non-relative  NP  wouki  occupy  in  an  ordinary  clause.  Instead,  the  relative 
phrase  must  be  moved  to  a  left-most,  preclausal.  or  COMP  positton;  cf.  (3). 
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(2)  "The  boy  will  return  the  book  [coMP  [he  stole  which.]] 

(3)  The  boy  will  return  the  book  [cOMPWhIch  [he  stole  e.]] 

In  Hindi-Urdu,  relative  phrases  may  occur  in  situ,  and  are  not  obligatorily  moved  to  COMP 
position  in  the  formation  of  surface  syntactic  structure  (S-Structure).  In  English,  relative 
phrases  must  be  moved  to  COMP  in  the  derivation  of  S-Structure. 

The  reference  of  the  relative  phrase,  or  its  semantic  scope,  is  determined  in  English  by 
its  syntactic  position  in  COMP.  Since  COMP  is  a  sister  of  S,  immediately  dominated  by  S',  the 
relative  phrase  c-commands  S  and  all  its  subconstituents.  In  English,  the  semantic  scope  of  a 
relative  phrase  corresponds  to  what  the  relative  c-commands  at  S-Structure.  Since  structural 
relations  established  in  S-Structure  carry  over  to  Logical  Form  (LF)  (unless  some  allowable 
change  occurs),  the  c-command  relation  also  holds  at  LF,  which  is  the  input  representation  to 
semantic  interpretation.  We  can  now  make  the  first  assumption  about  languages  with  relative 
phrases  in  situ  in  S-Structure:  In  these  languages,  the  semantic  scope  of  relative  phrases  will 
be  determined  in  the  same  way  as  in  languages  with  syntactic  movement,  that  is,  by  the 
syntactic  relation  of  c-command.  However,  this  will  hold  only  in  Logical  Form,  a  level  of 
syntactic  representation  derived  from  S-Structure  by  the  application  of  syntactic  movement 
(cf.  Huang  (1982)). 

Semantic  and  syntactic  scope:   In  languages  like  English,  with  obligatorily  moved 
relative  phrases,  the  position  of  the  relative  phrase  in  S-Structure  has  semantic  conse- 
quences. These  are  not  immediately  evident  in  the  case  of  a  simple  monoclausal  relative 
illustrated  in  (I)  and  its  English  counterpart  in  (3).  Nevertheless,  the  reference  of  the  relative 
NP  in  (1)  or  (3)  is  determined  with  reference  to  the  clause  in  which  it  originates:  It  isnl  just 
SOME  book  which  is  referred  to,  but  the  book  having  the  property  that  the  boy  stole  it.  The 
interpretation  of  which  in  (4a).  is  still  different.  The  reference  of  the  relative  phrase  here  is 
determined  both  by  the  clause  of  origin  and  by  the  clause  containing  that  clause,  they  think 
SI. 

(4)  (a)   The  book  [which  [they  think  [that  he  stole  e]]]  was  mine. 

(b)  They  think  [that  the  book  [which  [he  stole  e]]]  was  mine. 

In  sentence  (4b),  the  reference  of  which  is  again  as  in  the  simple  structures,  (1)  and  (3), 
even  though  the  sentence  contains  both  the  clauses  he  stole  e  and  the  matrix  clause  they 
think  S'.  The  position  of  which  in  S-Structure  is  what  conveys  the  semantic  scope  of  the 
relative  operator.  Assuming  that  each  clause  S'  consists  of  COMP  and  S,  then  the  semantic 
scope  of  the  relative  NP  corresponds  to  the  COMP  position  which  the  relative  phrase 
occupies. 

in  the  case  of  monoclausal  relative  structures  like  (1),  the  relative  phrase  has  scope  over 
its  clause  of  origin.  But  in  more  complex  cases  such  as  (5),  the  relative  phrase  jis  aadmil 
koo  ^fvhich  man  (dat.)"  has  scope  over  both  an  internal  clause  (Si)  and  its  matrix  clause  (S2): 

(5)  [pulis    [jis  aadmii-koo  pro  bam     rakhnee-kaa   doosii  maantii  hai]] 
police    which  man-DAT  tjomb  place-inf .  of      guilty  t)elieve  is 
S1         S2              ^ 

(mujhee  pakkaa  visvaas  hai  ki  voo  nirdoos   hai.) 

I-DAT        fimi       tjelief     is  that  he  innocent     is 

'i  firmly  believe  that  the  man  is  innocent  [who  [the  police  believe  (e  to  t>e  guilty  of 

setting  the  bomb]]] 
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The  relative  NP  is  in  situ  inside  a  subordinate  clause  In  S-Structure,  as  shown  in  (5),  but  it 
will  be  assumed  to  move  to  the  COMP  position  of  the  matrix  clause  in  the  derivation  of  Logical 
Form.  There,  its  semantic  scope  is  determined  with  reference  to  its  syntactic  position  in  a 
syntactic  and  semantic  representation.  How  this  occurs  and  what  conditions  it  is  subject  to  will 
be  shown  in  more  detail  shortly. 

Returning  to  languages  like  English  which  have  obligatory  3^-nx)vement,  we  note  that 
syntactic  nrx)vement  from  the  site  of  origin  to  a  position  in  COMP  is  assumed  to  take  place  in 
very  specific  ways,  subject  to  conditions  on  well-formed  movement.  This  is  illustrated  in  the 
tree  structure  in  (6): 

(6)    Syntactic  nfx>vement  to  COMP  in  the  derivation  of  S-Structure: 


In  D-structure,  the  representation  in  which  lexical  subcategorization  Is  satisfied  (cf 
Chomsky  1981),  the  relative  phrase  which  originates  in  a  syntactic  direct  object  positton  and 
so  satisfies  the  subcatgeorization  of  steal  for  an  object.  It  receives  the  semantic  theta  role  of 
'patient'  which  steal  assigns  to  that  position.  Since  this  position  has  an  associated  theta 
position,  and  since  an  NP  may  have  one  and  only  one  theta  role  assigned  to  it  (Chomsky 
1981),  the  NP  may  nrx>ve  only  to  a  position  to  which  no  theta  role  is  assigned.  COMP  is  one 
such  position.  Movement  is  to  the  nearest  COMP  position,  or  else  the  subjacency  condition 
would  be  violated.  This  condition  stipulates  that  between  the  source  of  a  nwved  NP  and  its 
destination  or  landing  site,  only  one  cyclic  node  (NP  or  S)  may  intervene.  Movement  from 
COMP  may  only  be  to  another  COMP  site,  because  of  the  nature  of  the  empty  category  created 
by  wh-movement,  which  can  have  an  antecedent  only  outside  of  S  (Chomsky  1981 ,1982). 
Hence  the  two  instances  of  nrwvement  in  (6)  are  well-formed.  The  interpretation  in  which  the 
relative  phrase  has  scope  over  two  clauses  is  based  on  a  well-formed  syntactic  stmcture  in 
which  the  relative  has  come  to  c-commmand  two  clauses  from  COMP. 

A  very  similar  Hindi-Urdu  sentence  is  illustrated  in  (7): 

(7)    [tum       (PRO  joo  kitaabee  paRhnaa]  caahtee  hoo]  vee  ghum    gaii  haf. 

you-fam.  which  books  read-inf.     wart     be    they  wander  goi^erf.  are 

Si         82 
The  books  [  which  [you  want  (PRO  to  read  e]]]have  gotten  lost.' 


In  this  sentence  the  relative  phrase  ]oo  kitaabe  e  also  has  scope  over  two  clauses:  its 
own  clause  of  origin  and  the  matrix  clause.  Though  no  syntactic  movement  has  occurred  in 
the  derivation  of  the  S-Structure  (7),  let  us  assume  that  in  Hindi-Urdu,  as  welt  as  in  other 
languages,  semantic  scope  is  determined  by  syntactic  scope.  Since  the  semantic  scope  of 
the  relative  phrase  includes  the  matrix  as  well  as  subordinate  clause,  this  assumption  means 
that  Joo  kitaabe  e  must  £-command  both  clauses  at  some  level  of  representation,  in  this 
case,  in  Logical  Form.  The  tree  structure  in  (8)  illustrates  syntactic  movement  of  a  relative 
phrase  to  COMP  in  LF: 

(8)    Syntactic  movement  in  the  derivation  of  Logical  Form: 

S* 


The  derivation  shown  in  (8)  is  very  similar  to  the  one  in  (6).  Many  of  the  conditions  on 
well-formed  derivations  also  hold  here.  Movement  takes  place  from  a  theta-marked  position, 
the  position  of  the  direct  object  of  paRhnaa  'read-inf .'  to  a  non-theta  position  in  COMP  of  the 
embedded  clause,  and  then  to  the  COMP  of  the  matrix  clause.  This  meets  the  theta  criterion 
which  allows  no  more  than  one  theta  role  to  be  assigned  to  an  NP.  For  reasons  which  cannot 
be  given  here  in  detail  because  of  limitations  of  space,  we  cannot  incorporate  Subjacency 
directly  into  the  grammar  of  Hindi-Urdu  as  a  well-formedness  condition  on  movement  in  LF. 
Instead,  I  will  discuss  below  the  specific  conditions  on  wide  and  narrow  scope  readings  which 
are  realized  in  Hindi-Urdu. 

Before  going  on  to  a  statement  of  well-formedness  conditions,  I  want  to  comment  on  the 
syntactic  structure  of  the  embedded  clause  given  in  (8)  for  the  sentence  in  (7),  particularly  the 
presence  of  COMP  in  the  embedded  clause.  This  clause  differs  from  the  embedded  clause  in 
(6)  in  an  important  way.  In  the  English  sentence  in  (6),  the  embedded  clause  is  the  object  of 
think,  which  takes  either  a  finite  sentential  complement,  as  in  (6),  or  a  non-finite  'small  clause' 
such  as  the  NP-NP  complement  in  (9). 

(9)    We  think  [John  a  fool]. 

In  Hindi-Urdu,  the  vertj  caahnaa  'to  want'  takes  a  non-finite  infinitive/gerund  complement, 
with  a  null  subject  PRO,  which  must  be  coindexed  with  the  matrix  subject,  as  in  (7).  This 
complement  clause  must  precede  the  verb.  (The  same  verb  also  allows  a  finite  clause  with  a 
lexically  realized  subject,  not  necessarily  Identical  to  the  matrix  subject.  We  will  return  to  this 
stnjcture  below.) 
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H  the  object  of  the  matrix  verb  Is  an  infinitive/gerund  in  IHindi-Urdu,  it  might  be  argued  that 
such  complements  have  clause-like  structure,  with  subjects  and  objects,  but  are  not  really 
clauses.  In  particular,  we  might  ask  if  the  complement  has  a  COMP  position  dominated  by  S'.  If 
CX)MP  is  absent  from  the  complement  clause,  its  absence  would  explain  immediately  why  the 
relative  phrase  Joo  kitaabee  has  wide  scope  over  two  clauses,  and  in  fact  MUST  have  wide 
scope.  It  appears  in  the  matrix  CX)MP  because  it  has  nowhere  else  to  go.  if  this  hypothesis  is 
right. 

But  there  is  evidence  that  this  explanation  is  not  con-ect  for  all  complement  clauses. 
There  are  SOME  complement  clauses  for  which  it  is  correct,  but  these  have  special  properties. 
The  sentence  in  (5),  reproduced  below,  is  one  such  example  of  a  non-finite  complement 
clause  which  lacks  S'.  Its  LF  structure  is  shown  in  (10). 

(5)    [pulis    [jis  aadmll-koo  [PRO  bam     rakhnee  -kaa  doosii  maantii  hai]]] 
police    which  man-DAT  bomb  place-inf.  of     guilty  believe  is 

Si        S2  S3 

(mujhee  pakkaa  visvaas  hai  ki  voo  nirdoos       hai.) 
I-DAT  firm       belief     is  that  he  innocent    is 

'I  firmly  believe  that  the  man  is  innocent  [whoj  [the  police  believe  [ej  to  be  guilty  of 
setting  the  bomb]]] 


maantihai 


The  relative  phrase  is  extracted  from  subject  position  of  the  complement  of  maantii  hai 
'considers,  believes'.  The  relative  NP  has  an  unusual  marking  for  a  subject,  the  dative- 
accusative  postposition  -koo.  This  postposition  is  'exceptionally'  assigned  by  verbs  like 
maannaa  lo  consider,  samajhnaa  'understand,  know',  deekhnaa  1o  see*,  to  the  subject  of  the 
complement  clause.  Case  is  usually  assigned  by  verbs  within  a  clause,  because  clause 
boundaries,  or  S',  block  case  assignment.  So  this  type  of  case  marking  is  anomalous  without 
some  special  provision  which  allows  it.  As  the  tree  structure  in  (10)  illustrates,  the  special 
provision  is  the  deletion  of  the  S'  and  COMP  of  the  embedded  clause  complement  to  maantii 
hai  The  verb  of  the  matrix  clause  governs  the  complement  clause  subject  position  and 
assigns  it  case,  since  no  S'  node  intervenes. 
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If  we  extend  this  analysis  to  other  cases,  however,  we  find  that  it  contradicts  other 
principles  of  the  grammar.  The  sentence  in  (7)  and  (8)  has  a  non-finite  complement  with  a  null 
subject  or  PRO.  This  kind  of  empty  category  occurs  ONLY  in  ungoverned  positions,  to  which 
no  case  is  assigned.  Infinitive  subjects  are  typically  null  and  ungoverned  because  the 
governor  and  case  assigner  of  the  subject  position  in  finite  clauses  is  absent  from  non  finite 
clauses— this  is  INFL  or  the  inflection  and  agreement  constituents  of  finite  clauses.  When 
PRO  occurs  in  the  subject  position  of  a  non-finite  clause,  it  is  govemed  neither  by  an  element 
of  its  own  clause  nor  by  any  element  of  the  matrix  clause.  PRO  is  protected  by  S'  from  being 
govemed  by  a  matrix  clause  verb.  Referring  back  to  the  proposed  LF  structure  of  (7)  in  (8),  we 
see  that  the  presence  of  S'  in  the  emljedded  clause  protects  PRO  from  being  governed  in  the 
matrix  clause. 

(8) 


Since  there  are  theoretical  reasons  to  assume  that  S'  is  present,  it  is  natural  to  assume 
that  also  COMP  is  present  in  the  embedded  sentence.  Hence  we  will  represent  all  embedded 
sentences  as  having  S'  and  COMP  unless  there  are  some  special  reasons  (as  in  the 
Exceptional  Case  Marking  contexts  of  (5)  and  (10))  to  believe  that  S'  is  deleted. 

Conditions  on  well-formed  movement  to  COMP:   We  now  return  to  the  question 
of  what  special  restrictions  there  are  on  the  movement  of  the  relative  operator  to  COMP.  For 
Hindi-Urdu,  it  appears  that  semantic  scope,  dependent  on  syntactic  scope,  is  correlated  with 
the  position  and  role  of  the  clause  of  origin.  There  are  verbs  in  the  language  which  allow  two 
kinds  of  complement.  One  type  is  the  kind  just  discussed,  a  non-finite  clause  which  occurs  in 
preverbal  position,  as  in  (5)  and  (7).  The  same  matrix  clause  verbs  allow  another  kind  of 
complement,  which  is  finite  and  occurs  to  the  right  of  the  verb.  The  contrast  is  shown  in  (1 1 ) 
for  a  complement  clause  not  containing  a  relative  operator. 

(11)  (a)   pulis  [[  us  -  aadmii-koo  coor]  samajhtii  hai] 
police  this  man  dat.    thief   believe    is 
"The  police  believe  [this  man  to  be  a  thief].' 

(b)  pulis  samajhtii  hai  [ki  voo  aadmii  coor  haq 
police  believe   is  that  this    man  thief     is 
The  police  believe  that  this  man  is  a  thief. 
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The  sentence  in  (5)  contains  a  more  complex  clause  with  the  same  structure  illustrated  in 
(11a),  and  with  a  relative  operator  inside  the  embedded  clause.  This  relative  operator  has 
wide  scope  over  tx)th  its  clause  of  origin  and  the  matrix  clause.  The  altemative  structure  with  a 
finite  complement  clause  would  tie  the  sentence  in  (12): 

(12)  [pulis  sooc   rahii  hai    [ki    Jls  aadmll-nee/]ln-]ln  aadmlyoo-nee  bam 
police  think  progr.  is  that  which  man-erg     /which-which  men-erg         bomb 
Si  S2 

rakhaaQ  mujhee  yah  pakkaa  visvaas  hai  ki  voo  nirdoos  hai. 
placed     l-OAT      this  firm      belief     is  that  he  innocent  is 

(a)  'Wide  scope' reading: 

I  firmly  t>elieve  that  the  man  is  innocent  who  the  police  believe  e  set  the  bomb; 
I  am  convinced  [that  whoever  the  police  believe  e  e  set  the  bomb]  is 
innocent. 

(b)  'Contradictory'  reading: 

'The  police  believe  [that  whoever  (really)  set  the  bomb,  I  am  convinced  that 
he  is  innocent]'. 

The  structure  in  (12)  is  exactly  analogous  to  (5),  which  has  the  wide  scope  reading  and 
which  in  fact  must  have  the  wide  scope  reading  because  of  the  deletion  of  S'  and  CX3MP.  The 
question,  then,  is  whether  the  wide  scope  reading  is  possible  in  (12)^qr  the  relative  phrase  jls 
aadmil-nee  ,  or  the  'generic'  reduplicated  version  jIn-jIn  aadmlyoo-nee  "which  man, 
whichever  men-erg.'.  Judgements  are  very  mixed  about  the  interpretation  and  well- 
fonriedness  of  this  sentence.  Some  speakers  will  say  that  the  wide  scope  interpretation  (12a) 
is  possible  for  (12),  though  the  structure  (5)  is  preferable. ^  Others  find  the  structure  well- 
fomf>ed  only  if  it  has  a  different  constituent  structure,  as  in  (13),  which  con-esponds  to  a 
contradiction-of-beliefs  reading,  as  in  (12b).  For  these  speakers,  wide  scope  is  not  permitted 
out  of  a  finite  clause  to  the  right  of  the  matrix  verb. 

(13)  pulis   sooc   rahii  hai   [ki        [Jls  aadmil-nee/jln-jin  aadmlyoo-nee 
police  think  prog,  is  that  which  man-erg        /  which-which  men-erg 
bam  rakhaa]   mujhee  yah  pakkaa  visvaas  hai  [ki  voo  nirdoos  hai]] 
bomb  placed    l-DAT     this      firm  belief  is  that     3  sg  innocent    is 

The  police  think  that  [I  believe  [whoever  set  the  bomb]  is  innocentf. 

The  conditions  on  well-formed  movement  to  COMP  can  be  summarized  as  follows: 

(14)  COMP  to  COMP  movement  may  not  take  place  out  of 

(a)  right  branch  (finite)  clauses 

(b)  if  these  clauses  are  adjuncts  and  not  subjects  or  objects.2 

Speakers  who  reject  the  wide  scope  reading  (12a)  adhere  to  condKion  (14a),  which 
prohibits  CX)MP  to  COMP  movement  from  any  finite  clause  which  is  a  right  branch  in  any  phrase. 
Those  who  accept  the  wide  scope  reading  (12a)  also  have  the  conditnn  (14b),  which  says 
that  right  branch  finite  clauses  which  have  the  status  of  being  the  argument  of  a  verb,  a 
subject  or  object,  permit  nrrovement  out  of  COMP.  This  distinction  describes  the  differences  of 
sp>eakers'  judgements  about  (12)  and  means  that  no  speaker  shoukj  accept  wide  scope 
readings  out  of  finite,  right  branch  adjunct  clauses,  adverbial  or  modifier  clauses  not 
sutx:ategorized  by  a  verb.3  (A  similar  pattem  is  found  for  constituent  questtons.) 


There  are  contrasts  of  exactly  this  type,  in  which  a  wide  scope  reading  is  readily 
available  out  of  a  non-finite,  left  branch  adjunct  clause,  but  not  out  of  the  con-esponding  finite 
and  right  branch  versions.  For  exannple,  the  wide  scope  reading  is  possible  for  jls  kltaab- 
koo  in  (15).  Here,  the  postposition  see  "from"  (in  boldface)  is  the  head  of  the  phrase  in  which 
the  internal  clause  is  a  left  branch.  The  relative  phrase  has  scope  over  both  Si  and  S2. 

(15)  [(meeree  bhaaii- kee  jls  kitaab-koo  paRhnee  see]  maf  gussaa  hoogaii] 

my  brother-of  which  book  DAT.      read-inf.from      I     angry  become  pert, 

(vookitaab  acchii  nahfithii). 
ttnatbook    good    not  was 

The  book  whichj  I  got  angry  on  account  of  my  brother's  reading  ei  was 
disgusting'. 

The  sentence  in  (16)  should  bejhe  counterpart  of  (15),  except  that  the  adjunct  clause  is 
a  right  branch  of  the  preposition  kyoo  ki  'because'.  However,  this  sentence  is  ill-formed  if  it  is 
meant  to  be  synonymous  with  (15).  It  has  a  different  reading  and  constituent  structure;  cf. 
(17). 

(16)  '[[mafgussaa  hoo  gaii  [kyoo  ki    meeree  bhaaii-nee    jookltaab   paRhii 

I       angry  became       because      my     brother-erg.    which  book    read 
(voo  kitaab  acchii  nahiTthii) 
that  book  good   not    was 

The  book  whichj  [I  got  angry   because  my  brother  read  ej]  was  disgusting". 

(17)  I  got  angry  [because  the  book  [which  my  brother  read  ej]  was  disgusting. 

in  the  (15)-(16)  reading,  the  cause  of  anger  is  that  one's  brother  read  the  book,  which 
independently  was  a  nasty  book.  In  the  (17)  reading,  the  nasty  quality  of  the  book  AND  the 
fact  that  my  brother  read  it  are  the  causes  of  anger.  In  the  (15)-(16)  reading,  the  relative 
phrase  has  scope  over  the  reason  clause.  In  the  (1 7)  reading,  the  reason  clause  contains  and 
has  scope  over  the  relative  clause.  In  the  (17)  reading,  the  relative  operator  has  narrow 
scope,  over  only  its  clause  of  origin. 

Relative  phrases  in  Hindi-Urdu  have  semantic  scope  over  their  clauses  of  origin. 
Whether  they  have  wide  scope  over  matrix  clauses,  too,  is  dependent  on  the  syntactic 
relation  of  the  embedded  clause  to  the  constituent  which  it  is  a  sister  of,  the  verb  or 
adposition  in  the  matrix  clause.  If  the  embedded  clause  is  a  non-finite  and  left  branch  clause, 
a  relative  phrase  within  it  in  S-Structure  may  have  wide  scope  over  the  matrix  clause  at  LF, 
whether  or  not  there  is  an  intermediate  COMP,  as  in  (7),  and  whether  or  not  the  clause  is  an 
argument,  as  in  (7),  or  an  adjunct,  as  in  (15).  If  the  embedded  clause  is  a  finite  and  right 
branch  clause,  a  relative  phrase  within  it  may  for  some  speakers  not  have  wide  scope  at  all  (cf. 
Condition  (14a)).  For  others,  wide  scope  is  allowed  out  of  a  finite  clause  only  if  it  is  an 
argument,  as  subject  or  object  'connected'  with  the  verb  of  the  higher  clause.  If  it  is  an 
adjunct,  wide  scope  of  a  relative  phrase  is  never  possible,  as  in  Condition  (14)  b.  These  are 
informal  and  specific  conditions  on  when  relative  operators  may  move  to  the  matrix  COMP  and 
so  c-command  the  matrix  clause.S 

In  spite  of  the  formal  differences  at  S-Stoicture  between  languages  with  in  situ  relatives 
and  relatives  which  move  to  COMP  obligatorily,  there  are  common  issues  which  involve 
semantic  scope.  The  commonality  of  semantic  scope  relations  can  be  represented  formally  in 
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terms  of  syntactic  relations  and  syntactic  conditions  on  well-formed  movement  to  COMP.  The 
differences  between  languages  can  be  stated  in  terms  of  when  syntactic  movement  to  COMP 
is  required,  in  deriving  S-Structure  or  Logical  Form. 


NOTES 

*l  am  grateful  to  Baber  Khan,  Chandrika  Kaul,  Manoj  Jha,  and  Abha  Gupta  for  judgements 
about  the  Hindi-Urdu  sentences,  and  to  James  Gair,  Yarrxjna  Kachru,  and  Andrea  Tyler  for 
helpful  comments. 

1  There  are  alternative  ways  of  expressing  this  meaning,  using  two  finite  clauses.  The 
relative  phrase  which  is  intended  to  have  scope  over  the  matrix  clause  occurs  in  the  matrix 
clause  in  S-stnjcture.  It  is  coindexed  with  a  pronoun  (null  or  overt)  in  the  embedded  clause, 
giving  the  equivalent  of  semantic  scope  over  two  clauses  via  movenrwnt  in  LF: 

(i)    pulis  jls-aadmil-koo/JIs  aadmll  kee    baaree-  mee  maantii  hai 
police  which  man  DAT  /  which  man    of  concerning         considers 
ki  0/voo  coor  hai, ... 
that  he    thief     is 
The  man  concerning  whom  the  police  think  that  he  is  a  thief..' 

2What  is  expressed  here  as  a  set  of  stipulations  can  t>e  phrased  in  more  general  terms. 
For  example,  since  movement  in  LF  leaves  an  empty  category  in  the  S-stnjcture  site  of  the 
relative  phrase,  the  conditions  on  nxjvement  can  be  expressed  in  terms  of  conditions  on 
coindexing  an  empty  category  with  an  antecedent  which  c-commands  it.  One  such  proposal 
is  the  Connectedness  condition  of  Kayne  (1983),  which  defines  a  connected  tree  between 
the  empty  category  and  the  antecedent,  made  up  of  government  relations  and  X"  projections. 
This  proposal  woukJ  give  the  same  results  as  the  conditions  in  (14)  if  non-finite,  left  branch 
clauses  are  always  govemed,  allowing  connectedness  between  embedded  and  matrix 
clauses.  Finite,  right  branch  clauses  are  never  govemed,  except  through  argument  relations, 
and  only  for  some  speakers.  This  condition  excludes  adjunct  clauses  if  they  are  right 
branches. 

3j.  Gair  suggests  that  the  conditions  (14)  may  hokj  for  all  speakers,  txjt  not  all  speakers 
may  perceive  an  argument  relation  between  the  matrix  clause  and  the  embedded  sentence  if 
the  embedded  clause  is  not  in  the  normal  position  of  an  embedded  subject  or  object  clause. 
Similar  distinctions  exist  for  constituent  questions  between  argument  position  to  the  left  of 
the  governor  and  adjunct  positions  to  the  right.  Wide-scope  readings  are  not  permitted  for 
constituents  in  non-finite  subordinate  clauses  postposed  to  the  right  of  a  verb.  Directionality 
rather  than  tense  is  the  deciding  factor. 


REFERENCES 

CHOMSKY,  Noam.  1981.  Lectures  on  Government  and  Binding.  Dordrecht:  Foris. 

— .  1982.  Some  concepts  and  consequences  of  Government  and  Binding.  Cambridge: 

MIT  Press. 
HUANG,  J.  1982.  Logical  relations  in  Chinese  and  the  theory  of  grammar.  MIT  Ph.D. 

dissertation. 
KAYNE,  R.  1983.  Connectedness.  Linguistic  Inquiry  16. 


todies  in  the  Linguistic  Sciences 
olune  17,  Nunber  1,  Spring  1987 


THE   DEVELOPMENT   OF   PERSONAL   PRONOUNS    IN   MODERN   GONDI 

Garapaii,    U.    Rao 

SUNY,    Stony   Brook 

and 

Osmania  University,  Hyderabad 

Abstract 

Gondi ,  a  Dravldian  language,  exhibits  a  bewildering 
multiplicity  of  pronominal  variants  which  cannot  be  derived 
from  a  single  proto-forra.  A  comparison  of  Proto-Gondi 
personal  pronouns  with  similar  reconstructions  in  the  other 
Dravidian  languages  suggests  that  the  variation  among 
personal  pronouns  is  not  a  phenomenon  exclusive  to  Gondi  but 
can  be  traced  to  Proto-South  Dravidian.  However,  there  remain 
a  number  of  variants  unaccounted  for  by  the  reconstructed 
forms.  The  development  of  these  variants  is  attributed  to  the 
loss  of  distinction  between  second  singular  and  plural 
nominative  forms  and  subsequent  efforts  toward  the 
restoration  of  the  distinction* 


1.0  Introduction: 

Gondi,  the  language  of  the  Gonds  or  Koytor  (as  they  call  them- 
elvea) ,  is  one  of  the  most  widely  spoken  nonliterary  languages,  with 
wo  million  speakers  spread  over  a  large  tract  of  Central  India. 

It  has  been  observed  that  Gondi  exhibits  a  great  deal  of 
ariation  from  one  region  to  another,  more  in  phonology  than  in  morph- 
logy  and  syntax.  There  are  at  least  ten  speech  varieties  identified 
y  the  variation  in  personal  pronouns.  The  bewildering  multiplicity  of 
ronominal  variants  found  in  Gondi  is  unprecedented  in  any  other 
anguage  of  the  subcontinent.  Each  pronoun  has  a  number  of  regional 
ariants,  and  the  use  of  more  than  one  alternant  in  the  same  dialect 
s  not  uncommon.  In  the  following  sections  the  source  and  the  develop- 
ent  of  variation  in  personal  pronouns  will  be  discussed. 

2.0  The  data:1 

Personal  pronouns  in  Gondi  are  distinguished  for  person  (first 
1)  and  second  (2))  and  number  (singular  (sg. )  and  plural  (pi.)).  Each 
ronoun  has  at  least  two  forms,  a  free  form,  which  occurs  in  the 
ominative  (nom. ) ,  and  a  bound  form,  which  occurs  elsewhere.  The 
atter  is  referred  to  as   oblique  base  (obi.). 

2.1  The  singular  forms  of  the  first  and  second  persons  have  two 
ajor  variants,  one  with  an  initial  nasal  and  the  other  without,  as 
hown  in  (i>  a.  and  b. 
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(I)  a.   First  person  singular  forms: 

nom.         obi.    Geographical  distribution 

ana         na-     Sehore,  Hoshangabad ,  Betul , 

Chindwara,  Amraoti ,  and  Nagpur. 
nana,  nan    nS-     Akola,  Nagpur,  Wardha ,  Yeotmal , 

Bhandara ,  Balaghat,  Seoni ,  Handla, 
Ourg,  Chandrapur,  Bastar ,  Handed, 
Adilabad,  Khammam,  and  Koraput. 

(1)  b.   Second  person  singular  forms: 

ima,  uma    nT-     Sehore,  Hoshangabad,  and  Balaghat. 
ima,  ime    nT-     Betul,  Amraoti ,  Akola,  Wardha, 

Nagpur,  Yeotmal,  Chindwara,  Seoni, 
Mandla,  and  Balaghat. 
nima,  nlm    nl-     Durg  and  Bastar. 
nime,  nim    ni-     Chandrapur  and  Adilabad. 
nima         nl-     Khammam   and   Malkangiri. 

2.2  The  plural  forms  of  the  first  and  second  person  pronouns 
exhibit  an  even  greater  variety  in  shape  and  size  than  the  correspond- 
ing singulars.  Compare  the  forms  given  in  (2)  and  (4). 

(2)  First  person  exclusive  plural  forms: 

nom.  obi.     Geographical  distribution 

ama\  ma-     Sehore. 

ama^,  amap     ma-     Betul,   Chindwara,  Balaghat, 
Amraoti,  Akola,  Wardha, 
Nagpur,  Bhandara. 
amak,  amok     mi-     Sehore,  Balaghat,  and  Yeotmal. 
mama^,  mamaf    mS-     Northern  Seoni,  Mandla,  North- 
western  Bastar   (plains) . 
mamo^ ,  mamok    mff-     Adilabad. 
raamo^u,  mama^u  Southern  Chandrapur. 

namat,  namo^    mS-     Northeast  of  Bastar,  Eastern 
Adilabad   and  Chandrapur 
(Grierson  1906) . 
mamaV,  mam        ma-     Northern  Seoni,  Western  Mandla. 

mama,  mam  mff-  Southwest  of  Bastar. 
mamo,  mam,  mom  mS-  Southeast  of  Bastar. 
mama  ma-     Khammam  and  Malkangiri  (Burrow 

and  Bhattacharya  1960). 
mat,  mSr  «n5-     Northwest  of  Bastar  (hills), 

and  Northeast  of  Garhchiroli. 

2.3  There  are,  in  some  dialects,  two  plural  forms  for  the  first 
person,  represented  by  morphologically  and  semantically  distinct  sets, 


> 


aplo 

aplo- 
aple- 

amahk, 

amok 

ma- 

mara^ 

raa- 

manaf 

mana- 

manal 

mana- 

mana^a 

mana- 

manafj 

mana- 

t 
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viz.    exclusive   (of   addressee)  (excl.)  as  in  <2)  and  inclusive   (of 
addressee)  (incl.)  as  in  (3). 

(3)  First  person  inclusive  plural  forms: 
apan ,  apon    apna-    Sehore,  Hoshangabad,  Chindwara, 

and  Handla 

Beiul ,  Seoni,  and  Adilabad 

Nagpur,  Yeoimal ,  and  Akola 

Amraoti  (Grierson  1906) 

Adilabad 

Southern  Chandrapur 

Southern  Bastar 

Khammam  (Tyler  1969) 

Khammam  (Kachineni  1979) 

(4)  Compare  the   second  person  plural  forms: 
ima^,  ime^     ml-      Sehore,  Hoshangabad,  Betul , 

Chindwara,   Seoni,   Mandla, 
Balaghat,  Bhandara,  Yeotmal , 
Amraoti;  Akola,  and   Nagpur 
(Grierson  1906). 

ima^,  uma^     ml-      Betul  (Shahpur) ,  Hoshangabad, 
and  Balaghat  (Waraseoni). 
Mandla  (Ghugri). 
Chandrapur. 

Southern  Chandrapur  and 
Adilabad  (Boath) . 

nima\,  nlrae^    ml-     Northern  Bastar  (plains),  and 
Adilabad  (Neville  1969). 
Adilabad. 

Bastar  (Madia  dialect 
(Grierson  1906)). 
Northwestern  Bastar  (hills) 
and  northeastern  Garhchiroli. 
Southeastern  Bastar. 
Southwestern  Bastar,  Khammam, 
and  Malkangiri. 

2.4  On  the  basis  of  phonological  and  morphological  variation, 
these  pronominal  variants  may  be  identified  as  belonging  to  one  of  the 
four  sets,  as  summarized  in  (5). 

(5)  (a).  Variants  with  an  initial  V(owel)-. 
(b).  Variants  with  an  Initial  n-. 

(c).  Variants  with  an  initial  m-. 

<d) .  Variants  with  morphological  restructuring. 


imahk, 

imak 

mi- 

mima^. 

mimoV 

mf- 

mimeV 

mr- 

mira^ 
miraf 

mr- 

mt- 

«Tt,  mir 

ml- 

mlr,  mlr 
mfr 

mX- 

mr- 
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2.5  The  pronominal  alternants  in  different  combinations  form  the 
following  ten  major  sets  of  personal  pronouns  as  found  in  various 
dialects  of  Gondi .  These  sets  disregard  a  great  deal  of  variation, 
e.g.  uma,  ime,  mamo^,  ime\,  etc.  Each  of  these  sets  is  a  character- 
istic of  a  particular  region  though  certain  regions  exhibit 
lapping  sets.  Ten  major 
sets  given  in  (6).  Al< 
against  the  characteristic  pronominal  set  (cf.  map  1). 


>egion  though  certain  regions  exhibit  over-^ 
dialect  groups  may  be  identified  based  on  thel 
so  see  (7)  in  which  each   dialect   is   listed^ 


(6)   1  sg.      I)  an 
pl 
2  sg 
pl 


II)  nan£ 


III) 


amaT 

amaT 

mamaT 

ima 
imaT 

ima 
imaT 

ima 
imaT 

IV) 

nana 

V) 

nana 

VI) 

nana 

III) 

nana 

mamaT 

mamaT 

namaT 

mamaT 

nima 
nimaT 

nima 
mimaT 

nima 
nimaT 

nima 
miraT 

VIII) 

nana 
mlT 
nima 
mIT 

IX) 

nana 
mama 
nima 
mir 

X) 

nana 
mama 
nima 
mTr 

I  will  use  -T  at  the  end  of  the  plural  forms  as  a  short  hand 
expression  to  indicate  the  plural  suffix  variants:  -t ,  -f , 
and  -k. 


(7) 


let  no.  dialect 

I  WG  (Western  Gondi) 

II  CG  (Central  Gondi) 
HI  NG  (Northern  Gondi) 

IV  NBG  (W)  (Northwest  Bastar  Gondi) 

V  SG  (Southern  Gondi) 

VI  NBG  (E)  (Northeast  Basiar  Gondi) 

VII  AG   (Adilabad  Gondi) 

VIII  HMG  (Hill  Maria  Gondi) 

IX  SBG  (South  Bastar  Gondi) 

X  SEG  (South  Eastern  Gondi) 


3.0   Reconsiruction: 

To  reconstruct  the  Proto-Gondi  set  of  personal  pronouns  based  on 
the  data  given  in  section  2.0  alone  may  be  misleading  because  the 
variation  noted  among  the  personal  pronouns  of  Gondi  is  also  found, 
though,  to  a  lesser  extent  in  most  of  the  languages  of  South-central 
Dravidian  (ScDr.)  of  which  Gondi  is  a  member.  However,  as  a  test  case 
I  would  first  consider  reconstructing  personal  pronouns  for  Proto-' 
Gondi   based  on  the  data  available  within  Gondi  and  then   compare   thel 
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results  with  the  reconstructions  for  other  subgroups  of  Dravldian. 

3.1   On  the  basis  of  the  data  drawn  from  Gondl ,  the  following  set 
of  personal  pronouns  could  be  set  up  for  Proto-Gondi : 


1  sg.    1  pi.    2  sg.    2  pi. 


ana 

amaT 

ima 

imaT 

nana 

naroaT 

nima 

nimaT 

mamaT 

mimaT 

mama 

miraT 

PG 


Table  1 


The  above  reconstruction  is  supported  by  the  fact  that  neither 
Proto-Gondi  nor  any  dialect  of  Modern  Gondi  have  ever  experienced  loss 
of  initial  nasals. 2  The  dialectal  variants  of  second  plural  miT  (mlj^, 
mij  and  mfr  of  sets  VIII,  IX  and  X)  and  the  first  plural  mST  TmSi  and 
ma]f)  are  excluded  from  the  reconstruction  because  they  were  considei — 
ed  either  due  to  morphological  restructuring  and  /  or  borrowing.  Thus 
the  reconstructed  forms  shown  in  table  1  may  be  considered  as  proto- 
set  of  the  dialects  representing  the  pronominal  sets  1-VlI  but  not  of 
Proto-Gondi . 

3.2  A  close  look  at  the  pronominal  variants  of  the  singular  and 
plural  forms  reveals  that  many  of  the  plurals  carry  an  additional 
plural  suffix  which  can  be  any  one  of  the  three  variants:  -^,  -£ 
and  -k.  However,  this  feature  cannot  be  reconstructed  for  Proto-Gondi 
for  the  following  reasons:  1)  The  plural  suffix  is  not  found  in  the 
dialects  of  SBG  (set  IX)  and  SEG  (set  X);  2)  Even  among  the  other 
dialects  where  it  is  commonly  found,  the  plural  suffix  marker  is  not 
the  same;  3)  Occasionally,  in  some  of  these  dialects  (NG  and  AG  see 
#2.2)  the  plural  suffix  may  altogether  be  omitted. 

However,  it  might  be  argued  that,  if  a  plural  suffix  is  found  in 
several  dialects,  it  might  be  reconstructed  for  Proto-Gondi.  In  spite 
of  its  occurrence  in  a  number  of  dialects,  the  absence  of  it  in  two 
dialects  (SBG  and  SEG)  covering  a  considerable  region  is  conspicuous 
and  should  be  accounted  for.  It  will  be  evident  from  the  following 
discussion  that  in  fact  the  presence  of  morphologically  restructured 
and  borrowed  forms  in  these  dialects  (which  exhibit  sets  VIII,  IX,  and 
X)  may  be  interpreted  as  a  precursor  of  the  plural  suffixation 
experienced  by  the  other  dialects.  All  this  evidently  indicates  thai 
the  formation  of  plurals  by  the  addition  of  plural  markers  is  not  a 
pan-Gondi  phenomenon  and  thus  does  not  lend  support  for  the  ( 
reconstruction  of  this  formation.  i 


41 


3.3  Therefore,  a  correction  should  be  made  in  order  to  arrive  at 
a  reasonably  safe  proto-set  of  personal  pronouns.  The  corrected  proto- 
type of  the  pronominal  system  is  given  in  (8),  along  with  the 
corresponding  oblique  bases: 

(8) 


nom. 

obi. 

1    sg. 

*ana 

*nS- 

pi. 

Xama 

*ma- 

2    sg. 

*ima 

»nT- 

pi. 

«ima 

«mT- 

3.4  Curiously,  the  second  plural  nominative  is  homonymous  with 
the  second  singular  nominative,  whereas  the  corresponding  oblique 
bases  show  overt  differences.  On  the  comparative  evidence,  the  Proto- 
Gondi  second  singular  may  be  related  to  pre-Gondi  second  plural 
*»ima,  (cf.  second  plural  nominative  in  Ta.  nTm,  ninka^j  Ma.  ninftaj.; 
Ko.  ni'm;  To.  ntm;  Kod.  nii^ga;  Ka.  nTm,  nTvu,  nfnga^;  Tu.  niku^u, 
inku|u;  Te.  mimm-  'oblique  base  in  accusative;  Kon .  nTm;  Kv.  mrmbu;Pe. 
lelegi;  Man.  Tm;  Nk.  im;  Pa.,  Ga.  Im;  Kur.,  Malt,  nim;  Br.  num.  (DEDR 
3688)  and  Proto-Dravidian  second  plural  nominative  Klro  (Zvelebil 
1962),  Wnlm  (Krishnamurti  1968,  Subrahmanyam  1968b,  Zvelebil  1977), 
which  presumably  has  replaced  the  pre-Gondi  second  singular  **i_na. 
Hence  a  prototype  of  pronominal  system  for  pre-Gondi  can  be  posited  as 
shown  in  (9) . 

(9) 


nom. 

obi. 

1 

sg- 

**ana 

*»na- 

pi. 

**ama 

»xma- 

2 

sg- 

**ina 

»»nl- 

pi. 

**ima 

**mi- 

Compare  the  reconstructed  form  of  pre-Gondi  second  singular  *»ina 
with  that  of  the  reconstructed  form  of  Proto-Dravidian  second  singular 
*rn  (Zvelebil  1962),  »nTn  (Krishnamurti  1968,  Subrahmanyam  1968b, 
Zvelebil  1977),  and  «in).r  (Kuiper  1972:147);  (cf.  Ta .  ,  Ma.,  Ko.  ,  To. 
nT;  Ka.  nTn  (^u)  ;  Kod.  nrni,  ni";  Tu .  i_;  Te.  n  fvu ,  Tvu;  Kon.  nin;  Ku.  Inu 
(W) ,  nlnu";  Kv.  ninu;  Pe.  In;  Man.  In;  Kol .  nfv;  Nk.  nlv,  Iv;  Pa.,  Ga. 
In;  Kur.,  Malt.  nIn;  Br.  nf  (UEUK  3684). 

3.5  The  pre-Gondi  pronominal  set  is  regular,  with  the  singular 
vs.  plural  distinction  expressed  by  the  opposition  of  medial  n/m, 
while  person  is  expressed  by  the  a/i_  contrast.  Proto-Gondi  did  not 
inherit  the  second  singular  **ina  of  pre-Gondi  (see  (8)),  while  the 
reconstructed  second  singular  and  plural  forms  for  Proto-Gondi  are 
homonymous  (see  (8)),  that  led  to  the  expansion  of  the  domain  of  *i.ma 
to  cover  second  singular  nominative  as  well  (yet  the  distinction 
between  the  corresponding  oblique  bases  is  maintained).  Presumably 
this  should  have  rendered  the  Proto-Gondi  pronominal  set  incoherent  or 
asymmetric.    (Incidentally,   Modern  English  too,   does  not  distinguish 
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between  the  second  person  singular  and  plural  not  only  In  the 
nominative  but  also  in  the  oblique  base,  therefore  not  asymmetric.) 
Consequently  the  pronominal  system  must  have  been  in  a  state  of  great 
instability  that  led  to  a  number  of  innovations  toward  regularity 
which  are  reflected  in  the  dialects  of  Modern  Condi. 

4.0   Discussion: 

To  substantiate  the  above  reconstruction,  the  immediate  task  will 
be  to  account  for  all  divergent  forms  of  pronouns  found  in  the 
dialects  of  Modern  Condi,  assuming  the  proto-type  set  as  the  source. 

4.1  The  structure  and  the  distribution  of  the  modern  pronominal 
sets  across  the  dialects  suggest  two  different  processes:  (1)  Innovat- 
ion toward  the  restoration  of  the  singular  vs.  plural  distinction  in 
the  second  person  nominative  by  the  addition  of  a  plural  suffix  (as 
seen  in  ima  :  imaT,  nima  :  nimaT);  (2)  Analogical  spread  of  the 
initial  nasals  of  oblique  bases  to  the  corresponding  nominative  forms 
(e.g.  ima  >>  nima,  Kama  >>  mama).  (It  may  be  mentioned  here  that  the 
operation  of  any  one  of  the  two  processes  would  restore  the  singular 
vs.  plural  distinction.)  Further,  the  distribution  of  the  personal 
pronouns  across  the  dialects  of  Condi  indicates  that  the  process  of 
restoration  of  number  distinction  (by  appending  a  plural  suffix) 
started  in  the  north-western  dialects  (sets  I-Vll)  and  moved  down 
toward  the  south-eastern  dialects  (SEG  and  SBG  represented  by  the 
sets  X  and  IX),  which  it  never  reached,  while  the  analogical  extension 
of  initial  nasals  first  appeared  in  the  dialects  of  the  south-east 
and  spread  toward  the  north-west  (see  WG  representing  the  set  I  is 
totally  unaffected,  whereas  sets  II  and  III  are  partially  affected). 
The  figure  1  is  a  diagrammatic  illustration  of  the  spread  of  these 
innovations: 

Hindi 


I  WG 


II 


AG 


VII 


Telugu 


Fig.  1. 
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The  above  assumption  is  necessitated  by  the  pronominal  sets  I 
<aQ£f  ^maT,  Ima,  ImaT) ,  IX  and  X  (nana,  mama,  nlma,  mir/mir) ,  which 
do  not  show  one  or  the  other  of  the  two  innovations.  In  other  words, 
some  dialects  must  have  started  showing  the  analogical  spread  of  the 
nasals  even  before  the  development  of  the  singular  vs.  plural 
distinction,  and  in  some  other  dialects  the  converse  may  be  true.  The 
dialects  which  exhibit  the  analogical  spread  of  nasals  prior  to  plural 
suffix  affixation  are  geographically  contiguous  with  Telugu  and  other 
South-central  Dravidian  languages  <cf.  map  1).  Analogical  extension  of 
the  nasals  of  the  oblique  bases  to  the  nominative  forms  is  an  areal 
phenomenon  found  in  Telugu,  Konda ,  Kui ,  and  Kuvi .  In  a  similar  fashion 
the  dialects  of  Gondi  (HMG,  SBG  and  SEG)  have  borrowed  mlr  from 
Telugu  (see  Subrahmanyam  1968)  to  replace  the  inherited  homonymous 
second  plural  ^Ima.  In  addition  to  Gondi  and  Telugu,  certain  other 
languages  of  South-central  Dravidian  exhibit  *mrr_(.u)_  which  according 
to  Krishnamurti  (1968:194)  is  an  areal  convergence  among  these 
languages. 

4.2  Obviously  the  second  plural  forms  of  sets  VIII,  IX,  and  X 
<!d1I»  !5l£  s"cl  mfr)  on  one  hand,  and  of  sets  IV  and  VI  (nima,  ni.raaT; 
possibly  of  VII,  mimaT  <<  *nimaT)  on  the  other  hand  indicate  that 
the  spread  of  initial  nasals  of  the  corresponding  oblique  bases  had 
not  resulted  in  a  singular  vs.  plural  contrast.  Presumably,  dialects 
IV-X  share  this  feature.  Had  the  initial  nasals  n-  and  m-  of  second 
person  oblique  bases  spread  to  the  homonymous  second  person  singular 
and  plural  nominative  forms  (tfl.ma  :  ^ina)  there  would  have  resulted 
forms  like  nima  and  mi.ma  corresponding  to  the  first  person  singular 
and  plural  nominative  forms  nana  and  mama.  However,  the  spread  of  n- 
both  to  the  second  singular  and  plural  nominative  forms,  rather  than 
the  expected  n-  and  m-  of  respective  clique  bases  as  in  the  case  of 
the  first  singular  and  plural  pronouns,  failed  to  restore  the  singular 
vs.  plural  distinction  among  the  homonymous  second  person  nominative 
forms.  Consequently  the  asymmetry  or  the  irregularity  in  the  proto- 
type system  remained  unchanged  .  At  this  stage,  a  number  of  innovat- 
ions (see  HA. 3  A-G)  toward  the  restoration  of  number  distinction 
were  carried  over. 

4.9  To  conclude  the  foregoing  discussion,  I  will  posit  the 
following  changes  to  account  for  the  divergence  of  the  Proto-Gondi 
pronominal  system  from  which  is  found  in  the  dialects  of  Modern  Gondi: 

(A)   (i)  Analogical  spread  of  n-  of   the  first   singular   oblique 
base    to   the   corresponding   nominative   form   as 
attested  in  sets  II-X. 

nom.  :  obi. 
1  sg.   :       «ana  :  »na-    >>   nana  :  na- 
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(A)    (ii)   Analogical  spread  of   m-  of  the  first   plural   oblique 
base  to  the  corresponding  nominative  as   attested  in 
sets  III-X. 

nom.    :   obi . 
1  pi.  :    Kama   :   Kma-    >>   mama  :  ma-  ' 

(A)  (iii)  Analogical  spread  of  n-  of  the  second  singular  oblique 
base  to  the  homonymous  second  singular  and  plural 
nominative    forms.  Attested  in  sets  IV-X.   (cf.  (10)) 

(10)  nom.  :  obi. 

2    sg.  :  )(ima    :     Jfni-  >>         nima    :     nT- 

pl.  :  *ima    :     *tk1-         >>         nima    :     ml- 

(A)  (iv)   Replacement  of  m-  of  the  nominative  first  plural   by  n- 
to   generalize   the  initial  n-  for   all   nominative 
forms  as  attested  in  set  Vi   (11)  below: 

(11)  *nana  nana 
*raamaT  >>  namaT 
Knima      nima 
i^nimaT     nimaT 

(A)  (v)   Replacement  of  n-  of  the  nominative  second   plural   form 

by   ro-  to  achieve  regularity  in  the  system.   Attested 
in  set  V  as  in  (12) : 

(12)            Knana  nana 

XmamaT  mama! 

Knima  nima 

XnimaT  >>   mimaT 

(B)  Plural  suffix  affixation.  Attested  in  sets  I-VIl  (cf.   (13)). 

(13)         »ama  +  T  >  amal  (sets  1  and  11) 

Kmama  4  T  >  mama!  (sets  111  to  VII) 

wima  -f  T  >  imaT  (sets  I  to  III) 

*nima  +  T  >  nimaT  (sets  IV  to  VII) 

A  number  of  nonlocal  as  opposed  to  local  changes  (as  thei^r  effect 
is  ii™it£^  io  EEonominal  systentj.  e^&^j.  innovations  lA-B).  have 
additionally  contrubuted  to  the  diversity  of  pronouns: 

(C)  A   regular  change  «r  >  j^  is  attested  in  a  number   of   Modern 

Gondi   dialects.   Compare  set  IX:   mT{i  (<^mrr),   as   well 
as  o{:   (<  )tor)  *  they  (masc.  )  ,   in  the  same  dialect. 
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<D)  The  plural  auffix  variant  -^  freely  varies  with  -r  in  many 
dialects  of  Gondi  ,  as  in  uda^2l!j^a£  'sit  (plural)!'. 

(E)  A  sporadic  change  like  a  >  o/m is  known  to  have  operated  in 

many  dialects  of  Gondi  (cf.  mom-  'to  sell'  (<»mam-). 

<F)  An  unrounded  back  vowel  /a/  in  second  syllable  was  either 
fronted  or  rounded  by  a  preceding  front  or  rounded  vowel 
respectively  as  in  sewwer  (<*sewwar)  'gum,  resin',  sowwor 
(<*sowwar)  'salt'.  This  is  historically  older  than  rule  (E) . 

(G)  Word-final  vowels  are  optionally  reduced  or  deleted  in  many 
of  the  present-day  dialects,  except  in  SEG.  (cf.  (14)) 

(14)       nana'^nan       cf.  marri~mar  'son' 
mama^mam  maccu'^mac  'dew' 

4.4  Certain  dialects  exhibit  pronominal  variants  which  may  be 
due  to  morphological  restructuring.  The  second  plural  form  ml,ra^  was 
the  result  of  contamination  of  mimaj^  (which  occurs  as  a  regional 
variant  mime|^  in  AG  (Boath)  ,  see  (4)  (cf.  set  V))  with  mTr  (but  not 
due  to  the  addition  of  the  plural  suffix  -a^  to  mir  which  would  have 
resulted  in  Vmtra^,  with  a  long  vowel  in  the  Initial  syllable). 4  A 
parallel  development  is  found  in  the  first  person  plural  inclusive 
!!>£L£^  °^  ^^^  same  dialect,  which  is  possibly  derived  from  first 
person  plural  exclusive  namat  on  the  analogy  of  mlra^  Later,  by 
functional  polarization,  nara^  was  restricted  to  first  person  plural 
inclusive  usage,  beside  earlier  mama^  which  continued  to  be  used  as 
the  first  person  plural  exclusive  pronoun. 

In  the  dialect  of  HMG  (set  VIII)  the  earlier  form  mama  was 
replaced  with  an  analogically  remodeled  form  ma^  after  the  second 
plural  ml^  <5ljl  <  !5i£'  • 

5.0   What  does  comparative  evidence  indicate? 

The  reconstruction  of  personal  pronouns  for  Proto-Gondi  attempted 
in  section  3  is  entirely  based  on  the  data  drawn  from  the  dialects  of 
Gondi.  In  this  section  I  try  to  compare  the  reconstructed  personal 
pronouns  for  Proto-Gondi  with  the  reconstructed  forms  for  various 
subgroups  in  Dravidian.3  Compare  the  first  singular  forms  given  in 
(15). 

(15)         Go.       ana,    nana; 

SDr.  :       Ta.    ySn ,    nSn;    Ma.    ftSn;    Ko.    an;    To.    8n;    Ka.    an,    nSn; 
Kod.       ntfnl ,    na^ 
ScDr.:       Te.       9nu,    nSnu;    Kon.    nSnu;    Ku.    Snu,    nSnu;    Kv.    nSnu; 
Pe.       Sin  (eg);       Man.    Sn; 
CDr.:    Kol.,    Nk. ,    Pa.,    Ga.    Sn; 
NDr.:    Kur. ,    Malt.    8n;    Br.    T    (obi.     (k)-an-    (Emeneau) . 


The  presence  of  n-  forms  of  the  first  singular  in  the  languages 
of  South  Dravidian  and  South-central  Dravidian  (Tamil,  Malayalam, 
Kannada,  Kodagu ,  Telugu,  Konda ,  Kui ,  and  Kuwi )  suggests  that  the 
development  of  n-initial  forms  is  not  confined  to  Gondi ,  but  goes  back 
to  Proto-South-South-central  Dravidian.  The  Gondi  first  singular  ana 
has  its  counterparts  in  every  subgroup  of  Dravidian.  Therefore  both 
Kana  and  *nana  may  be  reconstructed  for  Proto-Gondi . 

5.1  Compare  the  following  forms  of  first  plural  given  in  (16): 

(16)  Go.  amaT,  mamaT,  mam (a),  namaT; 

SDr.:  Ta.  yam,  yafika^  ,  nankal ;  Ma.  nanhal ;  Ko.  am;  To.  em; 
Ka.   am,  nam,  nSvu;  Kod.  ei)ga,  nai)ga; 
SCDr.  :  Te.  emu,  nemu ,  memu;  Kon.  map;  Ku.  mSTmu;  Kv.   mambu; 
Fe.  3p(et^);  Man.  am; 
CDr.:  Kol.,  Nk.,  Pa.,  Ga.   am; 
NDr .  :  Kur .  ,  Malt,  em;  Br.  nan  'we'(<  ^nam  'we  incl.' 

DEDR  3647) . 

The  m-initial  forms  of  the  first  plural  otherwise  are  found  not 
only  in  Gondi  but  also  in  the  languages  of  South-central  Dravidian 
(Telugu,  Konda ,  Kui,  and  Kuwi),  which  warrant  the  reconstruction  of 
the  first  plural  exclusive  Wmam  for  Proto-South-central  Dravidian. 
Cognates  of  n-initial  forms  are  rather  widespread  among  the  languages 
of  South  Dravidian  (Tamil,  Malayalam,  Kannada,  and  Kodagu),  but  in 
South-central  Dravidian  only  in  Telugu  (besides  Gondi)  exhibits  such 
forms.  It  is  apparent  from  the  distribution  of  first  plural  forms  in 
each  language,  that  none  of  the  three  variants  i.e.  vowel  initial,  n- 
initial,  and  m-initial  forms  are  exclusive  to  Gondi. 

5.2  According  to  Krishnamurti  (1968),  the  Gondi  first  plural 
exclusive  namaT  is  originally  a  cognate  of  Proto-Dravidian  ^nSm  (1  pi. 
incl.),  which,  however  lost  its  distinctiveness  as  first  plural 
inclusive  and  merged  with  the  first  plural  exclusive  in  Modern  Gondi. 
Compare  the  following  forms  given  in  (17)  as  cognates  of  *nam  by 
Krishnamurti  (1968:191)  (Only  selected  forms  are  reproduced  below.): 

(17)  SDr.:       Ta.       nam,       nShkal;     Ma.     nSm,    nahnal;     Kod.     naijga; 

Ka.  nam;  Tu.nama; 
ScDr.  :   Te.  nemu,  memu,  manamu;  Go.  nama^,  mamai^,mama; 

Kon.  map;   Ku.   mSmu;  Kv.  mambu; 
CDr.:   Kol.  nSnqf;  Nk.  nind,  nSm; 
NDr.:   Kur.,  Malt,  nam;  Br.  nan. 

None  of  the  first  plural  inclusive  variants  found  in  the  dia- 
lects of  Gondi  seems  to  be  derived  from  the  reconstructed  Proto 
Dravidian  first  plural  inclusive  »(nSra.  Of  the  four  major  dialectal 
variants  of  the  first  plural  inclusive,  apan  with  its  minor  variants 
is   a   loanword  from   Indo-Aryan  and  is  mostly  found  in   the   northern 
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dialects  of  Gondi .  The  form  manal  aeems  to  be  an  early  borrowing  from 
Telugu  (cf.  Telugu  1  pi.  incl.  nom.  manam,  or  more  appropriately,  the 
corresponding  oblique  base  in  accusative  manal.-)  .  Closely  resembling 
this  form  is  mana£,  in  which  tha  Telugu  plural  marker  -llul  was  re- 
placed with  -r  a  plural  marker  in  Gondi.  Compare  also  mana^a,  found 
in  Khamman  (Tyler  1969).  The  final  -a  of  the  latter  form  is  due  to 
analogy  with  the  other  pronouns  (cf.  nana,  mama,  ni.ma)  .  The  prefinal  i 
is  due  to  the  merger  of  /jp/  with  /^/  in  that  dialect.  The  variant 
!>^ra^  used  in  the  Adilabad  dialect  was  an  analogical  creation  after 
the  second  plural  nira^  (cf.  #4.4). 

5.3  As  may  be  recalled,  the  second  singular  forms  of  Gondi  (nima 
and  ima)  are  related  to  Proto  Dravidian  2  pi.  ^InlTm  (see  #3.4). 
Therefore  it  is  appropriate  to  compare  the  second  person  forms  (both 
singulars  and  plurals)  with  their  cognates  in  the  sister  languages  as 
in  (18): 

(18)    Go.  2  sg.:ima,  nima;  pi.:  imaT,  niraaT,  mimaT, 

mlT,  mTj" ,  mir; 
SOr.:   Ta.   nlm,  nl/>ka},nrr,  nlyir;  Ha.  niAAa^;  Ko.  ni'm; 
To.  ni'm;  Ka.  nTm,  nTvu ,  nTnga^ ;  Kod.  niyga; 
Tu.  niku|u,  inku^u.  Try; 
ScDr. :   Te.   tru,   mTru;   Kon.   nira,  mTru;  Ku.  Tru ,  mrru; 
Kw.  mTmbu ,   mlru;   Pe.   ?p(eg);  Man.  Tm; 
CDr.:  Kol.  nTr;  Nk,  nTr,  Fm;  Pa.,  Ga.  Tm; 
NDr. :  Kur. ,  Malt,  ntm;  Br.  num. 

5.4  The  comparison  makes  it  clear  that  all  three  variants  (i.e. 
the  vowel-initial,  n-initial  and  m-initiai  forms)  of  the  second 
person  have  immediate  cognates  in  one  or  the  other  sister  language  of 
South-central  Dravidian.  We  found  the  same  variation  as  for  the  first 
plural  (cf.  #5.1).  The  second  plural  mfr  In  Gondi,  Telugu,  Konda,  Kui , 
and  Kuwi  may  be  explained  as  either  due  to  a  shared  innovation  in 
these  languages  or  due  to  an  areal  diffusion  (Krishnamurti  1968:204-5, 
Subrahmanyam  1969).  Variants  of  the  Gondi  second  plural  mlsaX 
(^mimeT'^roi.moT)  ,  Kw.  mrmbu,  and  Te.  mimm-  (ace.  base)  may  look  back  to 
an  older  form  ^mTm  common  to  the  subgroup.  The  reconstructed  form  of 
the  Proto-  Dravidian  second  plural  has  initial  n-  (cf.  Krishnamurti 
1968,  but  see  alsoZvelebil  1962,  1977;  Kuiper  1972),  but  vowel 
initial  forms  are  also  found  in  at  least  two  subgroups:  South-cetral 
Dravidian  and  Central  Dravidian.  Again  this  suggests  that  the  variants 
of  the  second  singular  i.ma,  Q.1>!$b>  plural  imaT,  nlns>  and  mi.maT,  and 
mTr  are  not  innovations  within  Gondi,  but  shared  retentions  from  a 
common  source. 

5.5  The  only  variants  that  may  be  considered  innovations  within 
Gondi  are  :  minor  variants  with  changes  in  vowels  (e.g.  i.me^ ,  amo^)  ; 
morphologically  restructured  forms  like  mlrai,  mara^,  mat,  *"<^  ?5it; 
and  plural  markers  -la)_i,   -k  etc.   For  the  latter,   note  the  argument 


under   section   3.2   that  plural  marking  on   Gondi 
cannot  be  reconstructed  for  Proto-Gondi. 


personal   pronouns 


The   evidence  presented  above  leads  us  to  reconstruct  the  follow- 
ing personal  pronouns  for  Proto-Gondi  as  in  (19): 


1  sg. 

pi.   (excl.) 

2  sg. 
pi. 


)(ana,  Mnana 

Kama ,  Kmama 

Kima,  l^nima 

Xima.  Xnima 


Knama 


Kmima 


6.0   Conclusion: 

In  this  paper,  1  have  reconstructed  personal  pronouns  for  Proto- 
Gondi  and  then  tried  to  compare  these  with  the  cognates  (and  also  the 
reconstructed  forms)  of  pronouns  of  various  subgroups  of  Dravidian.  I 
have  argued  using  the  relevant  data  from  other  Dravidian  languages  to 
justify  that  certain  variants  of  personal  pronouns  found  in  various 
dialects  of  Gondi  are  not  developments  within  Gondi  but  are  inherited 
from  an  earlier  common  source.  The  latter  method  suggests  that  all  the 
three  major  variants  viz.  vowel -ini tial ,  n-initial,  and  m-initial 
forms  may  be  reconstructed  for  Proto-Gondi ,  whereas  the  reconstructed 
forms  of  personal  pronouns  using  Gondi  data  suggest  that  all  variants 
excepting  vowel-initial  forms  are  innovations  within  Gondi  and  thus 
cannot  be  reconstructed  for  Proto-Gondi . 


What  is  important  is  that  the  'external'  evidence  suggests  that 
the  dialect  variation  in  Gondi  is  much  older.  The  presence  of  nasal- 
initial  forms  only  in  those  dialects  of  Gondi  which  are  geographically 
contiguous  with  other  South-central  Dravidian  languages  is  evidently 
an  indication  of  areal  duffusion  or  outside  influence.  The  method 
which  takes  into  consideration  of  only  Gondi  data  predicts  that  the 
diffusion  took  place  after  the  division  of  Gondi  from  its  immediate 
sister  languages,  while  the  method  which  uses  'external'  evidence 
assumes  that  the  development  of  initial  nasals  antedates  Gondi.  How- 
ever, the  reconstruction  by  the  latter  method  is  justified  by  the 
occurrence  of  the  phenomenon  of  vowel-initial  and  n-initial  pairs  in 
the  languages  of  South-Dravidian  group  (ascribed  to  Proto-South-South- 
central  Dravidian  (cf.  Krishnamurti  1976)).  The  development  of  m- 
initial  forms  of  the  first  and  second  plural  nominative  was  an 
innovation  traceable  to  Proto-South-central  Dravidian. 


NOTES 

A  preliminary  version  of  this  paper  entitled  'Analogy  and  dia- 
lect variation'  was  presented  at  the  1986  Linguistic  Society  of 
America  Annual  meeting  in  Seattle,  Washington.  I  am  most  grateful  to 
Bh.  Krishnamurti  for  helpful  suggestions  and  critical  comments  on  this 
paper.   I   am   indebted  to  Mark  Aronoff  and  S.N.   Sridhar   who   helped 
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immensely  in  improving  the  quality  of  the  paper.  I  would  like  to  thank 
Robert  Hoberman  and  H.S.  Ananthanarayana  for  their  comments  on  the 
earlier  version  of  this  paper. 

Gondi  data  are  given  in  phonemic  transcription.  Vowel  length  in 
non-initial  syllables  is  predictable,  however,  with  a  few  exceptions 
which  are  irrelevant  here:  Vowels  in  non-initial  syllables  are  long  in 
the  dialects  representing  the  pronominal  sets  I-VIII.  Word-final 
vowels  are  generally  short,  but  long  in  checked  final  syllables  in  the 
dialects  representing  the  pronominal  sets  IX  and  X.  Consonants  are 
geminated  after  short  vowels  in  initial  syllables  in  all  dialects. 

The  reverse  case  of  prothetic  nasals  is  attested  by  such  words 
as  nend-  <<*end-)  'to  dance'  (DEDR  895),  and  inam-~mom-  through  the 
intermediary  and  attested  forms  vom-~om-  (<*am-)  'to  sell'  (DEDR  186). 

Krishnamurti  (1976:139-153)  subgroups  the  Dravidian  languages 
as  follows: 

South  Dravidian  I:  Tamil  (Ta. ) ,  Malayalam  (Ha.),  Kodagu  (Kod. ) , 
Toda  (To.),  Kota  (Ko. ) ,  Kannada  (Ka. ) ,  and  Tuiu  (Tu.); 

South  Dravidian  II:  Telugu  (Te. ) ,  Gondi  (Go.),  Konda  (Kon.),  Kui 
(Ku.),  Kuwi  (Kw. ) ,  Pengo  (Pe.),  and  Manda  (Nan.); 

Central  Dravidian:  Kolami  (Kol.),  Naiki  (Nk. ) ,  Parji  (Pa.),  and 
Gadaba  (Ga. ) ; 

North  Dravidian:  Kurukh  (Kur. ) ,  Malto  (Hal.),  and  Brahui  (Br.). 

According  to  him,  one  of  the  three  clear  isoglosses  covering 
South  Dravidian  I  and  II  is  'The  analogical  formation  of  «nan/«nan- 
'I'  from  the  PDr.  inclusive  plural  »nam/»nam-  'we'  in  addition  to  the 
retention  of  PDr.  »yan/»yan-  '1'...'.  In  his  later  paper  (1978:2)  he 
renames  South  Dravidian  II  as  South-central  Dravidian,  and  South 
Dravidian  I  as  South  Dravidian. 

Krishnamurti  and  Subrahmanyam  (1968:203)  suggest  that  mi.ra^  was 
derived  from  mtr+a^  followed  by  shortening  of  the  vowel  in  the  initial 
syllable.  However,  there  is  no  proper  evidence  to  such  an  assumption, 
see  counter  examples:  s51  ■*■  ap  — >  /sSla^/,  Cse:la.rJ  'sister', 
suf  +  a^  — >  /suj-a^/,  Csu:ra.^]  'see  (pi.)!'. 
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DEVANAGART   WORD-PROCESSING  ON  THE    IBM-PC » 
Stephen  C  Helmrelch 


This  Is  an  Interim  report  on  the  development  of  a  word-processor 
that  makes  the  DevanagarT alphabet  available  on  the  IBM-PC.  This 
wond-processor  Is  particularly  geared  toward  use  of  that  writing  system 
with  the  Sanskrit  language.  To  that  end,  It  Incorporates  the  Sanskrit 
mies  of  sandhi  within  the  word-processing  framework.  The  word- 
processor  Is  also  aimed  towards  those  coming  to  Sanskrit  from  an 
English-speaking  background.  The  keyboard  Input  Is  closely  related  to 
the  common  Romanized  transcription  of  the  Devanagarf.  The  program 
also  provides  a  basis  for  exercises  to  assist  students  In  learning 
Devanagarr characters  and  the  rules  of  Sanskrit  sandhi. 


0.  INTRODUCTION.  The  adaptation  of  the  computer  from  Its  original  setting 
(mathematics  and  engineering)  to  use  by  those  In  the  humanities  has  been  somewhat 
slow.  For  those  working  In  languages  and  linguistics,  It  has  been  slower  still.  In  the 
latter  case,  one  of  the  main  stumbling  blocks  has  been  the  unavailability  of  characters 
(not  to  mention  complete  alphabets  and  syllabaries)  beyond  those  generally  In  use  for 
modem  European  languages  (that  Is,  beyond  the  Roman  alphabet). 

The  Apple  Macintosh  was  one  of  the  first  personal  computers  to  provide  the 
capability  of  multiple  fonts  that  could  be  fairly  easily  designed  for  Individual  use. 
IBM,  however,  has  provided  a  built-in  set  of  only  255  characters,  comprising  the 
basic  Roman  alphabet  along  with  a  few  letters  with  diacritics  (primarily  vowels  with 
accents  and  umlauts).  While  this  Is  adequate  for  English,  French,  German,  and 
Spanish,  It  Is  unusable  for  languages  written  with  other  alphabets  and  even  for  some 
requiring  just  a  few  additional  characters,  such  as,  for  example,  Hungarian  or  Swedish. 
Both  Apple  and  IBM,  In  addition,  are  equipped  only  with  the  traditional  keyboard 
layout.  Therefore,  even  If  additional  characters  are  available,  there  still  remains  the 
question  of  the  relationship  between  the  Roman  alphabet  keyboard  and  the  additional  or 
replacement  characters.  This  Is  a  particular  difficulty  for  Ideographic  writing  systems 
like  Chinese. 

It  was  to  assist  In  this  accommodation  of  computers  to  the  humanities  that  I 
designed  the  Sanskrit  word-processor  which  I  will  describe  In  this  brief  report.  First 
I  outline  some  of  the  difficulties  faced  In  computerizing  the  DevanSgarf  alphabet  in  a 
user-friendly  way.  Then  I  discusss  briefly  some  other  DevanagarT  word-processors. 
Finally,  I  describe  the  construction  and  use  of  both  the  font  Itself  and  the  word- 
processing  program. 

I.  PROBLEMS.  Providing  a  font  for  the  DevanagarT  writing  system  Is  not 
simply  a  matter  of  creating  an  alternate  set  of  characters  to  replace  the  Roman  alphabet 
generally  found  on  personal  computers.  There  are  three  major  problems  In 
Implementing  a  DevanagarT  word-processor  ( I )  the  large  number  of  compound 
characters;  (2)  the  context-dependent  forms  of  each  character;  and  (3)  the  non- 
llnearlty  of  the  graphic  symbols.  In  order  to  understand  these  difficulties  a  short 
Introduction  to  the  DevanSgarT writing  system  Is  In  order 


52 


Although  probably  related  distantly  to  the  Roman  alphabet,  the  DevanagarT 
writing  system  Is  In  fact  a  combination  of  syllabary  and  alphabet.  The  basic  unit  of  the 
writing  system  Is  called  an  aksara.  (This  term  will  be  used  Interchangeably  with  the 
word  character)  The  writing  system  Is  alphabetic  In  that  each  consonant  sound  has  a 
typical  character  shape  (which  Is  modified  according  to  context)  It  Is  syllabic, 
however.  In  that  the  basic  character  shape  for  each  consonant  sound  Includes  an 
Inherent'  schwa  or  a  sound.  Other  following  vowel  sounds  are  Indicated  by  other 
diacritics  around  the  aksara.  To  Indicate  that  no  vowel  follows,  a  special  diacritic  (the 
virama)  must  be  placed  under  the  aksara.  (Vowels  have  their  own  character  shape 
when  they  appear  Initially  or  In  Isolation.) 

The  basic  set  of  aksaras,  then,  consists  of  the  symbols  for  single  consonant 
sounds.  These  are  modified  by  various  diacritics  to  produce  the  representations  for  the 
I  I  basic  open  syllables  beginning  with  that  sound.  Thus  we  have  In  ( 1 )  the  character 
for  the  s  sound  combined  with  the  1 1  vowel  sounds,  (r  Is  a  vocalic  r). 

(i)^^ftT^^^^^^#^^ 

sa  sa  si  sr  su  su  se  sal  so  sau  sr  s 
The  Implications  of  the  "Inherent"  a  sound  In  each  aksara  for  the  representation 
of  consonant  clusters  Is  that  a  cluster  cannot  simply  be  written  by  Juxtaposing  the 
basic  asksaras,  since  this  would  be  read  with  an  Intervening  a  sound.  Instead,  to 
Indicate  the  cluster,  the  two  symbols  must  In  some  way  be  conflated.  Thus,  for 
example,  we  see  below  the  Devanagarf  equivalents  of  sa,  ta,  sata,  and  sta. 

(2)  ^  H  ^HH  ^ 

Thus,  the  writing  system  Is  syllabic  also  In  the  fact  that  each  consonant  cluster 
Is  represented  by  a  separate  aksara.  It  would,  of  course,  be  possible  to  use  the  virama 
to  write  'sta"  as  In  (3)  or  even  as  In  (4).  But  this  Is  not  the  normal  way  In  which  the 
syllabary  Is  used  and  would  be  regarded  as  highly  unusual  and  a  violation  of  the  basic 
principles  of  the  writing  system. 

(3)  ^ 

(4)  ^Tc{37 

There  Is  some  degree  of  regularity  In  the  formation  of  compound  aksaras.  Often 
the  consonants  are  conjoined  left-to-rlght  or  top-to-bottom.  Below  we  see  an  example 
of  each  type  of  conjoining. 

(5)  ^    ^    T^T  ^    ^    5 

sa    ma    sma  sa    tha  stha 

In  a  number  of  cases,  however,  there  Is  a  special  combination  form  for  the 
character,  seemingly  unrelated  to  the  basic  form.  And,  there  are  some  forms  which 
simply  must  be  memorized.  Occasionally,  too,  more  than  one  way  Is  available  to  write 
a  compound  character  Compare  the  examples  In  (6). 

(6)  T    ^      ^  oT    5(    5f 

ra  pra  mra  Ja  na  nna 
It  should  be  pointed  out  that  the  Devanagarf  writing  system  Is  not  syllabic  In 
the  phonological  sense.  An  aksara  consists,  with  few  exceptions,  of  a  series  of  Initial 
consonants  followed  by  a  vowel.  Thus,  the  aksaras  of  the  Devanagarf  syllabary  do  not 
correspond  to  the  syllables  of  the  Sanskrit  language,  or  really  of  any  other  language.  As 
with  most  languages,  Sanskrit  has  both  open  and  closed  syllables.  Thus,  for  example, 
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the  Sanskrit  phrase  which  consists  of  the  sequence  of  words  In  (7)  would  be  syllabified 
as  In  (8)  and  written  as  the  set  of  aksaras  In  (9). 

(7)  ksetresu  slktSbhlr  meghanam  adbhlr  dhanyarh  prarijdham 

by  the  water  which  drops  from  the  clouds  upon  the  fields  the  grain  grows 
tall 

(8)  kset  re  su  sik  ta  bhir  me  gha  na  mad  bhlr  dhan  yam  pra  ru  dham 

kse  tre  su  si    kta   bhi  rme  gha  na  ma  dbhl  rdha  nyarn  pra  rO  dha  m 

As  can  be  seen,  this  style  of  writing  for  the  most  part  Ignores  word  boundaries. 
The  consonant  ending  a  word  Is  combined  with  the  Initial  character  of  the  following 
word  to  create  a  compound  character  (If  It  Is  a  consonant),  or  the  following  vowel  Is 
placed  as  a  diacritic  on  the  preceding  consonant.  A  space  In  the  string  of  aksaras 
generally  occurs  only  when  a  word  boundary  and  an  aksara  boundary  correspond.  All 
aksaras  hang  down  from  a  base-line  which  carries  completely  through  the  line  of 
writing,  broken  only  as  Indicated  above,  or  where  there  Is  a  mark  of  punctuation.  There 
are  also  a  few  characters  which  require  breaking  the  base-line. 

This  connection  across  wort  boundaries  poses  some  difficulties  for  the  basic 
wort-processing  tasks  of  Inserting,  deleting,  and  moving  worts  or  other  segments  of 
text.  That  Is,  the  graphic  shape  of  the  wort  to  be  Inserted,  for  example,  will  vary 
depending  on  the  characters  Immediately  to  the  left  and  right  of  the  Insertion  point. 
These  characters  are  also  subject  to  graphemic  variation.  A  user-friendly  wort- 
processor  should  handle  these  variations  automatically,  Instead  of  requiring  the  user  to 
manually  alter  the  character  shapes.  These  tasks  are  made  doubly  difficult  when 
working  with  Sanskrit.  The  Sanskrit  language  has  an  extensive  set  of  rviles  of  sandhi 
which  apply  across  wort  boundaries.  These  consist  primarily  of  rules  of  assimilation 
and  vowel  coalescence.  Unfortunately,  the  result  of  these  mles  Is  not  simply  expressed 
In  speech  but  Is  reflected  also  In  the  written  fonns.  Thus,  for  example,  a  wort  final  a 
combines  with  a  following  Initial  I  to  produce  an  e.  To  show  the  difficulties  of  simply 
altering  wort  orter,  the  two  worts,  fire  and  here,  shown  In  their  Isolation  forms  In 
( 1 0)  would  appear  as  ( 1 1 )  In  one  order  and  as  ( 1 2)  with  reversed  orter 

(10)  3iP*it1     3T^ 

agnis  atra 

(11)  Srf^^R^f 

agnlratra 

(12)  3i5^iP»i: 

atragnlh 

There  are  a  few  additional  complications  to  the  DevanagarTalphabet. 

(a)  It  Is  not  completely  linear.  As  we  have  seen  in  examples  (5)  and  (6), 
certain  of  the  compound  characters  are  not  linearly  oriented.  But  beyond  this,  /  Is 
written  before  the  consonant  cluster  after  which  It  Is  pronounced.  Thus,  for  example. 
stt  would  be  written  as  In  (1 3). 

(13)  ftcT 
l-s-t 
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In  addition,  a  cluster-tnlttal  ris  written  as  a  hook,  placed  at  the  veaj  end  of  the 
aksara  which  It  precedes.  Thus,  kSrtsnya ,  entirety",  would  be  written  as  In  (1 4). 

(14)  ot7K^«-4 

k-a-tsnyr- 
(b)There  are  some  exceptions,  as  mentioned  above,  to  the  rule  that  each  aksara 
represents  an  open  syllable.  First,  a  word-final  nasal  consonant  Is  written  as  a  small 
dot  above  the  previous  aksara  (anusvara).  Thus,  "kam"  would  be  written  as  follows: 

(15)  ^ 

Also  there  are  certain  consonant  clusters  which  cannot  be  written  as  a  compound 
aksara  (generally  due  to  the  difficulty  of  combining  the  consonant  shapes).  In  such  a 
situation,  the  consonant  cluster  Is  written  as  a  sequence  of  two  simpler  aksaras,  the 
first  ending  with  the  vlrama  (In  order  to  eliminate  the  Inherent  vowel).  Such  Is  the 
case  for"ts'  as  In  "lltsu"  In  (16). 


(16) 


feZ^ 


(c)The  Devanagarf  writing  system  Is  almost  Inherently  proportionally-spaced. 
Although  the  "Ideal"  Is  for  each  aksara  to  take  up  the  same  amount  of  space,  lengthy 
consonant  clusters  will  require  more  room  than  single  characters.  This  has 
repercussions  even  for  such  seemingly  compositional  constructions  as  diacritic 
placement.  Diacritics  placed  under  an  aksara  should  appear  under  the  middle  of  that 
aksara.  With  characters  of  varying  width,  the  placement  of  the  diacritic  Is  not  exactly 
the  same.  The  font  used  for  the  examples  here  has  two  versions  of  most  diacritics.  The 
user  must  guess  which  version  should  be  used  with  which  characters.  (17) 
demonstrates  the  difference. 


(17)  ^^    ^^     ^^   "^Ir 


In  conclusion,  the  Devanagarf  writing  system,  as  In  most  things  connected  with 
human  language.  Is  a  complex  combination  of  predictable  generalizations  and 
Idiosyncratic  singularities.  The  particular  difficulties  that  this  writing  system  raises 
for  the  construction  of  a  word-processor  are  ( I )  how  to  deal  with  the  large  number  of 
(compound)  characters  in  terms  of  storage,  presentation,  and  keyboard  arrangement; 
and  (2)  how  to  deal  with  the  context-dependent  forms  of  the  characters  (conflation  and 
rules  of  sandhi  across  word-boundaries).  In  the  next  section,  some  earl ler  approaches 
to  these  problems  are  described.   Then  the  solution  chosen  here  Is  outlined. 

2.  SOLUTIONS  The  earliest  versions  of  a  Devanagarf  word-processor  were 
created  for  Macintosh  PCs.  They  required  the  user  to  construct  the  characters  as  they 
were  needed.  The  font  provided  all  the  "parts"  of  the  characters  (about  100)  that  were 
necessary  to  create  all  the  aksaras  of  the  Devanagarf  system  Most  consonants  existed 
In  both  a  solitary  and  a  combination  form.  Certain  (but  not  all)  compound  characters 
had  special  key-presses  Often  two  or  more  versions  of  the  diacritics  existed.  In  order 
to  allow  for  variable  placement  around  the  consonants.  It  can  be  readily  Imagined  that 
this  type  of  keyboard  arrangement  Is  not  particularly  easy  to  learn  or  to  work  with. 
'First,  there  Is  the  large  number  of  character  parts  that  must  be  associated  with  the 
correct  key-press  Then  there  is  the  difficulty  of  knowing  whether  or  not  there  Is  a 
special  character  available  or  whether  It  must  be  constructed  piece  by  piece.  Finally, 
the  problems  of  cutting  and  pasting  are  quite  difficult.  If  you  want  to  Insert  a  word.  It 
Is  often  necessary  to  retype  the  adjacent  words  as  well,  simply  to  get  the  correct 
compound  characters,  as  was  seen  In  examples  (I0)-(12).  This  type  of  word- 
processor  Is  now  available  for  IBM  PCs,  though  only  with  the  use  of  a  Hercules  card. 
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There  are  now  somewhat  more  sophJstlcated  versions  available  which  provide  a 
limited  amount  of  context-sensitive  processing.  For  example,  using  the  UBC 
DevanSgarT  f ont  (an  unsophisticated  processing  program  for  the  Macintosh)  to  type 
Sanskrit  su,  you  must  type  s  followed  by  a  To  type  Sanskrit  stu.  however,  you  must 
press  Options  (to  obtain  the  compound  form  of  s)  then  t  and  then  u.  This  Is  Illustrated 
In  example  (18).  Later  versions  would  determine  whether  the  key-press  following 
the  s  represented  a  consonant  or  a  vowel  and  place  on  the  screen  the  appropriate  form 
of  the  s  aksara. 

(18)  Result:     ^  ^  cT 

>•  >* 

KeyPress;  s    u  Optlon-s     t     u 

The  first  word-processors  for  the  IBM-PC  sidestepped  entirely  the  Issue  of 
screen  display  and  concentrated  on  providing  printing  capabilities  for  Devanagarf 
characters.  These  programs  created  print-characters  and  associated  them  with 
particular  key-presses.  On  screen,  only  ordinary  ASCII  characters  were  visible,  but 
the  file,  when  printed,  resulted  In  Devanagarf  characters.  Thus  what  appeared  on  the 
screen  had  no  correlation  to  what  appeared  on  the  printed  page.  Proofreading  and 
correcting  with  such  a  program  Is  time-consuming. 

The  present  proposal  attempts  to  provide  a  more  sophisticated  Interface  between 
keyboard  and  screen  display.  For  this  purpose,  the  standaro  Roman  transcription  for 
SanskrIt/DevanagarTwas  adopted  as  a  basis.  It  was  slightly  modified  so  that  only 
single  strokes  corresponded  to  single  characters.  Sanskrit  has  a  vocalic  system  of  9 
vowels,  a  1  u  r  I  (short),  and  3  T  u  r  (long).  There  are  also  four  diphthongs:  e  o 
(short),  a1  au  (long).  Consonants  occur  In  five  series,  corresponding  to  guttural, 
palatal,  lingual,  dental,  and  labial  points  of  articulation.  Each  series  has  five 
members,  consisting  of  a  voiceless  and  a  voiced  stop  (each  with  an  unsplrated  and  an 
aspirated  version)  and  a  nasal.  There  are  also  four  semi-vowels,  three  sibilants,  and 
an  "h".  Retrof  lex  consonants  are  Indicated  by  a  dot  beneath  the  corresponding  dental 
consonant  and  aspiration  Is  Indicated  by  a  following  h.  In  almost  all  cases,  then.  It  was 
possible  to  assign  a  Sanskrit  phoneme  to  a  single  Roman  character.  Length  In  vowels 
was  Indicated  by  capitalization,  as  was  aspiration.  Retroflexion  was  generally  Indicated 
by  the  Control-key.  In  the  few  cases  where  additional  assignments  were  necessary  the 
Alt-key  was  used.  The  next  example  shows  the  standart  transliteration  and  the  key- 
press assigned  by  the  word-processor. 

(1 9)  Transcription     Key-Press  Transcription      Key-Press 


Ctrl-r 


a 

a 

e 

e 

3 

A 

a1 

E 

1 

1 

0 

0 

T 

I 

au 

0 

u 

u 

r 

R 

u 

L 

t 

CI 

h 

H 

k 

k 

kh 

K 

g 

g 

gh 

G 

n 

Ctrl-g 

c 

c 

ch 

C 

J 

J 

Jh 

J 
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t 

Ctrl- 

■t 

th 

Alt-t 

CI 

Ctrl- 

■d 

dh 

Alt-d 

n 

ctn- 

-n 

t 

t 

th 

T 

d 

d 

dh 

D 

n 

n 

P 

P 

Ph 

P 

b 

b 

bh 

B 

m 

m 

1 

y 

1 

r 

V 

r 

V 

■     S  5  S  Ctrl-S 

s  s  h  h 

It  Is  not  entirely  clear  that  this  Is  the  best  possible  key-press  assignments.  It 
might  be  better  to  avoid  the  Control  and  Alt  keys,  which  are  Idiosyncratic  to  IBM  and 
IBM-clone  keyboards  In  favor  of  keys  that  are  on  all  regular  keyboards.  There  are  a 
number  of  letters  that  are  not  put  to  use  In  this  system  of  assignments:  x,z,q,w,  for 
Instance.  The  one  above  would  serve  even  better  If  Shift-Control  would  be  aspirated 
retrof  lex  consonants,  such  as  th  and  dh,  or  long  retrof  lex  vowels,  such  as  r 

In  any  case,  once  a  key-assignment  Is  made,  the  Sanskrit  Is  typed  In  word  by 
word,  leaving  a  space  between  words.  As  each  key  Is  pressed,  the  aksara  corresponding 
to  that  key  appears.  If  the  subsequent  key-press  creates  a  compound  aksara,  the  one  on 
the  screen  Is  erased  and  the  new  one  replaces  It.  This  Is  accomplished  by  means  of  a 
TurtoPASCAL  program,  which  collects  all  the  key-presses  and  evaluates  the  character 
to  be  printed.  Thus,  If  one  types  In  the  sequence  of  characters,  s-t-u,  the  following 
will  appear  on  the  screen  (In  the  same  location): 

(20)    ^ 

These  characters  are  formed  In  two  stages.  Using  a  font-editor  (see  figure  I  for 
a  screen  dump  of  the  font  editor)  the  parts  of  characters  needed  to  create  all  the 
compound  aksaras  are  created  and  saved  as  bit  patterns.  This  parallels  the  earlier 
word-processors  In  which  these  character-parts  were  associated  directly  with  key- 
presses. However,  In  this  case,  the  character^parts  have  been  pre-assembled  Into 
complete  characters,  or  rather  the  Instructions  for  creating  the  complete  characters 
have  been  stored  In  a  file.  The  TurtoPASCAL  program  then  associates  each  sequence  of 
legitimate  key-presses  with  the  record  In  the  file  that  has  stored  the  Instructions  for 
the  appropriate  character  Then  the  character  Is  placed  on  the  screen.  Figure  2  shows  a 
direct  screen  dump  of  the  basic  DevanagarT aksaras  as  they  would  appear  on  the  screen 
of  the  word-processor^it  Is  the  number  of  these  character  records  that  are  stored  as 
the  word-processing  file.  The  width  of  each  character  Is  also  stored  so  that  the  cursor 
can  be  moved  appropriately.  Thus,  for  example,  If  the  screen  showed  the  Sanskrit  word 
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Dlllllllllllllllllllllllllllll 

Illllllllllllllll 

Illlllllllll nil 

Illllllliillililllillllll 

lllillliili liilllii 

Illlillliillllliill Ill 

iiiimiiiiiiiimiiiiiiiiiiii 
iiiiiiii 
liiiiiii 

iiiiii     nil     mill 

iiiiii  mill     Illlllllllll 

mill     Illlllllllll 

Illlllllllll 

miimiiii 

II  II I  Illlllllllll 

mi  IIIIIIII  Illlllllllll 
mill  mill  Illlllllllll 

IIIIIIII  mill  nil Ill 

imiiiim iimimiim 

iiiimiiiiiiiiiiimiiiiiiiii 

II iimmiim in 

miiiimmmiim iiii 

imiimim imiimi 

I iiiiiiiiiiiiiiiiiiiiiii 

mil iiiimiii II 

iiiiiniiiiiiiiiiiiiiiiiiiiiii 
imiiiiiiiiiiiiiiiiiiiiiiiiii 
iiiiimiimmiimiiiiiiiii 
mmmimmmiiimiiiii 
miiimimiiiimiiimiiii 


Cupsop  pad  to  Move 
Ins  -to  tupn  on 
Del  -to  tupn  off 

C  -  choose  chapactep 
N  -  save  to  neNOpy 
S  -  save  to  disk 
L  -  load  a  chapactep 
M  -  Hipe  gpid  clean 
P  -  PPint  chapacteps 
T  -  test  wopd  ppocessop 
fllt-Fl  -  quit 

EDITING:  No.  1 


Figure  I 
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sp"  ^  T  "^  ^ 
=q'  ^  ^  ^  ST 

5  5^5^ 
?T  2T  ^  tr  ST 
q  i?^  ST  ^  JT 


Figure  2 
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In  (21 ),  then  the  file  containing  that  word  might  have  only  the  numbers  365,  355, 
282  representing  the  records  In  the  character  file  that  contain  the  Instrvictlons  for 
creating  the  three  aksaras  In  the  text 

(21)  ^TTHT 

>• 
Yamuna 
At  this  stage,  note  that  nothing  has  been  done  about  word-boundary  phenomena. 
The  space  between  words  prevents  the  PASCAL  program  from  combining  two  consonants 
on  either  side.  Nor  has  any  sandhl  been  performed.  The  user.  In  fact,  must  type  In  the 
underlying  form  of  the  word,  regardless  of  what  appears  to  the  left  or  right.  However, 
whenever  the  cursor  leaves  a  line,  that  line  Is  replaced  on  the  screen  by  a  revised  line. 
Using  another  TurtoPASCAL  program,  the  characters  are  re-evaluated  on  either  side  of 
a  word-boundary  (space).  If  they  can  be  combined  Into  a  compound  character,  that 
single  character  replaces  the  original  two  and  the  Intervening  space.  If  the  two 
characters  are  subject  to  sandhl,  the  correct  sandhl  form  replaces  them.  For  example, 
here  Is  a  line  of  Sanskrit  as  It  would  be  typed  In  by  the  user  (the  second  line  Is^he  list 
of  key  presses  and  the  third  Is  the  nomnal  transcription): 

(22)  37R?ffT  7T^  ^^^  ^TTT  ^fTT^t^T^TcRT  "Sf^ 

Asit         rAjA     nalas       nAma   vlrasenasutas    ball 
SsR         r^ja     nalas       n3ma   vTrasenasutas    balT 
When  the  line  Is  complete,  and  the  user  goes  on  to  the  next,  the  following 
appears: 

(23)  3TRtt^^  ^T^  ^rm  oftT^j^THc^  ^^ 

asTdraja  nalo   nama   vTrasenasuto    balf 

Of  course,  there  are  situations  where  one  might  want  to  block  the  application  of 
this  process  (there  are  certain  exceptions  to  the  rules  of  sandhl).  In  such  cases,  a 
special  (Invisible)  character  Inserted  will  block  the  application  of  this  program.  Also, 
It  Is  possible  (for  Instance,  If  one  Is  copying  previously  prepared  text),  that  one  may 
want  to  type  In  the  sandhled  form.  This,  too.  Is  possible,  though  the  program  Is  not 
capable  of  reversing  the  rules  of  sandhl  and  creating  the  original  line.  Correctly 
undoing  sandhl  Is  not  generally  possible  unless  you  know  enough  Sanskrit  to 
understand  It.  The  function  from  underlying  form  to  sandhl  form  Is  many-to-one.  For 
example,  the  sandhl  form  In  (24)  can  be  resolved  as  either  (25)  or  (26).  Probably 
an  on-line  dictionary  would  permit  one  to  undo  most  sandhl.  However,  even  this 
word-processor  Is  not  that  brilliant. 

(24)  ITopt^ 
ekona 

(25)  \T^:  ^ 

ekah   na 
one  man' 

(26)  l7of>  ;j7TT 

eka     una 

"one  less" 
It  Is  easy  to  see  that  In  the  present  word-processing  program  the  normal  cut- 
and-paste  operations  of  word-processing  are  much  simpler  Words  can  be  deleted. 
Inserted,  or  moved  as  In  most  standard  English-language  word-processors.  The 
adjustments  that  need  to  be  made  In  the  text  on  either  side  of  the  Insertion  or  deletion 
are  handled  automatically  by  the  word-processor.  For  example.  In  (27)-(30),  the 
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user  would  Initially  type  In  at  the  key-board  the  second  line  of  (27),  while  the  first 
line  of  (27)  would  appear  on  the  screen.  Upon  leaving  that  line  of  text,  the  screen 
display  would  change  to  that  shown  In  (28).  To  change  the  word  order,  the  user 
simply  returns  the  cursor  to  the  line,  whereupon  the  screen  display  for  that  line  would 
return  to  that  shown  In  (27).  Following  the  usual  word-processing  procedures,  the 
second  word  Is  cut  from  Its  current  position  and  then  pasted  In  front  of  the  remaining 
word  as  In  (29).  When  the  cursor  leaves  the  line,  then  display  then  shifts  to  that  of 
(30). 

(27)  3Tf^5TH     37^ 

agnts  atra 

(28)  3Tf^?R^ 

agniratra 

(29)  3T^     STI^^T^ 

atra        agnis 

(30)  3T^Tf^: 

atragnlh 

3.  INSTRUCTIONAL  USE.  The  programs  that  support  the  operation  of  the  word 
processor  can  also  be  used  to  teach  beginning  students  of  Sanskrit.  The  two  major 
programs  are.  as  discussed  above,  ( 1 )  the  program  which  relates  the  standard  Roman 
transcription  to  the  screen  presentation  of  the  appropriate  DevanagarT characters  and 
(2)  the  program  which  performs  the  mles  of  sandhl,  given  a  line  of  Sanskrit  text.  By 
simply  extracting  these  two  PASCAL  programs,  various  sets  of  exercises  can  be  easily 
created. 

These  exercises  can  be  used  tohelp  teach  the  novice  student  several  basic  skills 
In  learning  Sanskrit.  The  first  program  (constructing  the  characters)  Is  useful  In 
teaching  the  Roman  transcription  system.  By  practicing  associating  strings  of  key- 
presses with  Sanskrit  characters,  the  student  will  learn  to  recognize  and  produce  both 
simple  and  complex  aksaras. 

The  second  program  Is  useful  In  learning  the  mles  of  sandhl.  The  student  can 
learn  to  perform  sandhl  appropriately  and  also  to  "undo"  sandhl  and  recognize  the 
underlying  forms. 

4.  CONCLUSION.  Currently  the  word-processor  Is  not  yet  ready  for  practical 
use.  All  of  the  character-parts  are  available,  and  all  of  the  basic  aksaras  have  been 
programmed.  However,  not  all  the  compound  aksaras  are  yet  available.  This  means 
that  the  secondary  program  which  performs  sandhl  Is  not  yet  complete.  Finally,  many 
of  the  word-processing  functions  are  not  yet  functional.  Work  should,  however,  be 
completed  by  the  end  of  1987. 

In  conclusion,  the  work  on  this  word-processor  shows  some  of  the  difficulties 
associated  with  making  computer  tools  accessible  and  useful  for  students  In  the 
humanities,  particularly  language  and  linguistics. 
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M  would  particularly  like  to  thank  Hans  Henrlch  Hock  for  his  assistance  In 
thinking  through  the  basis  for  this  word-processing  program  I  would  also  like  to 
thank  Atsushl  Fukada  for  his  help  with  the  PASCAL  programming.  The  Language 
Learning  Laboratorxj  at  the  University  of  Illinois  has  been  gracious  in  allowing  me  time 
and  equipment  to  do  the  programming. 

^Due  to  the  requirements  of  this  Journal  (namely  that  the  final  document  be 
printed  on  a  laser  printer)  and  to  the  fact  that  the  word  processor  Is  not  Intended  to 
produce  output  that  can  be  printed  by  the  laser  printers  avalllable,  It  Is  not  possible  to 
show  actual  output  from  the  word-processor.  A  dot-matrix  printout  would  look 
somewhat  smoother  than  the  figures  shown.  The  other  Sanskrit  printouts  In  this  paper 
were  produced  on  a  Macintosh,  using  an  early  version  of  Ashok  Ahlujkars  U.B.C. 
DevanSgarT  font.  For  the  transliterations,  diacritics  of  Hans  Henrlch  Hocks  Genva- 
based  LIngam  font  were  employed. 


studies  in  the  Linguistic  Sciences 
Volume  17,  Number  1,  Spring  1987 


POGULI  SYNTAX  IN  THE  LIGHT  OF  KASHMIRI 
A  PRELIMINARY  REPORT* 


Peter  Edwin  Hook 
University  of  Michigan 


The  syntax  of  Pogul i ,  an  Indo-Aryan  lan- 
guage spoken  in  the  mountains  just  south  of  the 
Kashmir  valley,  is  examined  with  special  atten- 
tion to  the  features  that  distinguish  Kashmiri 
from  most  of  the  Indie  family.  Although  Pogul i 
does  not  have  the  anti-absolutive  of  Kashmiri, 
it  still  disallows  dative  case  for  personal  and 
other  [+human]  direct  objects  in  ergative  ten- 
ses, distinguishes  direct  from  inverse  transi- 
tions, and  has  a  system  of  pronominal  suffixes 
even  more  complex  than  that  of  Kashmiri.  Having 
both  verb-final  and  V-2  dialects,  Pogul i  is  a 
promising  target  for  further  field-work. 


Pogul i  is  an  Indo-Aryan  language  spoken  in  the  dis- 
trict of  Doda  in  Jammu  and  Kashmir  state.  It  is  spoken  in 
several  dialects  whose  speakers,  numbering  from  ten  to 
twenty  thousand,  live  in  the  area  known  as  Pogul -Par istan 
in  the  mountains  to  the  east  and  north  of  the  town  of 
Ramsu  on  the  National  Highway  that  runs  from  Jammu  via 
Banihal  to  Srinagar.  Poguli  has  no  written  tradition  and 
no  published  literature  other  than  some  folk  songs  prin- 
ted locally.  After  the  early  sketches  published  in  Bailey 
(1908)  and  Grierson  (1919),  it  has  received  little  atten- 
tion from  linguists.^ 

Poguli  is  of  special  interest  because  of  its  close 
relation  to  Kashmiri  (some  consider  it  to  be  a  dialect  of 
the  latter;  see  Koul  and  Schmidt  1984:10ff).  As  Kashmiri 
is  typological ly  aberrant  from  the  rest  of  Indo-Aryan, 
the  study  of  adjacent  neighbors,  eg.  Poguli,  Siraji,  Zun- 
dhari  (Grierson's  Rambani?),  Khashali,  Bhadrawahi,  etc., 
promises  to  shed  light  on  the  sequence  of  events  in  the 
history  of  Kashmiri  which  led  to  its  striking  divergence 
from  the  rest  of  Indo-Aryan.  While  the  data  presented 
here,  collected  during  a  brief  trip  to  Pogul,  should  be 
considered  preliminary  in  nature,  it  does  provide  a  new 
perspective  on  Kashmiri's  typology  and  suggest  not  only 
answers  but  also  questions  that  we  may  profitably  ask 
about  the  history  of  northern  Indo-Aryan  languages. 

I.  Word  Order .  All  informants  with  one  exception 
used  the  modified  SOV  word  order  that  is  typical  of  Paki- 
stan and  northern  India.   In  this  word  order   type,   non- 
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sentential  objects  precede  the  verb  (1)  while  sentential 
objects  (and  some  other  subordinate  clauses)  follow  the 
verb  they  are  dependent  on  (2): 

(1)  mi     ciThi    da-I     leyU^ 
me-DT  letter   give-CP  pour-IMp3 

'Give  me  the  letter.' 

(2)  aaU   bi    go-s  ki    ti      t&s  ciThi   da-I 
I-NM  fear  went  that  you-ER  him  letter  give-CP 
1& 

poured 

'I  was  afraid  you  might  have  given  him  the 
letter.' 

The  exception  was  a  non-native  from  Sialkot,  a  zamindar 
who  used  Pogul i  in  speaking  to  workers  on  his  lands.  His 
Poguli  had  the  V-2  order  typical  of  Kashmiri  (as  well  as 
a  more  Kashmiri-like  phonology):^ 

(3)  aaU   pyeen-mi-s     tu      tyeer 
I-NM  send-lsF-2sAC  you-NM  there 

'I  will  send  you  there." 

II.  Pronominal  suffixes.  Poguli  has  a  system  of  pro- 
nominal suffixes  that  is  more  complex  than  Kashmiri's  on 
the  following  counts:  (A)  Poguli  has  four  series;  cf  (4). 
Kashmiri  has  three;  cf  (5).  (B)  Poguli  has  a  suffix  for 
every  person-number  combination;  Kashmiri  lacks  one  for 
the  first  person  plural.  (C)  Poguli  distinguishes  gender 
in  its  pronominal  suffixing  system;  Kashmiri  does  not. 


(4) 

Ab 

solutive. 

Ergative 

Accusative. 

Dative 

Ism 

-us 

-m 

N.A.5 

-iti 

Isf 

-is 

-m 

N.A. 

-m 

2sm 

-us 

-t 

-s 

-t 

2sf 

-is 

-t 

-s 

-t 

3sin 

-u 

-ni 

-n 

-s 

3sf 

-i 

-nyE 

-n 

-sa : 

Ip 

-asam 

-n 

N.A. 

-n 

2p 

-ath 

-u/-v 

-u/-v 

-u/-v 

3  pm 

-ua : 

-ne 

-nan 

-nan 

3pf 

-ia 

-nye 

-ny&n 

-ny&n 

(5) 

Ab 

solutive 

Anti- 

absolutive. 

Dative. 

Is 

-s 

-m 

-m 

2s 

-kh 

-th 

-y 

3s 

-n  (or  - 

0) 

-n 

-s 

ip 

- 

- 

- 

2P 

-v(i) 

-v(i) 

-v(i) 

3p 

-kh  (or 

-0 

) 

-kh 

-kh 
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In  a  form  of  gender  agreement  that  is  not  easily  accom- 
modated on  the  Poguli  chart,  the  accusative  suffixes  are 
accompanied  by  a  vowel  alternation  in  the  preceding  stem. 
For  example,  the  stem  of  Le  'pour;  throw'  is  1&^  if  the 
direct  object  is  masculine  (6)  and  l_e  if  feminine  (7): 

(6)  y&  biit  aahati  ki  ter  tAA  aaU  tu  teru 
she  fear  was  that  then  by  I-NM  you-NM  there 
mAA  pen-i      l&-mi-s 

not  send-CP  pour-lsF-2sAC 

'She  was  afraid  I  might  send  you  (m)  there  by 
then. ' 

(7)  y&  biit  aahati  ki  ter  tAA  aaU  tu  teri  mAA  peni 
lemis 

•She  was  afraid  I  might  send  you  (f)  there  by 
then.' 

In  addition,  like  Gujarati,  Bhitrauti  and  other  NIA  lan- 
guages far  to  the  south^,  Poguli  shows  gender  and  number 
agreement  of  adverbs  with  the  direct  object  (note  teru 
(ms)  'there'  in  (6)  and  teri  (fs)  in  (7)).  However,  in 
contrast  to  what  we  find  in  Gujarati  and  its  neighbors, 
such  agreement  seems  to  be  possible  only  if  the  direct 
object  is  in  the  nominative  case.  If  the  direct  object  is 
in  the  dative,  the  adverb  assumes  the  default  masculine 
singular  form  (teru  in  (8)): 

(8)  y&  biit  aahlkti  ki  ter  tAA  tus  as-an 
she  fear  was  that  then  by  you-NMmp  us-DT 
ter-u    mAA  pen-i    leu-th 

there-ms  not  send-CP  pour-2pAB 
•She  was  afraid  that  you  might  send  us  there 
by  then.  • 

Comparison  with  Kashmiri  shows  that  Poguli  has  four  sets 
of  pronominal  suffixes  (to  Kashmiri •s  three)  because  in 
Kashmiri  the  functions  of  Poguli 's  ergative  and  accusa- 
tive sets  are  combined  in  a  single  "anti-absolutive"  set.* 

III.  Inverse  transitions.  Although  Poguli  does  not 
have  the  anti-absolutive,  it  does  maintain  the  distinc- 
tion of  direct  versus  inverse  transitions  found  in  Kash- 
miri (cf.  Hook  and  Kaul ,  ms.).  In  direct  transitions  a 
first  person  subject  acts  on  a  second  or  third  person 
direct  object  or  a  second  person  subject  acts  on  a  third 
person  direct  object.  In  these  situations  the  direct  ob- 
ject is  in  the  nominative  case  and  the  finite  verb  has  an 
accusative  suffix  showing  the  person,  number  (and  gender; 
see  section  II  above)  of  the  direct  object.  All  other 
distributions  of  animate  persons  into  subject  and  direct 
object  positions  are  inverse.  In  them  the  direct  object 
gets  the  dative  case  and  the  verb  is  (optionally)  marked 
with   a  dative  suffix  showing  only  the  person  and   number 
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(and,  for  third  persons,  gender)  of  the  direct  object 
(the  subject  must  be  thirdi  person  if  this  optional  dative 
suffix  is  to  occur).  Compare  (9)  and  (10): 

(9)  aaU    tu      teri      pen-i      le-mi-s 
I-NM   you-NM  there-fs  send-CP  pour-1 sF-2sAC 

'I  will  send  you  there.'   (direct  transition) 

(10)  saa  ti      teru      pen-i    lev-i  -(t) 
she  you-DT  there-ms  send-CP  pour-3s-2sDT 

'She  will  send  you  there.'  (inverse  transition) 

As  in  Kashmiri  (Hook  1985),  for  some  Poguli  speakers 
the  direct  object  of  a  dependent  infinitive  can  be  passed 
up  (or  "climb";  see  Napoli  1981)  to  the  finite  form  of 
the  matrix  verb,  even  if  the  matrix  is  intransitive: 

(11)  aaU  tu     baal-nyi    e-mi-s 

I    you-NM  see-INF  come-lsF-2sAC 
'I  will  come  to  see  you.'  (direct  transition) 

IV.  "Absolutive  absolute".  As  first  noted  by  Varma 
(1938:45),  in  Kashmiri,  Shina,  Bhadrawahi,  Khashali  and  a 
few  other  northern  Indo-Aryan  languages,  direct  objects 
in  ergative  tenses  must  get  the  nominative  case,  even  if 
they  are  first  or  second  persons.  Contrast  Poguli  (12) 
and  Kashmiri  (13)  with  Marathi  (14)  and  Hindi  (15): 

(12)  yiis       ti      aaU   teru    pen-t-us 
yesterday  you-ER  I-NM  there  send-P-lsAB 

(13)  raath     suuzu-th  -as    bi    tsye    toor 
yesterday  sent-2sAA-lsAB  I-NM  you-ER  there 

(14)  kaal       tu     ma-laa  tikaDa  paaThavi-1-a-s 
yesterday  you-ER  me-DT   there    send-P-n-2sER 

(15)  kal        tuu-ne  mujh-ko  vahAA  bhej-aa 
yesterday  you-ER  me-DT    there  send-PST 

'Yesterday  you  sent  me  there.' 

The  inadmissibility  of  the  dative  case  for  direct  objects 
in  ergative  tenses  is  a  major  syntactic  isogloss  that  now 
separates  the  Indo-Aryan  languages  of  the  northwestern 
mountains  from  those  of  the  rest  of  South  Asia. 

V.  Co-occurrence  of  pronominal  suffixes.  In  Poguli 
there  are  restrictions  on  which  sets  of  suffixes  may  co- 
occur  with  which.  For  instance,  accusative  suffixes  can- 
not occur  with  dative  suffixes.  Attested  combinations  in- 
clude: 1.  Ergative  (first  and  second  person)  with  dative 
(all  persons);  cf.  (16).  2.  Absolutive  with  dative;  cf. 
(17).   3.  Ergative  with  absolutive;  cf.  (18): 

(16)  baalTi  mAA  pAAI     le-i   lo      chu-m-(i)saa 
bucket  in   water  pour-CP  poured  is-lsER-3sDTf 

'I  have  poured  water  in  the  bucket  for  her.' 
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(17)  teni   aaU    aan-t-us   -nan  -e 
he-ER  I-NM  bring-P-lsAB-3pDT-PRF? 

'He  has  brought  me  for  them.' 

(18)  ti   aaU    aan  -t-u   -t   -(u)s-e     t&s&    kice 
you  I-NM  bring-P-ms-2sER-lsAB-PRF?  her-DT  sake 

'You  have  brought  me  for  her  sake.' 

The  range  of  permitted  combinations  seems  not  to  be  so 
rich  as  in  Kashmiri,  where,  for  example,  sequences  of 
three  suffixes  may  occur: 

(19)  bi   suuzu-n   -as   -ay 
I-NM  sent-3sAA-lsAB-2sDT 

'He  sent  me  to  you  (or  for  your  sake).' 
(Kashmiri  AA  corresponds  here  to  Poguli  ER) 

Furthermore,  in  contrast  to  Kashmiri,  the  order  of 
ergative  and  absolutive  suffixes  is  not  fixed.'  Normally 
the  absolutive  suffix  precedes  the  ergative  (This  is  the 
opposite  of  the  Kashmiri  order): 

(20)  ti    aaU  aan(u)-t-us    -it  'You  brought  me.' 
you  I-NM  bring  -P-lsAB-2sER         (Poguli) 

(21)  tsye   on(u)-th   -as    bi  'You  brought  me.' 
you  brought-2sAA-lsAB  I-NM         (Kashmiri) 
(Kashmiri  AA  corresponds  here  to  Poguli  ER) 

The  order  seen  in  (18)  (ergative  suffix  before  absolu- 
tive), a  marked  order  in  Poguli,  expresses  the  speaker's 
annoyance.  Given  this  and  the  fact  that  the  Kashmiri  or- 
der at  times  forces  discontinuity  in  the  registering  of 
the  direct  object's  gender  and  number  and  its  person  (the 
form  on  (u)  in  (21)  agrees  with  the  direct  object  in  gen- 
der and  number  (masculine  singular)  while  the  suffix  -as 
marks  it  as  the  first  person),  it  is  probable  that  Poguli 
preserves  the  original  order  and  that  Kashmiri  has  inno- 
vated. A  consequence  of  this  innovation  (in  Kashmiri)  is 
that  since  the  first  suffix  now  has  to  be  subject;  and 
the  second  suffix,  direct  object  (in  all  tenses  and  no 
matter  what  the  shape  of  the  suffix) ;  the  accusative  and 
the  ergative  suffixes  could  merge  into  a  single  set  (the 
"anti-absolutive")  without  creating  ambiguity.  (It  is 
also  possible  that  the  merger  occurred  first,  and  had  as 
a  consequence  the  fixing  of  the  order  "subject  first,  di- 
rect object  second"  as  the  order  for  Kashmiri  suffixes  in 
all  tenses.)  Perhaps  a  relic  of  such  a  putative  earlier 
order  is  to  be  seen  in  the  double  marking  of  the  direct 
object  before  and  after  the  subject  suffix  that  is  occa- 
sionally found  in  the  Kashmiri  present  perfect: 

(22)  bi    chu  -s  -(a)n  -as  -ay   suuz-mut 
I-NM  is-lsAB-2sAA-lsAB-2sDT  sent-PP 

'She  has  sent  me  to  you.' 


VI.  Conclusions.  The  Poguli  data  show  that  the  dis- 
tinction between  direct  and  inverse  transitions  does  not 
depend  on  the  presence  of  anti-absolutivi ty .  Since  data 
from  Pashto  (Hook  1985)  show  that  ant i-absolut i vi ty  does 
not  depend  on  the  distinction  of  direct  and  inverse  tran- 
sitions either,  we  may  conclude  that  these  two  features, 
although  both  present  in  Kashmiri,  are  not  typolog ical ly 
linked.  Poguli  data  provides  further  support  for  the 
linking  of  the  presence  of  absolutive  pronominal  suffixes 
with  the  absence  of  dative  marking  of  the  personal  direct 
object  pronouns  in  ergative  tenses  in  Indo-Aryan.  (There 
is  corroborating  evidence  for  this  typological  link  from 
Old  Marathi  where  the  second  person  singular  suffix  -si 
was  used  absolutively  with  the  coref erential  object  pro- 
noun put  in  the  nominative.  In  modern  Marathi  -£  is  used 
agential ly  while  second  person  direct  objects  must  be  put 
in  the  dative  case.) 10 

Detailed  investigation  and  comparison  of  the  prono- 
minal suffixing  systems  in  Poguli,  Kashmiri,  and  neigh- 
boring languages  promises  a  solution  to  some  of  the 
questions  that  we  are  just  beginning  to  formulate  about 
the  origin  and  development  of  this  characteristic  feature 
of  northwest  Indo-Aryan  languages. 

Postscript.  Protected  somewhat  by  their  relative  in- 
accessibility, mountain  valley  languages  like  Poguli  have 
been  able  to  preserve  their  particular  characteristics 
even  though  spoken  by  very  small  numbers  of  people.  Fifty 
years  ago  in  a  neighboring  mountain  area  Siddheshwar 
Varma  discovered  and  described  languages  and  dialects 
spoken  in  as  few  as  five  households.  Now,  however,  the 
relative  inaccessibility  of  places  like  Pogul-Par istan  is 
fast  disappearing  .  A  "Matadorable"  road  has  been  opened 
into  the  valley  from  Makarkot  (on  the  National  Highway) 
to  Ukherhal  .  Last  summer  a  Tourist  Bungalow,  complete 
with  caukidar  and  kitchen,  was  set  to  open  at  Batroo  and 
another  one,  further  up  the  valley,  is  under  construction 
for  the  pleasure  of  skiers  and  hikers.  We  cannot  expect 
the  coming  generations  in  the  area  to  go  on  maintaining 
it  as  a  living  museum  of  languages.  It  is  much  easier  to 
do  field  work  there  now  than  in  Grierson's  time.  But 
this  very  ease  of  access  puts  the  information  preserved 
in  "lesser"  languages  at  greater  risk.  There  is  much  to 
be  done  and  little  time  left  to  do  it. 

NOTES 

*  The  research  reported  on  here  was  carried  out  in  July, 
1985  during  a  summer  of  fieldwork  primarily  devoted  to 
research  on  Kashmiri  syntax.  I  am  grateful  to  the  Ameri- 
can Institute  of  Indian  Studies  for  its  continued  sup- 
port  and   to  the  residents  of  Pogul  who  gave   freely   of 


their  time  and  hospitality,  particularly  to  Mr.  Rashpal 
Singh  of  Iniyard  and  Mr.  Milap  Singh  of  Koonchigam,  and 
to  their  families  and  friends. 

^Recently  Ruth  Schmidt  and  Vijay  Kaul  have  begun  to 
examine  the  phonology  of  Poguli  as  part  of  an  historical- 
comparative  study  of  northern  Indo-Aryan  languages. 

^The  transcription  used  here  (which  I  cannot  claim 
to  be  phonemic)  is  similar  to  that  found  in  many  descrip- 
tions of  New  Indo-Aryan  languages  except  that  I  use  doub- 
ling rather  than  macron  or  colon  to  show  vocalic  length 
and  caps  to  show  nasal  vowels.  The  symbol  £  denotes  a 
front  vowel  more  open  than  e.  In  Poguli  open  vowels  fol- 
lowing /h/  have  noticeably  lower  tone:  see  sentences  (6- 
8).  I  have  marked  this  with  a  grave  accent.  The  phonetic 
value  of  i_  and  e  when  used  for  Poguli  (and  other  Indie 
languages  cited  in  this  paper)  differs  markedly  from  i^ 
and  e  when  used  to  represent  Kashmiri  (see  note  6). 

^Abbreviations  used  in  this  paper  include: 

AA anti-absolutive    IMP imperative 

AB absolutive   m masculine 

AC accusative    n neuter 

CP.  .conjunctive  participle    NM nominative 

DT dative   p plural 

ER ergative   PP past  participle 

F future    PRF perfect 

f feminine   P{ST) past 

^In  particular  his  /ee/  showed  a  noticeable  centering 
glide  and  strongly  palatalized  preceding  consonants.  This 
individual   lived  in  Banihal  town,   some  distance  to   the 
north   of   the  Ukherhal-Makarkot  area  where  most   of   the 
data  reported  here  were  collected. 

5n.A.  ('not  applicable')  because  accusative  suffixes 
occur  only  in  direct  transitions  and  first  person  direct 
objects  are  excluded  from  them  by  definition.  See  section 
III,  'Inverse  transitions'. 

^The  transcription  used  here  for  Kashmiri  is  the  one 
developed  by  Kenneth  Hill  in  his  classes  in  linguistic 
field  methods  at  the  University  of  Michigan  in  1984.  It 
is  a  phonemic  transcription  in  which  palatalization  of 
consonants  is  indicated  by  /y/,  retroflexion  of  apical 
stops  by  upper  case  T  and  D,  and  nasality  in  vowels  by 
capitalization.  Single  /e/  is  a  central  mid  vowel  unless 
it  is  preceded  by  /y/  and  followed  by  a  word  boundary  or 
palatal  (ized)  consonant.  Then  it  is  fronted.  The  vowel 
/ee/  is  long  central  mid  except  when  fronted  by  a  preced- 
ing and  a  following  /y/.  The  vowels  /i/  and  /ii/  are  high 
central  unrounded  except  when  fronted  by  a  preceding  /y/. 
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The  long  vowels  /oo/  and  /uu/,  the  front  allophone  of 
/ee/,  and  the  non-front  allophone  of  /ii/  have  a  notice- 
able glide  toward  mid-central  position.  Only  the  front 
allophones  of  /i/  and  /ii/  may  follow  the  palatal  frica- 
tive /S/  and  the  palatal  affricates  /c/,  /ch/,  and  /j/. 


east  Rajasthan,  an  adverb  agreeing  in  gender  and  number 
with  the  direct  object  (and  with  the  subject  when  there 
is  no  direct  object)  is  used  to  express  perfective  as- 
pect. See  Hook  1982  and  Hook  and  Chauhan  1986. 

^Briefly  stated,  Kashmiri  uses  the  same  set  of  suf- 
fixes for  subjects  in  ergative  tenses  that  it  uses  for 
direct  objects  in  non-ergative  tenses.  For  examples: 

(a)  yemyi  muraadi  kheetri  tsye  sooz-oo-th-as      bi 
which  purpose  for    you-ER  send-PST-2sAA-lsAB  I-NM 

'The  purpose  for  which  you  sent  me...'  (G  790) 

(b)  bi  kar-inaav-ath   tas  saahyibas  siity  mulaakath 

I   do-CAUSE-2sAA  that  gentleman  with  introduction 
'Shall  I  introduce  you  to  that  gentleman?'  (G  985) 

In  (a)  the  suffix  -th  expresses  the  subject;  in  (b) ,  the 
same  suffix  -th  expresses  the  direct  object.  Since  the 
distribution  of  -t]i  with  respect  to  tense  is  the  comple- 
ment of  the  distribution  of  the  second  person  singular 
absolutive  suffix  -kh,  I  call  it  ANTI-absolutive .  For 
further  discussion  see  Hook  and  Koul  1984,  Hook  1984  and 
Hook  1985.  (G  plus  number  in  parentheses  identifies  the 
examples  from  Grierson  1911,  vol  1,  part  2.) 

^Grierson  (1919:427)  reports  that  dative  suffixes  in 
Poguli  are  also  able  to  precede  or  follow  ergative  ones. 

l^Note  the  nominative  second  person  singular  direct 
object  and  the  absolutive  use  of  the  second  person  sin- 
gular pronominal  suffix  in  the  following  (as  cited  by 
Master  (1964:130)  from  the  late  thirteenth  century  Sri- 
krsnacaritra  47.8  and  discussed  in  Hock,  ms) : 

(a)  ambe,   tU   maajheaa  vairiaa  kaaraNE  dukhav-al-i i-si 
mother  you-NM   my   enemy    reason   torment-P-f-2sAB 

'Mother,  you  had  to  suffer  on  account  of  my  enemy.' 

Contrast  the  construction  in  (a)  with  the  modern  Marathi 
in  which  personal  pronouns  must  get  the  dative  and  the 
second  singular  pronominal  suffix  is  used  agentially: 

(b)  ambe,    tu    ma-laa  aataa  khup   dukhav-al-a-s 
mother,  you-ER  me-DT   now  much  torment-P-n-2sER 

'Mother,  now  you  made  me  suffer  a  lot.' 
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IMPACT  OF  EXPANDING  DOMAINS  OF  USE  ON  A  STANDARD  LANGUAGE: 
CONTEMPORARY  HINDI  IN  INDIAl 

Yamuna  Kachru 


Discussions  of  Hindi,  including  linguistic  descriptions, 
treat  it  as  a  monolithic  entity.  This  paper  points  out  that 
Hindi  is  not  the  name  of  a  lanauaae.  rather,  it  is  an  appellation 
used  for  a  group  of  Qeneticallv  related  lanquaaes/dialects  of 
North  India^ Modern  Standard  Hindi  (MSH,  hereafter)  shows  effects 
of  prolonged  contact  with  several  Indian  and  non-Indian  languages. 
In  addition,  the  demands  of  its  (new)  roles  as  an  official  and  a 
pan-Indian  link  language  are  having  their  impact  on  MSH.  The 
paper  describes  the  impact  of  the  following  circumstances  on  MSH: 
(a)  the  use  of  MSH  as  an  official  language  in  the  central  as  well 
as  several  state  governments;  (b)  its  use  in  the  media;  and  (c) 
its  use  as  a  medium  of  expressing  literary  creativity  in  several 
parts  of  India.  As  a  result  of  these  uses,  contemporary  MSH  is 
exhibiting  the  following  trends:  (a)  increased  Sanskritization  in 
the  lexicon,  but  Englishization  in  the  syntax;  (b)  increased  use 
of  colloquial  Eastern  and  Western  varieites  and  switching/mixing 
with  regional  languages/dialects  in  several  genres;  and  (c)  the 
emergence  of  a  classical  'diglossic'  (Ferguson  1959)  situation. 
Implications  of  these  developments  for  linguistic  and  sociolin- 
guistic  research,  and  for  the  use  of  MSH  as  a  medium  of  education 
are  discussed  toward  the  end  of  the  paper. 


1.0   INTRODUCTION 

Even  a  cursory  look  at  the  history  of  Hindi  literature  makes  it  clear 
that  Hindi  is  not  used  as  the  name  of  a  language;  rather,  it  is  used  as 
an  appellation  for  a  group  of  dialects  and  languages  (e.g.  Avadhi,  Braj, 
Hindvi,  KhaRi  Boli,  Rajasthani,  Urdu)  which  have  been  or  are  now  function- 
ing as  media  of  literary  creativity  in  the  region  known  as  Madhyadesh.^ 
Besides,  Hindi  has  been  in  contact  with  other  languages  --  both  Indian 
and  foreign  --  for  centuries.  These  include  the  languages  of  the  New 
Indo-Aryan  group,  Sanskrit,  Persian  and  Arabic  (mainly  via  Urdu),  and  Eng- 
lish. Several  of  the  characteristics  of  the  language  known  as  Modern 
Standard  Hindi  derive  from  this  heritage.  For  instance,  the  phonology  of 
MSH  is  influenced  by  Perso-Arabic  and  Sanskrit,  and  more  recently,  by 
English.  Its  morphology  and  lexicon  have  been  enriched  by  the  same  sources. 
In  syntax,  MSH  has  developed  several  alternative  constructions  in  many 
areas,  adapting  selected  patterns  of  all  the  languages  mentioned  above. 
The  literary  conventions  of  MSH  have  been  borrowed  and  assimilated  from 
Sanskrit,  Persian  (via  Urdu),  and  English  (in  the  beginning  via  Bengali 
and  Marathi).  Several  folk  traditions  of  the  areas  surrounding  the 
Madhyadesh  have  found  their  echo  in  the  literary  traditions  of  MSH.3 
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This  trend  of  assimilating  linguistic  and  other  features  of  languages 
with  which  Hindi   is  in  contact  continues  even  now,   but  with  a  difference. 
The  difference  is  due  to  the  changed  and  changing  sociocultural ,   economic 
and  political   situation  of  the  Hindi-speaking  area.     Some  of  the  factors 
that  have  had  an  impact  on  Hindi   language  and  literature  are  literacy, 
education,   socio-economic  mobility  due  to  jobs  in  the  government  and  indus- 
try,  industrialization  of  the  region,   development  of  media  including  news- 
papers,  journals,  magazines,   aakaashvaaNii    (All    India  Radio)   and  doordarshan, 
(Indian  Television),   and  general   political   awareness  generated  by  elections. ^ 
One  can  add  to  this  list,   I   am  sure.     My  main  aim  is  not  to  identify  each 
and  every  factor  that  has  had  some  role  in  the  contemporary  developments  in 
Hindi   language  and  literature.     I  would  like  to  confine  myself  to  four  obser- 
vations:   (a)   the  use  of  Hindi   for  legislative  and  administrative  purposes 
has  led  to  an  increased  Sanskritization  of  the  lexicon,   but  Englishization 
of  the  syntax;    (b)   the  use  of  Hindi    in  the  media  (i.e.   newspapers,   aakaash- 
vaaNii,  and  doordarshan)   shows  the  pull   of  four  conflicting  or  complementary 
tendencies  --  Sanskritization,   Englishization,   increased  use  of  colloquial 
Eastern  and  Western  varieties,   and  switching  and  mixing  with  regional   lan- 
guages such  as  Bengali,  Marathi,   and  Punjabi;    (c)   the  literary  language  used 
in  criticism,  memoirs,   and  fiction  exhibits  the  same  trends  as  in   (a)   and   (b); 
and   (d)   there  is  a  unique   'diglossic'    situation  developing  in  the  Hindi 
area  which  has  attracted  very  little  attention  from  educationists  and  scholars 
of  the  region. 

2.0     OFFICIAL  HINDI 

I  would  like  to  present  data  from  a  variety  of  sources  to  substantiate 
my  remarks.     First,    I  would  like  to  present  some  data  from  the  legislative 
and  administrative  registers  to  show  that  the  language  used  in  these  con- 
texts borrows  heavily  from  Sanskrit  and  also  reflects  the  transfer  effects 
in  translation  from  English.     Consider  the  following  excerpts  from  the 
Lok  Sabha  (Indian  Parliament:   Lower  House)  Bulletins:^ 

la.     sankhyaa  971 
NuTber  971 

sarkaarii  kaary  ke  liye  sanay    kaa  niyatan 
Allocation  of  time  for  Government  Business 

lok  sabhaa  aaj  shrii  gulaam  nabii  ajaad  dvaaraa  pesh  kiye  gaye 
Lok  Sabha    today  Sri  Ghulam  Nabi    Azad    by  moved 

ek  prastaav  par  sahmat  huii  ki  antarraa.1yik  jal  vivaad 
a     motion    on    agreed       that  Inter-State  Water  Disputes 

(sanshodhan)  vidheyak,  1986,  [raajya  sabhaa  dvaaraa  yathaapaarit,] 
(Amendnent)    Bill  1986     Rajya    Sabha     by        as  passed 

par  vicaar  tathaa  paarit  kame  hetu^  3  ghaNTe  kaa  samay 
on  consideration  and  passing     for    3  hours    of    time 

niyat  kiyaa  jaae. 
allocation  be  done 
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On  a  motion  moved  by  Mr.  Ghulan  Nabi  Azad,  Lok  Sabha  today  agreed  to 
the  allocation  of  3  hours  for  the  consideration  and  passing  of  the 
Inter-State  Water  Disputes  (Anendnent)  Bill,  1986,  as  passed  by  Rajya 
Sabha. 

(From  Lok  Sabha:  Bulletin  - 
part  II  (General  Infonnation 
relating  to  parliamentary  and 
other  matters),  Monday,  March 
24,  1986/Chaitra  3,  1908  (Saka):  1) 

The  separation  of  the  postposition  ^ar   'on'   from  its  object  NP  in  the 
underlined  part  of  la.   provides  an  instance  of  the  effect  of  English  syn- 
tax.    The  order  of  the  phrases  in  the  sentence  also  points  to  the  same 
factor.     According  to  the  rules  of  Hindi   syntax,   the  sentence  should  have 
been  as  in  lb.     Compare  the  position  of  the  square-bracketed  item  in  la. 
with  the  position  of  the  same  iten  in  relation  to  the  items  underlined  in  lb.  to  see  the  difference. 

lb.     lok  sabhaa  aaj  shrii  gulaam  nabii  ajaad  dvaaraa  pesh  kiye  gaye  ek 
prastaav  par  sahmat  huii  ki  [raajya  sabhaa  dvaaraa  yathaapaarit] 
antarraajyik  jal  vivaad  (sanshodhan)  vidheyak,  1986,  par  vicaar 
kame  tatnaa  use  paarit  kame  hetu  3  qhaNTe  kaa  samav  nivat  kiyaa 
jaae. 

The  same  phenomenon  is  exhibited  by  the  texts  in  2  and  3: 

2.       kyaa  sarkaar     ne  caaluu       vittiiy      varS    sahit  gat  tiin 
Q.  government    ag.  current    financial    year   with    past  three 

varSS'ke  dauraan  bainko'me'baRhtii       Dakaitiyffkii       or 
years  of  during    banks    in  increasing  dacoities  toward 

dhyaan       diyaa    hai  jiske  pariNaamsvaruup    bhaarii 
attention  given    has  which  of  result  as        heavy 

dhan  kii    haani  aur  aam      jantaa    ko  pareshaani  huii        hai; 
financial  loss    and  ccmrai  people    to  harassment  happened  has 

...v^iether  the  goverment  have  taken  note  of  increasing  dacoities 
(robberies)  in  the  banks  resulting  in  huge  financial  losses  and 
harassment  to  the  public  during  the  past  three  years  including  the 
current  financial  year;... 

(From  Lok  Sabha:  List  of 
Questions  for  Written 
Answers,  Wednesday,  March  21, 
1986/  Phalgun  30,  1907) 
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The  underlined  modifier  bhaarii  and  the  head  noun  haani  'loss'  have  been 
separated  following  literal  translation  word  order  of  'heavy  financial 
loss'  of  English;  the  order  should  have  been  dhan  kii  bhaarii  haani  in 
Hindi.  The  awkwardness  of  the  following  passage  can  be  traced  to  the  same 
source  --  translation  from  English  without  any  consideration  of  the  rules 
of  Hindi  syntax : 

3a.   ...caaval  aur  gehO'u'kaa  adhiktam  utpaadan  kamevaale  paac 
rice  and  wheat  of  meKimLm  production   doer   five 

raajyo  ke  kramshah   naam  kyaa  haf  aur  unkaa  kramshah 
states  of  respectively  name  n*iat  are  and  their  respective 

utpaadan  kitnaa  hai... 
pnxluction  how  much  is 

...the  name  of  the  first  five  states  producing  ma^imLin  quantity  of  rice 
and  wheat  respectively  and  their  respective  quantities... 

(Same  as  2:  Q.  no.  3751) 

The  way  the  Hindi   question  is  phrased,   the  first  kramshah   'respectively' 
applies  to  the  names  of  states,   and  consequently,   the  second  instance  of 
the  word  yields  the  same  reading.     In  order  to  bring  the   'rice'   and   'wheat' 
in  the  scope  of   'respectively',   the  Hindi   sentence  should  be  phrased  as  in 
3b.: 

3b.      ...kramshah  caaval  aur  geh'Qir'kaa  adhiktam  utpaadan  karnevaale  paSc  raajyo 
ke  naam  kyaa  hafaur  unkaa  utpaadan  kramshah  kitnaa  hai... 

There  are  only  three  items  of  Perso-Arabic  origin  in  these  excerpts, 
pesh  karnaa   'to  move'    in  1,   ke  dauraan   'during'   and  pareshaanii    'har- 
assment'   in  2.     All   the  content  words  except  these  three  items   in  la.,   2  and 
3a.   are  from  Sanskrit. 

The  same  phenomenon  of  Englishized  syntax   and  Sanskrit  content  vocab- 
ulary can  be  observed  in  the  language  of  Hindi   newspapers.     Consider  the 
excerpts  in  4a  and  b: 

4a.      chailenjar  ke  visphoT     se     amerikii  antarikS  kaarykram  ko  lage       dhakke 
Challenger  of  a  plosion  from  American  space       program     to  suffered  setback 

se  ubame  kii  kSamtaa  ke  pariikSaN  ke  taur  par  kal        DelTaa  raaket  kii  uRaan 
from  emerge  of  capacity  of  test     of  as  yesterday  Delta  rocket  of  flight 

ko  dekhaa  jaa     rahaa  thaa. 
DO  seen     being  was 

The  flight  of  the  Delta  rocket  yesterday  was  being  seen  as  a  test  of  the 
American  space  program's  capability  to  atricate  itself  from  the  setback 
suffered  as  a  result  of  the  aplosion  of  Challenger. 

(jansattaa.  May  5,  '86:  2) 
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b.   ...nividaa  ke  kaagjaat  kaam  ke  kisii  bhii  din  ru.  100/  kii  adaaygii... 

The  tender  dociments  (can  be  had)  on  any  working  day  on  paynent  of 
Rs.  100/-... 

(Same  as  a.:  2) 

The  tortuous  syntax  of  4a.  and  the  use  of  dekhaa  jaa  rahaa  thaa  'was 
being  seen'  are  influenced  by  English.  Even  more  noteworthy  is  the  use 
of  ru.100/-  in  the  tender  notice  reproduced  in  part  in  4b.  According 
to  the  convention  of  Hindi,  it  should  have  been  100/-ru.  Again,  the 
news  item  is  dominated  by  Sanskrit  vocabulary.  The  same  is  true  of  the 
tender  notices  published  in  Hindi  newspapers. 

As  regards  administrative  Hindi,  the  latest  Administrative  English- 
Hindi  dictionary  (Paliwal  1986)  makes  it  clear  that  heavy  Sanskritization 
of  the  lexicon  of  administration  still  continues. 

3.0  HINDI  IN  THE  MEDIA 

The  language  of  doordarshan  presents  a  mixed  picture.  The  news 
broadcast  is  not  as  heavily  Sanskritized  as  that  of  aakaashvaaNii .  For 
instance,  the  news  report  of  the  bomb  attack  on  the  venue  of  the  seven 
nation  economic  summit  in  Tokyo  had  the  sentence  in  5,  and  the  weather 
bulletin  is  usually  introduced  as  in  6.  Items  underlined  in  5  and  6 
are  from  Perso-Arabic  sources. 

5.  isse  koii  nuksaan  hone  kii  xabar  nahiT  milii  hai. 
this  by  any  damage  happening  of  news  not  obtained  ia 

There  is  no  news  of  any  damage  from  this  (attack). 

(May  4,  1986) 

6.  aur    ab   pesh    hai    mausam   kii    jaankaarii. 
and  now  present  is  weather   of    knowledge 

And  now  the  weather  news  is  presented. 

On  doordarshan,   several   of  the  serials   (comparable  to  prime-time 
soap  operas  of  network  TV  in  the  USA)   are  written,  produced,  directed, 
and  acted  by  non-Hindi  speakers,  and  several   are  adaptations  of  well- 
known  works  by  national   and  international  writers  in  different  languages. 
In  these  programs,   there  is  heavy  influence  of  English  as  well   as  reg- 
ional  languages  such  as  Bengali,  Marathi,  and  Punjabi.     The  following 
examples  show  the  English  influence: 

7.  anu,  tun*  bahaut  acchii  laRkii  ho,  aise    hii    rahnaa. 
Anu   you  a    very   good     girl      are    like  this    renain 

Anu,  you  are  a  very  good  girl,  remain  like  this. 
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8.  merii  vaaif  ko  phon  lagaaiye 
my   wife  Do  phone  connect 

Please  put  through  my  call  to  my  wife. 

9.  cuuki  ab  anuubhav  1e  cukii   huu... 
since  now  aperience  take  ccnpleted  have 

Since  now  I  have  had  aperience... 

10.  yah    faann  par  rahnaa    saortic    turtiaare  ko  TTiiik  nahiT  baiThtaa. 
this  farm   on    living    really     you        to  right  now     sits. 

This  living  on  the  farm  does  not  sit  well  with  you. 

11 .  kyaa  tun  in  projekT  mg'  invaalv  honaa  caahoge? 
Would  you  like  to  get  involved  in  these  projects? 

(Episode  51,  Khaandaan, 19-3-86) 

In  7,  following  the  requirements  of  English  grammar,   an  indefinite  article 
ek  has  been  used  in  the  predicate  nominal,  which  violates  the  grammatical 
rules  of  Hindi.     In  8,   the  Hindi    idiom  should  have  been  X  se  phon  milaanaa 
'to  establish  telephonic  connection  with  X'.     Similarly,   the  idiom  is 
anubhav  praapt  karnaa   'to  gain  experience',  which  would  have  been  approp- 
riate  in  9.    ...tumhgraas  nahTf  aataa  is  the  expected  idiom  for  10.     The 
sentences  and  fragments  in  8-10  seem  to  be  results  of  translation  from 
English,   and  possibly  Marathi.     In  11,   there  is  the  usual  mixing  with 
English  which  is  a  characteristic  of  educated  Hindi    (or  Indian)   speakers. 

4.0     HINDI  AS  A  LITERARY  LANGUAGE 

In  literary  criticism,   Sanskritization  is  the  norm.     The  following  is 
by  no  means  an  exception: 

12.     yah    shiST  saipradaay   prabhutvashaal  i  i      varg    kaa    vah    buddhijiivi 
this  civil  comfnunity     vested  with  authority  class  of  that  intellectual 

bhaag  hai    jo     saahityik    maanyataao*  ke    dvaaraa  puure  samaaj    par 
part    is  \4iich    literary     values  through      Oiole    society   on 

us    varg    ki  i  vicaardhaaraa    kaa  prabhutva  kaayam  kartaa  hai . 
that  class  of  ideology  of    authority  maintains 

...isse   yah    niSkarS  svabhaavtah    nikaltaa  hai    ki    saahityik 
this    from   this  conclusion  naturally  emerges  that  literary 

maanyataao  aur  saahity    kii    paribhaaSaao'ke    viruddh  sangharS 
values        and  literature  of  definitions        against    struggle 

vastutah    ek  kaThin    vicaardhaaraatmak    sangharS  hai . 
in  fact     a  difficult  ideological  struggle    is 
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This  ccmnunity  of  cultured  citizens  is  that  intellectual  part  of  the 
ruling  class  which  maintains  the  authority  of  its  ideology  over  the 
entire  society —  The  conclusion  is  natural  that  any  struggle  against 
literary  values  and  definitions  is  in  fact  a  difficult  ideological 
struggle. 

(Sing»i  1983:  105) 

All  the  underscored  items  are  from  Sanskrit  or  adapted  from  Sanskrit 
and  one  not  schooled  in  this  kind  of  'High  Hindi'  will  find  it  difficult 
to  process  such  texts. 

Contrast  12  with  the  excerpt  in  13,  which  is  from  the  genre  of  mem- 
oirs and  takes  the  bilingual  competence  in  English  for  granted: 

13.   kaalej  se  lauTte  hue  kuch  der  plaazaa  mg'baiThe  rahe.  aaj 
College  from  returning  a  v*iile   Plaza  in  seated  remained  today 

tabiiyat  kaafii  xush  thii  aur  man  halkaa  thaa.  vahaST 
mind   very  happy  was  and  heart  light  was  there 

se  uTh  kar  ghar  cale  aaye.  amnaa  ko  davaaii  laa  dii. 
from  raising  home  came  away  mother  to  medicine  bring  gave 

khaanaa  khaane  ke  baad  do  gha^f^e  patr  likhe.  gyaan  ko 
meal  eating  after  two  hours  letters  wrote  Gyan  to 

ek  tej  patr  likhaa  thaa,  par  use  posT  na  kar  duusraa 
a  sharp  letter  written  had  but  that  mail  not  doing  another 

posT  kar  diyaa.  mafne  kabhii  nahiTsocaa  thaa  ki  yah 
mail  do  give  I  ag.  ever  not  thought  had  that  this 

aacinii  is  kadar  'miiPiaakar'  ho  saktaa  hai.  ...vah  vyakti 
man  this  way   mediocre   be  can        he  person 

nahTf  ek  Taaip  hai  —  aur  us  Taaip  ki  apnii  siimaae   hJf'. 
not  a  type  is   and  that  type  of  own  limitations  are 

(We)  sat  for  awhile  in  the  plaza  on  our  way  back  from  the 
college...!  had  written  a  sharp  letter  to  Gyan,  but  I  did  not 
mail  it,  instead,  I  mailed  another  letter.  I  had  never  thought 
that  the  man  could  be  so  mediocre.  ...He  is  not  an  individual, 
he  is  a  type,  and  the  type  has  its  om  limitations. 

(Rakesh  1985:  46) 

This,  of  course,  is  the  ordinary  idiom  of  colloquial  standard  Hindi 
spoken  all  over  the  Hindi  area  by  educated  Hindi  speakers. °  Such  mix- 
ing with  English  is  common  in  almost  all  Indian  languages. 

The  colloquial  Eastern  and  Western  varieties  are  different  on  all 
linguistic  levels.  In  an  earlier  paper,  I  have  discussed  the  syntactic 
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variation  in  the  Eastern  variety  as  compared  to  MSH   (Kachru  1982).     The 
phonological   and  lexical   variations  are  well-known  and  partially  docu- 
mented in  Hindi   grammars.     Till   recently,  the  Eastern  variety  was  not 
used  in  creative  literature,  or  in  the  media.     It  seems  that  the  stigma 
attached  to  the  Eastern  variety  is  fading  and   it   is  being  used  increasingly 
in  writing  as  well   as  on  the  doordarshan.     An  example  of  the  use  of  Eastern 
variety  from  fiction  is  presented  in  14: 

14a.     ...dekhne  kaa    ciiz  j_     laRkii  dekhbe     nahTi'  kartii  hai... 
seeing  of    thing    this  girl    see  enph.  not     does 

This  girl  does  not  see  things  worth  seeing,    (see  =  notice) 

b.      ...pataa  nahTtkaise  j_   laRkii  itne  dabe  rang    kii  hai.    dono' 
known    not     how     this  girl    such  dark  color  of    is       both 

baRii  to    Thiik       hii       thtf,      unke    bakhat    harrko     itnii    pareshaanii 
older  emph.  alright  enph.  were       their  time     us  to  so   much    harassment 

bhii  nahti  huii.      vo     bicaarii    log    baap  ko    kabhii 
also  not    happened  those  poor        ones  father  to  ever 

pareshaan  bhii  nahi'f  kii.    magar  _i_     laRkii    to     baap     kii  pareshaani 
harassment  also  not    did     but    this    girl      enph.  father    of  worries 

kabhii  samajhtii    hii    nahTf.  are,    jab    tuttiaarii  suurat-shakl 
ever    understand    erph.  not    acl.    when  your  face-appearance 

nahn,  turtiaare  baap     ke  paas  paisaa  nahiY,  naukrii 
not      your       father        loc.  money     not     job 

nahff,  to    tur(ko    khud    koshish  kamaa  caahiye     ki    nahff. 
not     then  yoti  to  self    effort     do       desirable  or    not 

kisii  kaalej    me'ghus    jaao    to    han^co    bhii    ni^ 
any     college  in  enter  go    then    us  to    also    face 

rahegaa    kahTT   baat    calaane     ko. 
remain  will  somewhere  topic  raise  to 

Who  knows  how  this  girl  came  to  be  so  dark.    The  two  older  ones  were 
alright.    I  was  not  harassed  so  much  at  their  time  (i.e.  at  the  time 
of  their  marriage).    Those  poor  girls  never  harassed  me.    But  this 
girl  never  realizes  her  father's  worries.    Now,  if  you  don't  have 
looks,  your  father  does  not  have  money,  does  not  have  a  job,  then 
shouldn't  you  make  an  effort  yourself?    Get  into  a  college  job,  then 
I  will  have  the  guts  to  start  negotiations  somayhere. 

(Mishra  1986:  34) 

The  following  features  in  14a  and  14b  identify  the  text  as  written  in 
the  Eastern  variety:  lack  of  agreement  between  kaa  and  ciiz,  j_  for 
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yah,  and  the  construction  dekhbe  nahff  kartii  hai  for  dekhtii  hii  nahTf 
hai  {14a);  i  for  yah  as  the  demonstrative,  bakhat"  for  vagt  'time';  vo 
bicaarii  ...  nahTT  kii  instead  of  un  bicaariyo  ne  . . .  nahIT  kiyaa,  i.e. 
a  nominative  instead  of  an  ergative  construction,  and  the  use  of  log 
'people'  to  mark  plurality  in  the  noun;  lack  of  agreement  in  koshish 
karnaa,  and  the  construction  with  dative  instead  of  genitive  in  hamko  bhii 
mtlh  rahegaa. 

On  the  one  hand  the  Eastern  variety  shows  substratum  influence,  on 
the  other,  it  is  becoming  highly  Sanskritized  as  a  result  of  the  impact 
of  the  official  language.  An  example  of  the  latter  phenomenon  is  seen 
in  15: 

15.   ...kaahe  nahTt"  hogaa  iskaa.  ...asal  m^hambhor  Tahalne 
why  not  happen  will  her   reality  in  I  dawn  walk 

gaye  the,  tabhii  hamko  bhii tar  kii  sab  baat  pataa  calii. 
done  had  then  enph.  I  to  inner     all  matter  known  became 

...ekmaarc  se  nayaa  adh^adKh  aanevaalaa  hai.  uske 
first  march  frcm  new  ordinance  about  to  come  is  it  of 

anusaar  praadiyaapako  kii  niyuktiyaa* bhi i  likhit  pariikSaa 
according  professors   of  appointment  also  written  ecamination 

ke  aadiaar  par  hongii.  maane  khulii  pratiyjgitaa 
on  basis  on  be  will  meaning  open  competition 

hii  samjho.  ...  yah  hai  raajniiti  kaa  aadnii.  ab  yah  khud 
aiph.  understand  this  is  politics  of  man   now  he  himself 

cunaav  laRnaa  caahtaa  hai.  uske  liye  caahiye  paisaa.  isiiliye 
election  fic^t  wants      that  for  need   money  that's  why 

jo  hai  rikt  sthaano  kaa  vigyaapan  nikalvaa  diyaa  hai. 
wh-  is  vacancies   of  advertisanents  published    has 

...kaafii  paisaa  banaa  legaa   yah 
much  money  make  take  will  he 

Why  won't  she  get  it  (the  job)?  Actually,  I  went  for  a  walk  at  dawn;  it 
is  then  that  I  came  to  know  the  real  facts.  A  new  ordinance  is  going  to 
be  promulgated  on  March  1st.  According  to  that,  even  the  professors  will 
be  appointed  on  the  basis  of  written  tests.  That  means  open  conpetition. 
This  one  (the  Vice  Chancellor)  is  a  man  of  politics.  ...Now,  he  himself 
wants  to  fi^t  the  election.  For  that  he  needs  money.  That  is  v^y  he 
has  had  the  vacancies  advertised.  He  will  make  a  lot  of  money. 

(Mishra  1986:  35) 

In  15,  there  are  three  instances  of  variety-specific  usage,  namely, 
kaahe  nahlY  hogaa,  hamko... sab  baat  pataa  calii,  and  isliye  jo  hai ,  the 
rest  of  the  underscored  items  are  examples  of  Sanskritization  which  char- 


82 


acterizes   'officialese'    in  the  Hindi   area. 

This  mixture  of  extreme  Sanskritization  and  Eastern  variety-specific 
features  may  sound  peculiar  to  those  used  to  MSH;   however,   it  is  a  reality 
of  the  Eastern  regions  of  the  Hindi-speaking  area. 

Some  examples  of  Western  colloquialisms  are  the  use  of  the  ergative 
marker  in  the  obligative  construction,   as  in  usne  abhii   jaanaa  hai   instead 
of  usko  abhii   jaanaa  hai    '(S)he  has  to  go  just  now',   the  use  of  the  posses- 
sive form  as  a  general   oblique  form  of  the  personal   pronouns  as  in  mere 
ko  pataa  nahTT  thaa  instead  of  mujhe  pataa  nah'TT  thaa   'I  didn't  know', 
and  regularization  of  certain  irregular  forms  of  verbs  such  as  karaa/karii, 
etc.    instead  of  kiyaa/kii,    'did'    (m./f.)  etc.     These  are  becoming  common 
in  written  as  well   as  spoken  forms  of  the  Western  variety,   and  one  hears 
them  regularly  on  doordarshan  programs. 

Another  trend  which  strikes  one  in  reading  fiction,  memoirs,   and 
sketches  is  that  of  mixing  regional   languages  such  as  Bengali,  Marathi, 
and  Punjabi  with  Hindi,   often  without  any  explanation.     Examples  of  such 
mixing  are  given  in  16,   17  and  18: 

16.  'saccii  baat     kahe  jii    ki    aaj    ke  din    jidhar       'takk'    maaro 
true     matter     say        that  today  of  day  wherever    glance    hit 

bas  saahab    hii  saahab.    aur  kuRiyaa,  kaakiya^bhii  misaa    aur 
Sahab    enph.  Sahab     and  young      girls       also  misses  and 

mariaa'      banii       ghuuntii      hau  ..." 
Mem  Sahab  becoming  go    around 

The  truth  is  that  wherever  you  look  today,  all  you  see 
are  Sahab  (British  gentlemen).  And  young  girls  are  also 
going  round  pretending  to  be  (British)  Misses  and  Ladies. 

(Sobti  1977:  60) 

In  this  Punjabi-mixed  passage,  the  single  quotes  surrounding  the  word 
takk  'glance'  alert  the  reader  that  it  is  a  borrowed  item,  but  no  such 
warning  is  given  with  kuRiyaa  and  kaakiyef^  'young  girls'.  Bilingual  com- 
petence in  Punjabi  and  Hindi  is  taken  for  granted  in  such  texts. 

The  example  in  17,  by  an  author  from  the  Eastern  region,  shows  mix- 
ing with  Bengali : 

17.  tiisre    pahar,      faNaa  kii  kaakii  (aamaader  na'  kakii-maa)  dauRtii 
late     afternoon  Fana's       aunt       our  Na'  aunt  running 


hSaphtii  aayii 
panting  came 


"kii  khabar?    1 atk i aa  baaRi i te  nei i ! 


What  news        Latika    home    in  not 

Matikaa  se     nahlY,  tunse       kaam   hai.  ...faNaa,  maane  malay 
Latika   with  not      you  with  job     is  Fana,  i.e.    Malay 
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ke  pulise  aresT  koreche. . .  haa,  kolkaataa  theke  pulisa-  lok 
to  police  arrest  done  have  yes  Calcutta  from  police  of  people 

esechilo.  ...  haa,  haathkaRaa  haate  diye,   katiore  doRii-rassii  me 
cone  had    yes  handcuffs  hand  in  giving  waist  ropes     in 

bSScJh  kar,  saRak  par  Tahlaate  hue,  haajat  ni^  le  gaiyaa.  ... 
tying    road  on  walking    lock  ip  in  take  went 

hSa,  sirf  kitaab  ke  liye!  ghar  kii  puurii  khaanaatalaashii  — 
yes  just  book   for   house  of  ccnplete  search 

tann  tann  karke  kaagaj-pattar,  TaaipraaiTar,  baii-sab  niye  gelo!" 
thoroughly    papers  letters  typewriter   books  all  take  went 

"keno?  keno??" 
why  why 

"Kii  jaanii!  aaj  subah  ko  jamaanat  par  chuuTkar  aayaa  hai. 
vi*iat  Know  today  morning  bail    on  free   cone  has 

kal,    kalkattaa  calaa  jaaegaa!" 
tomorrow  Calcutta  move  go  will 

Late  afternoon,  Fana's  aunt  (the  one  we  called  Aunt  Na')  came  running 
and  panting.  ... 

"What  news?  Latika  is  not  hone." 

"I  want  to  see  you,  not  Latika.  ...Fana,  i.e.  Malay  has  been 
arrested  by  the  police,  ...Yes,  the  policemen  had  come  from 
Calcutta.  ...Yes,  they  took  him  to  the  lock-ip,  handcuffed 
with  a  rope,  marching  him  on  the  streets —  Yes,  just  for 
the  book!  The  house  was  searched,  thoroughly,  books,  papers, 
typewriter,  they  took  everything!" 

"Why?  Why?" 

"Who  knows!  He  came  back  this  morning,  free  on  bail.  He  is 
leaving  for  Calcutta  tonrorrow." 

(Renu  1984:  102) 

In  the  above  excerpt,  there  are  some  passages  which  are  totally  in 
Bengali,  namely,  the  second  and  third  underscored  items.  Others  are  a 
mixture  of  Bengali  and  Hindi.  Those  not  familiar  with  Bengali  will  have 
a  hard  time  interpreting  the  text. 
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In  18,   there  is  mixing  with  Marathi: 

18a.     ...usii    mulge  ne  puuchaa  hai  —  "kasaa    vaaTto  re?" 

the  same  boy     ag.  asked    has        how      feels     hey  you 

"mii  ta     kharokharac  aaggaaRii  jhaalaa." 
I    srph.  really  train        became 

The  same  boy  asked,  "Hey,  you!  How  does  it  feel?" 

"I  have  really  become  a  train." 

(Bakshi  1984:  14) 

akkaa  baRii  kanjuus  hai. 
now  I  this  csay  can    that  mother  very  miserly  is 

...unhohe  ek  pishvii  banvaa  rakhii  hai  laal  rang  kii. 
she  ag.   a  bag   make  cause  kept  has  red  color  of 

aNaa  ek  baar  DyuuTii   se  lauTe    to   ek  laal  jhaNDii  le 
father  once  duty  frcm  returned  then    a   red  flag   take 

aaye  the.  bahut  dino  tak   to  ham  us  jhaNDii  kaa  upyog 
come  had  many  days  till  errph.  we  that   flag    of  use 

rel-rel  khelne  me'karte  rahe,  phir  ek  din  akkaa  ko  kuch 
train  playing  in  doing  kept  then  one  day  mother  to  something 

suujhaa  to  uskii  pishvii  sii  Oaalli 
occurred  then  its  bag    sew  poured 

Now  I  can  say  that  Mother  is  very  miserly —  She  has  sewn  a  red 
bag.  Once,  when  Father  returned  from  his  job,  he  brought  a  red 
flag.  For  a  long  time,  we  used  it  to  play  train,  then  one  day, 
something  occurred  to  Mother  and  she  made  a  bag  out  of  it.... 

(Bakshi  1984:  14) 

In  18a.,  the  use  of  Marathi  in  the  direct  quotes  is  perhaps  for 
characterization,  i.e.,  it  is  used  to  identify  the  two  characters  explicit- 
ly as  Marathi  speakers.  But  the  excerpt  in  18b.  is  from  the  narrative 
part  of  the  text.  There  is  no  lexical  gap  in  Hindi  which  is  filled  with 
items  such  as  pishvii  'bag',  even  if  one  grants  the  use  of  the  terms  of 
address  for  'father'  and  'mother'  for  locating  the  text  in  its  proper 
geographical/regional  context. 

This  trend  of  mixing  with  regional  languages  is  noticeable  in 
doordarshan  advertisements  also.  Products  are  shown  and  remarks  about 
then  are  made  by  people  from  various  regions  in  their  own  languages  be- 
fore the  concluding  remark  or  jingle  is  broadcast  either  in  Hindi  or 
English. 


5.0  MOTIVATIONS  FOR  MIXING 

One  can  speculate  on  the  reasons  for  these  kinds  of  mixing.  It 
seems  to  me  that  in  literature,  the  mixing  with  regional  dialects,  var- 
ieties, and  New  Indo-Aryan  languages  is  motivated  by  the  fact  that  authors 
from  various  regions  are  asserting  their  regional  identities.  There  does 
not  seem  to  be  any  inhibition  in  mixing  freely  items  from  various  sources, 
there  is  no  self-consciousness,  no  apprehension  of  'pollution'  that  may 
plague  the  purist.  In  the  earlier  periods  of  Hindi  literature,  such  mix- 
ing was  confined  to  the  regional  dialects  within  the  Hindi  area;  now,  with 
the  consciousness  of  the  role  of  Hindi  as  a  national  official  as  well  as 
link  language,  writers  from  various  regions  are,  as  it  were,  contributing 
toward  developing  a  national  character  of  the  language.  Whether  this  trend 
is  healthy,  or  desirable,  is  another  question.  Most  literary  scholars  and 
linguists  I  had  the  occasion  to  discuss  these  questions  with  agreed  that 
the  trend  seems  to  be  inevitable,  and  in  the  absence  of  any  authoritative 
agency  that  could  'impose'  a  uniform  'standard',  it  is  difficult  to  see 
how  this  could  be  blocked.  In  fact,  there  does  not  seem  any  need  to  block 
any  development  at  this  stage;  these  recent  developments  may  result  in 
enriching  the  language. 

The  doordarshan  advertisements  and  programs,  of  course,  are  motivated 
by  quite  a  different  factor.  The  audience  of  the  TV  programs  is  nation- 
wide, hence,  the  language  has  to  be  flexible.  In  popular  programs  as  well 
as  public  opinion  surveys,  impromptu  interviews,  advertisements,  etc.,  the 
language  is  not  of  primary  concern.  The  way  people  speak  in  language- 
contact  situations  is  found  natural  and  acceptable,  hence,  there  is  no  attempt 
to  edit  their  language.  This,  of  course,  is  true  of  TV  advertising,  inter- 
views, etc.,  in  native  English-speaking  contexts  of  the  U.S.A.,  too.  In 
addition,  the  various  'accents'  or  languages  in  U.S.  advertisements  may 
be  associated  with  various  products  (e.g.  Italian  with  pasta  or  French  with 
perfume);  in  the  Indian  context,  mixing  with  various  Indian  languages  is 
intended  to  show  the  nationwide  popularity  and  appeal. 

6.0  QUESTION  OF  STANDARDIZATION 

The  trends  identified  in  this  paper  raise  some  very  important  ques- 
tions for  standardization  which  are  of  concern  to  policy  makers  and  edu- 
cationists. First,  both  Sanskritization  and  Englishization  have  created 
a  situation  such  that  the  official  language  used  in  administration  and 
laws,  and  to  a  lesser  extent  in  newspapers,  literature,  aakaashvaaNii , 
and  doordarshan  is  markedly  different  from  the  colloquial  standard.  This 
is  creating  a  type  of  'diglossic'  situation  in  the  Hindi  area.  Second, 
the  emergence  of  marked  differences  between  the  Eastern  and  Western  var- 
ieties is  bound,  sooner  or  later,  to  lead  to  a  situation  where  problems 
of  standardization  will  have  to  be  tackled.  For  instance,  certain  Western- 
isms  have  been  accepted  as  legitimate  variations  to  the  standard  at  least 
in  some  texts  used  in  schools,  as  is  clear  from  the  excerpt  in  20:^ 
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20.  ...tabhii    gurujii    ne    hani§'    bataayaa    ki    tun  ne    baRe    ho  kar 
then  arph.  teacher  erg.  us        told       that   you  erg.  big    be  CP 

netaa,    vakiil,    DaakTar,      injiintyar    aadi    bannaa    hai  to 
leader    lawyer     doctor        engineer      etc.    become    is    then 

tun  jo  bhii    banaa     cahte  ho    vah  abhii  se   apne   man 
you  whatever  become   want        that  now   from  self's  mind 

me  dhaaraN  kar  lo    aur  usii  ke     anusaar       jii-jaan     se 
in  hold       do  take  and  that  enph.  according  to  life      with 

parishran  karo 
labor        do 

Then  the  teacher  told  us  that  if  we  wanted  to  become  leaders,  lawyers, 
doctors,  engineers,  etc.,  when  we  grew  up,  then  we  should  keep  that  in 
mind  from  now  on  and  work  hard  accordingly. 

The  underlined  clause  in  20  is  acceptable  in  Western  colloquial   Hindi, 
though  in  MSH,   the  obligative  construction  requires  a  dative  ko,   as  in 

tum  ko  baRe  ho  kar  bannaa  hai.     The  same  text  also  contains  the 

following  with  an  obligative  construction  that  conforms  to  the  MSH  norm: 

21.  yadi  vidyaarthiyo  ko    jiivan  me  mahaan  vyakti       bannaa    hai,    to 
if     students        to    life     in   great  individual  become  is     then 

gnhg    '^cchj^  vidYJtarthii'    byinaa  hoqaa. 
them   to  good  student  become  be  will 

If  students  wish  to  become  great  men  in  their  lives,  they  will  have 
to  become  'good  students'. 

The  question  arises,   if  the  obligative  construction  in  20  is  accept- 
able as  a  legitimate  variant,   is  the  nominative  construction  ham  kitaab 
paRh  liye  h^T,   common  in  the  Eastern  variety,   equally  acceptable  as  a 
variant  of  the  MSH  ergative  construction  mlT  ne  kitaab  paRh  lii   hai    'I 
have  read  the  book'?     Of  course,   such  acceptance  of  variation  will   have 
profound  implications  for  the  nature  of  MSH.     The  debate  on  these  questions 
has  not  begun  yet.°     In  view  of  the  wide  impact  of  media,   especially 
doordarshan,   however,   some  discussion  of  these  issues  seems  to  be  necessary. 

7.0     CONCLUSION 

Increased  use  of  Hindi   in  media  and  literature,   its  expansion  to 
domains  such  as  administration  and  law,   and  the  demands  placed  on  it  as  a 
'link'    language  in  the  multilingual   context  of  India  have  all   contributed 
to  several   related  developments  that  have  resulted  in  changing  the  char- 
acter of  Hindi.     Whereas  Hindi,   since  its  history  began,   exhibited  the 
tendency  to   'mix'   regional   dialects  and  Sanskrit  for  certain  purposes, 
mixing  with  the  New  Indo-Aryan  languages,  Arabic-Persian,   and  English  is 
changing  the  structure  of  the  language.     The  development  of  distinct 
varieties  such  as  the  Eastern  and  Western  are  posing  other  problems 
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related  to  standardization,  educational  medium,  etc. 

For  the  linguist,  the  challenges  are  obvious.  The  first  is  that 
of  description.  What  tools  do  we  have  to  describe  a  Hindi  which  is  not 
a  language  but  a  well-knit  group  of  codes  with  domain-specific  as  well 
as  wider  uses?  Depending  upon  the  domains  of  use,  some  of  the  codes  are 
fairly  homogenous,  e.g.  colloquial  Hindi  or  Hindustani.  Others  are  mixed 
with  elements  from  regional  dialects;  regional  languages  such  as  Bengali, 
Punjabi,  Marathi;  classical  languages  such  as  Sanskrit,  Arabic-Persian, 
and  a  recently  indigenized  foreign  language,  English.  Next,  for  a  socio- 
linguistic  description,  the  question  arises:  What  kind  of  methodology  is 
appropriate  to  use  in  determining  the  domains  of  use  of  Hindi  in  its  var- 
ious forms?  It  seems  to  me  that  the  present  state  of  the  Hindi  area  sug- 
gests the  anthropological  method  of  intensive  and  extensive  participant 
observation  rather  than  the  sociological  method  with  its  requirements  of 
survey  data  and  measurable  causes  (Fasold  1984). 

For  the  educationist,  the  problem  is  that  of  achieving  goals  in 
literacy  and  at  the  primary  and  higher  levels  of  education.  At  this  point, 
I  am  unable  to  find  any  statistical  data  on  the  rate  of  failure  at  the 
primary  and  other  levels  of  schooling  in  the  Eastern  regions  of  the  Hindi- 
speaking  area.  The  syntactic  differences  between  the  Eastern  and  MSH  var- 
ieties are  so  great  that  a  significant  rate  of  failure  can  be  expected. 
If  this  apprehension  is  proved  correct,  the  statistical  evidence  will  raise 
a  serious  question:  Should  the  Eastern  variety  be  used  for  educational 
purposes  at  the  elementary  levels  and  MSH  be  taught  as  a  second  language 
at  some  stage  of  post-primary  education? 

For  linguists  interested  in  language  and  society,  Hindi  provides  a 
unique  opportunity  of  studying  the  interplay  of  various  socio-political, 
linguistic,  and  cultural  forces. 


NOTES 


1.  I  am  grateful  to  the  University  of  Illinois  for  granting  me  a  sab- 
batical leave  of  absence  (Spring  1986)  and  to  the  American  Insti- 
tute of  Indian  Studies  for  awarding  me  a  Senior  Faculty  Research 
Fellowship  in  1986-87  which  enabled  me  to  conduct  field  work  in 
India.  This  paper  is  partially  based  on  the  work  I  did  during 
January  -  May  1986  while  in  India. 

2.  It  is  encouraging  to  note  that  this  fact  is  slowly  being  recog- 
nized by  literary  scholars  as  well.  This  is  evident  from  the 
editorial  entitled  kyaa  hindii  kSetriiya  bhaaSaa  hai?  'Is  Hindi 
a  regional  language?'  in  samiikSaa,  a  periodical  devoted  to 
literary  criticism,  in  its  issue  of  July  -  September  1986. 

3.  The  folk  traditions  and  dialects  of  the  Hindi  area  are  still 
contributing  to  the  revitalization  of  contemporary  Hindi  drama, 
as  pointed  out  in  Rai  1986. 


4.  aakaashvaaNii   and  doordarshan  are  the  names  of  the  All    India 
Radio  and  the  Indian  Television,   respectively. 

5.  The  following  convention  of  transcription  has  been  followed  in 
the  paper:     (a)   the  doubling  of  vowels   indicates  a  long  tense 
vowel;    (b)   capital   letters  indicate  a  retroflex   flap;    (c)   cap- 
italization and  conventional   spelling  have  been  used  for  insti- 
tutionalized words  borrowed  into  English  from  Indian  languages, 
e.g.,   proper  names  and  names  of  institutions  such  as  the  Lok 
Sabha,   the  lower  house  of  the  Indian  Parliament. 

6.  Sobti  1977  contains  the  following  exchange  between  several  well- 
known  writers  of  Hindi  and  P. A.  Barannikov»  a  famous  scholar  of 
Hindi   from  the  Soviet  Union: 

'aap  kuch  kah       rahe    the!' 

you  something  saying  were 

'sirf  itnaa  hii    ki  aap  log    na  battii  bujhaate  hai,  na  gul  karte 
only  this  enph.  that  you  pi.    not  larp  douse  not  extinguish 

hcff,  ab    aap    'laaiT'  'aaf  karte  hcff'.' 
new  you     light     off     do 

(hindiivaalo'kii  or  se  sirf  khisiyaanii  hasii) 
Hindiwallhs  from         only  entarrassed  laughter 

'aap  ko  'bas'  bhii  'baii    caans'    miltii  hai.' 
you  to    bus    also    by     chance     obtains 

'  'baii  caans'    isliye  ki  'regular'  nahff  miltii.' 
'by'  'chance'      because      'regular'  not     obtained 

(raajendra  yaadav  kaa  javaabii  hanlaa  ~  kariib-kariib  misfaayar) 
Rajendra   Yadav's        answering    attack       nearly  misfired 

'aap  nimantraN  yaa  daavat?*  'aksept'  karte  haf yaa  'rifyuus'  karte 
you  invitations  accept    do  or     refuse     do 

haf.  yah  bhii  kahte     sunaa  hai    ki      'pichle   mahiine  merii  'sisTar' 
this  also  saying    heard   have  that  last      month     my        sister 

kii  'mairij'  thii.'  'mere  'ankal'  kii     'Deth'  ho  gaii  hai.'  'aaj 
of  marriage   was        my      uncle    of     death    be  went  has     today 

meraa  'apaaiNTmeNT'   'DaakTar'  sehai.'  — 
my        appointment     doctor   with  is 

'You  were  saying  something! ' 

'Only  this  that  you  no  longer  extinguish  lanps  (bujhaanaa  =  a  native 
Hindi  verb  meaning  'douse  a  flame,  fire';  gul  kamaa  =  Perso-Arabic 
borrowing  meaning  the  same:  YK),  you  (switch)  'off  the  'light'.' 


(Entarrassed  laughter  from  the  Hindi  writers) 

'You  also  get  the  'bus'  'by  chance'.' 

'  'By  chance'  because  we  do  not  get  it  'regular'  (i.e.,  regularly:  YK).' 

(Rajendra  Yadav's  repartee  almost  misfired.) 

'You  also  'accept'  or  'refuse'  invitations  (nimantraN  =  Sanskrit  word  for 
invitation;  daavat  =  Perso-Arabic  borrowing  with  the  same  meaning:  YK). 
I  have  also  heard  people  say  'my'  'sister's  marriage'  took  place  last 
month.'  'My  'uncle'  is  'dead'.'  'Today  I  have  an  'appoinbrent'  with 
the  'doctor'.'... 

Barannikov's  comments  are  justified  in  view  of  how  English-mixed  Hindi 
is  used  by  educated  speakers  of  Hindi. 

7.  The  excerpts  in  20  and  21  are  taken  from  Nibandh  Rashmi  (n.d.,  pp. 
104  and  29),  published  by  Nina  Publications  (Regd.).  The  book  is 
sold  in  Delhi  and  claims  to  be  'for  primary  school  children  accord- 
ing to  the  syllabus  of  the  Education  Department.' 

8.  Except  for  the  editorial  in  samiikSaa  referred  to  in  footnote  2, 
I  did  not  come  across  any  discussion  of  the  issues  related  to 
standardization  or  variation  in  any  other  forum  in  India. 
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THE  ERGATIVE  CASE  IN  HINDI-URDU* 
Baber  S.A.  Khan 


Postpositions  in  Hindi-Urdu  are  traditionally  considered  as  case  mark- 
ers. Some  of  the  cases  realised  by  postpositions  are  treated  as  structural 
while  other  oblique  cases  are  understood  to  be  inherent  ones.  As  such,  koo 
is  usually  described  as  realising  the  dative-accusative  case  while  nee  marks 
the  ergative  case.  Tlie  assignment  of  dative  and  ergative  case  to  NPs  in 
subject  position,  a  position  which  is  ordinarily  assigned  the  nominative 
case,  has  remained  problematic.  TTie  problem  of  specifying  the  case  assign- 
ing elements  to  subject  NPs  may  be  circumvented  if  postpositions  are 
treated  as  markers  of  inherent  case ,  assigned  at  D-structure  and  associated 
with  theta  roles,  while  the  structural  case  is  restricted  to  the  nominative 
and  accusative  case  realised  at  S-structure  and  marked  by  the  absence  of 
postpositions. 


0.  INTRODUCTION: 

Ergativity  in  Hindi-Urdu  (H-U)  has  been  described  as  a  surface  phenomenon 
that  is  strictly  linked  to  perfective  aspect.  However,  the  ergative  case  is  not  assigned  to 
subject  NPs  in  all  or  every  perfective  sentence.  Neither  is  the  ergative  marker  nee 
necessarily  associated  with  transitivity  or  with  such  semantic  properties  of  the  re- 
ferents of  NPs  as  'animacy'  or  'definitness'.  In  fact,  no  syntactic  rules  except  for  the 
rule  of  verb  agreement  appear  to  crucially  refer  to  the  ergative  alignment.  Even  the 
verb  agreement  rule  is  not  totally  sensitive  to  the  ergative-absolutive  pattern.'  These 
and  other  facts  have  raised  questions  about  the  relevance  of  the  ergative-absolutive 
alignment  to  the  rule  system  of  H-U.  It  is  not  totally  unreasonable  to  ask  whether  the 
so-called  ergative  case  has  anything  to  do  with  ergativity  at  all.  If  nee  is  not  the  marker 
of  ergative  case  then  it  is  worth  exploring  what  it  stands  for  and  how  it  gets  assigned 
to  subject  NPs  in  perfective  sentences. 

This  paper  attempts  to  provide  answers  to  questions  like  the  one  posed  above. 
The  first  part  of  the  paper  consists  of  arguments  suggesting  that  postpositions  in  H-U, 
contrary  to  common  belief,  do  not  reflect  structural  case.  In  general,  they  underscore 
the  thematic  roles  and  mark  the  inherent  case  assigned  to  NPs  at  D-structure  when 
NPs  are  ^-marked  by  verbs  (Chomsky  1986).  The  discussion  will  focus  primarily  on 
roles  signaled  by  the  postpositions  nee  and  koo.  It  is  assumed  that  oblique  case  as- 
signed and  marked  by  other  postpositions  can  be  accounted  for  by  the  same  principles. 

The  second  part  discusses  the  mechanism  by  which  case  is  assigned  to  NPs 
marked  with  nee  .  This  will  be  done  within  the  formal  framework  of  Government  and 
Binding  theory.  To  be  assumed  are  the  general  principles  and  conditions  of  the  Theta 
and  Case  theories  as  developed  in  the  relatively  recent  works  of  GB  theory. 
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1.  POSTPOSITIONS  AS  CASE  MARKERS: 

Postpositions  like  nee  and  koo  have  traditionally  been  described  as  structural  case 
markers.  Nee  is  referred  to  as  marker  of  ergative  case  while  koo  is  generally  described 
as  marking  the  dative-accusative  case.  It  is  my  contention  that  these  j>ostpositions  do 
not  signal  structural  case  at  aJl.  Instead,  they  mark  the  thematic  roles  (e.g.  agent,  pa- 
tient, experiencer)  assigned  by  the  verbs  to  the  NPs.  The  data  given  below  provide 
ample  evidence  for  treating  these  postpositions  as  markers  of  inherent  case  associated 
with  ^-roles.  In  the  discussion  that  follows,  I  shall  assume  that  the  term  dative  does 
not  refer  to  the  semantic  notion  of  recipient.  Instead,  as  commonly  described  in  the 
literature,  the  term  dative  corresponds  to  the  syntactic  term  indirect  object  just  as  the 
accusative  case  refers  to  direct  object. 

Given  that  grammatical  relations  are  con  figuration  ally  defined,  the  unmarked 
linear  order  of  their  occurrence  in  H-U  appears  as  below: 

Subject-  Indirect  Object  -  Direct  Object  -  Verb       (Gambhir  1981). 

NPs  ordinarily  occupying  the  indirect  object  (dative)  position  are  marked  by  koo.  Yet, 
that  position  in  H-U  is  not  always  associated  with  koo  and,  hence,  koo  cannot  be  con- 
sidered a  defining  element  for  a  syntactic  category.  The  following  examples  illustrate 
that  the  choice  of  a  specific  postposition  depends  on  the  semantic  role  assumed  by  the 
NP  and  has  nothing  to  do  with  the  NP's  occurring  at  the  indirect  object  position. 

(1)  mt  nee    apne    doost    koo    kitaab   dii 

I     ERG   self    friend   D/A    book   gave^ 
'I  gave  the  book  to  my  friend.' 

(2)  (a)         mt  nee    apne    doost    see     kitaab   hi 

I     ERG   self    friend   from  book    took 
'I  got  the  book  from  my  friend.' 

(b)         *  mX  nee    apne    doost  koo  kitaab   lii 

(3)  (a)        mt  nee    meez   par  kitaab   rakhii 

I     ERG    table   on     book      put 
'I  put  the  book  on  the  table.' 

(b)        *m'tnee    meez    koo  kitaab   rakhii 

The  verb  deenaa  'to  give'  in  (1)  subcategorizes  for  a  recipient  NP  marked 
with  koo  in  the  indirect  object  position.  However,  the  verbs  leenaa  'to  take'  and 
rakhnaa  'to  put'  in  (2)  and  (3)  respectively  necessitate  that  the  NPs  in  the 
indirect  object  position  be  marked  with  instrumental  postposition  see  'from'  and 
the  locative  postposition  par  'on'.  The  cases  assigned  to  NPs  in  the  indirect 
object  position  are  clearly  not  structural,  a  case  assigned  independently  of 
thematic  relations  (Chomsky  1986). 
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There  is  also  little  evidence  for  koo  to  be  considered  an  accusative  case 
marker.  The  passive  rule  in  H-U  provides  support  for  the  view  that  koo  does 
not  mark  the  accusative  case.  Passive  morphology  is  said  to  absorb  the  case.  One 
would  expect  that  koo,  if  a  marker  of  accusative  case,  would  disappear  with  the 
application  of  the  passive  rule.  However,  when  the  passive  rule  promotes  a  direct 
object  to  subject  position,  koo,  when  present  in  the  corresponding  active  clause, 
may  still  appear  with  the  new  subject  indicating  that,  in  fact,  it  is  not  an  accusa- 
tive case  marker  at  all. 

(4)  (a)         unhoo    nee     puliis    koo   bulayaa 

they      ERG    police   D/A   called 
'They  called  the  police.' 

(b)  puliis  koo  bulayaa  gayaa 
police  D/A  called  went 
'The  police  were  called.' 

The  occurrence  of  koo  with  NP  in  the  direct  object  position  is  optional  and 
depends  on  the  semantic  properties  of  the  NP.  It  is  generally  believed  that  koo 
marks  the  direct  object  for  'definiteness'  and  'animacy'  (Kachru  1966;  Moravcsik 
1978;  Wierzbicka  1981).  As  the  examples  below  indicate,  neither  of  these 
semantic  properties  is  sufficient  to  condition  the  occurrence  of  koo. 

(5)  is      meez     koo   udhar    rakhoo 
this    table    D/A    there    put 
'Put  this  table  over  there.' 

(6)  (a)         ham    nee     ek     baccee     koo    bhaagtee  huwee    dekhaa 

we     ERG      a      child     D/A     running  saw 

'We  saw  a  child  running.' 

(b)  us  nee  kisii  §axs  koo  ittelaa  nahlT  kii 
he  ERG  any  person  D/A  information  not  did 
'He  didn't  inform  anyone.' 

(c)  kuch   ciizoOj   koo    turn  sambhaaloo,    kuch   e^   koo  mt 
some    things    D/A    you  take  care         some    <j>    D/A    I 
sambhaltaa  huu 

take  care      am 

'You  take  care  of  some  of  the  things, (while)  I  take  care  of 

(the  remaining)  some.' 

In  sentence  (5)  above,  an  inanimate  object,  meez  'table',  is  marked  with 
koo.  In  (6a),  koo  appears  with  an  indefinite  NP  overtly  marked  with  the 
indefinite  article  ek  'a,  an'.  Koo  also  occurs  with  NPs  that  are  marked  with 
indefinite  determiners  like  kisii  (6b)  and  kuch  (6c). 


It  seems  that  the  occurrence  of  koo  with  the  direct  object  is  conditioned  by 
two  semantic  factors:  the  NP  involves  a  specific  reference  and/or  is  assigned  the 
^-role  of  patient/ re cipient/experiencer  by  the  verb.  The  claim  that  koo  signals 
specific  reference  is  supported  by  the  fact  that  the  direct  object  NP,  in  contexts 
involving  non-specific  reference,  does  not  cooccur  with  koo. 

(7)  voh    machlii   (*koo)     pasand   nah"?  kartaa 
he      fish  D/A  like      not    does 
'He  does  not  like  fish.' 

(8)  is  faarm  par  hukuumat  hiran  (*koo)  paaltii  hai 
this  farm  on  government  deer  D/A  raise  is 
"The  government  raises  deer  on  this  farm.' 

In  both  (7)  and  (8),  the  direct  object  involves  generic  reference  that  makes 
koo  unacceptable  in  the  intended  generic  sense  of  the  sentence.  The  use  of  koo 
in  the  above  sentences  gives  the  awkward  reading  whereby  machlii  'fish'  and 
hiran  'deer'  refer  to  some  particular  member  of  the  species. 

It  is  generally  understood  that  stative  verbs  are  closely  associated  with  gen- 
eric reference  in  the  noun  phrase  while  dynamic  verbs  are  linked  to  specific 
reference  (Quirk  et  aJ.  1972).  In  H-U,  generic  reference  is  not  signaled  by  any 
overt  marker.  Instead,  NPs  signaling  generic  reference  commonly  occur  in 
clauses  that  involve  stative  verbs  like  the  im perfective  form  of  the  verb  hoo  'to 
become'  and  the  finite  form  of  the  verb  honaa  'to  be'  (Kachru  1980).  It  is  not  a 
coincidence  that  koo  is  rarely,  if  at  all,  used  in  clauses  involving  stative  verbs 
where  NP  signals  generic  reference. 

At  times  'animacy'  does  appear  to  condition  the  occurrence  of  koo  witii 
direct  objects. 

(9)  bacce  koo  uthaaloo 
child  D/A  pick  up 
'Pick  up  the  child.' 

(10)  kitaab  (-koo)  uthaaloo 
book  D/A  pick  up 
'Pick  up  the  book.' 

In  (9),  koo  appears  to  be  obligatory  with  an  animate  NP  whereas  in  (10)  it 
is  awkward  with  an  inanimate  object.  However,  when  the  sentence  in  (9)  is  read 
without  koo,  the  reading  does  provide  a  generic  sense,  albeit  very  awkwardly. 

(11)  'bacca  uthaaloo 
'Pick  up  a  child' 

It  is  interesting  to  note  that  the  noun  hacca  'child/baby',  which  may  also 
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refer  to  animaJ  babies,  appears  in  (9)  above  to  refer  to  a  human  child  only. 
When  an  NP  refers  to  an  animal  baby,  koo  becomes  optional  although  the  entity 
referred  to  is  still  an  animate  one. 

(12)  billi  (kaa  bacca  uthaloo 

(kee  baccee  ktx) 
cat      of   baby     D/A    pick  up 
'Pick  up  the  kitten.' 

It  seems  that  reference  is  a  matter  of  degree  and  animate  entities  appear  to 
involve  inherently  more  specific  reference  than  the  inanimate  objects  on  a  con- 
tinuum of  specificity.  Among  the  animate  entities,  the  human  ones  are  probably 
more  specific  than  the  non-human  ones.  Within  the  set  of  NPs  that  refer  to 
human  entities,  it  is  probably  the  case  that  a  certain  subset  of  these  NPs  such  as 
personal  names,  pronominals,  kinship  terms,  etc.,  necessarily  involve  specific 
reference  which  may  account  for  their  being  marked  with  koo  in  the  direct  object 
position.  Abstract  and  mass  nouns  appear  to  share  properties  of  NPs  involving 
non-specific  reference  and  hence  the  sentence  becomes  awkward  when  koo 
marks  an  abstract  or  mass  noun  in  the  direct  object  position. 

(13)  is     dawaa       nee    buxaar  (  koo)    utaar     diyaa 
this  medicine  ERG  fever   (DA)  brought  down 
"This  medicine  brought  the  fever  down.' 

(14)  us  nee    jaldii         jaldii      kaafii/         banaaii 

rkoo   banaayaa 
he  ERG  hurriedly  hurriedly  coffee         made 
'He  hurriedly  made  coffee.' 

The  data  discussed  so  far  suggest  that  the  occurrence  of  koo  with  the  direct 
object  is  closely  related  to  referential  properties  of  the  NP  and  has  little  to  do 
with  accusative  case. 

As  mentioned  earlier,  koo  also  highlights  the  patient/experiencer  role  of 
the  direct  object.  It  is  relatively  more  conspicuous  in  contexts  where  the  flexible 
nature  of  H-U  word  order  makes  sentences  potentially  ambiguous  in  that  either 
of  the  two  NPs  could  be  interpreted  as  agent  or  patient. 

(15)  hiiraa         hiiraa       kattaa   hai 
diamond  diamond    cut       is 
'Diamond  cuts  diamond.' 

Given  the  normal  intonation  pattern,  the  first  NP  in  (15)  is  interpreted  as 
agent  and  the  second  as  patient.  However,  for  a  large  number  of  native  speak- 
ers, the  first  NP  may  equally  well  be  interpreted  as  patient  if  a  pause  is  intro- 
duced between  the  two  NPs.  This  ambiguitj'  may  be  dispelled  by  marking  the 
patient  NP  with  koo  irrespective  of  its  place  of  occurrence  in  the  sentence. 


(16)        (a)        hiiraa   hiiree    koo  kattaa,  hai 
(b)         hiiree    koo  hiiraa   kattaa   hai 
'Diamond   cuts   diamond.' 


To  sum  up  the  above  discussion,  the  presence  or  absence  of  koo  in  the 
direct  object  position  is  partially  conditioned  by  the  referential  properties  of  the 
NP  and  partially  by  the  5-role  assigned  to  the  NP  occurring  in  that  position. 
Tliere  is  little  justification  for  treating  koo  as  a  postposition  reflecting  the  dative- 
accusative  case. 

Chomsky  (1986:193)  observes  that  of  all  the  case- assigning  lexical 
categories,  only  V,  together  with  INFL  containing  AGR,  assigns  structural  case 
at  S-structure.  Other  case- assigning  categories,  P,  N,  and  A,  assign  inherent 
case  associated  with  ^-marking  at  D-structure.  In  H-U,  however,  V  appears  to 
assign  inherent  case  (in  conjunction  with  INFL)  to  NPs  in  subject  position.  The 
examples  below  demonstrate  that  the  assigned  ^-role  of  subject  NPs,  marked  by 
postpositions  such  as  nee  and  koo  is  conditioned  by  the  selection  of  the  verb. 
The  selection  of  a  specific  postposition,  therefore,  depends  on  the  assignment  of 
the  specific  ^-role  to  the  NP. 


(17)        (a)        us     nee       coot       khaaii 
he     ERG     injury      ate 
'He  got  hurt.' 


(b) 


(18)         (a) 


(b) 


us 
he 
'He 

koo      coot 

D/A     injury 
got  hurt.' 

lagii 
felt 

us 
he 
'He 

nee        inaam 
ERG      reward 
got  a  reward.' 

paayaa 
found 

us 
he 
'He 

koo          inaam 
D/A        reward 
got  a  reward.' 

milaa 
met 

(19)        (a)        us     nee       ghussa  kiyaa 

he     ERG        anger  did 
'He  got  angry.' 

(b)        us     koo       ghussa  aayaa 

he     D/A        anger  came 
'He  got  angry.' 


The  above  pairs  of  sentences  are  identical  in  terms  of  prepositional  con- 
;nt,  yet  they  differ  in  surface  manifestation.  The  verbs  in  the  (a)  sentences 
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above  assign  the  agentive  role  to  subject  NP  that  get  marked  by  the  postposition 
nee  in  the  perfective  aspect.  The  verbs  in  the  (b)  sentences,  however,  assign  the 
dative/recipient/experiencer  role  to  the  subject  NPs  though  the  meaning  of  the 
sentence  in  each  case  is  identical  to  the  cne  expressed  in  the  corresponding  (a) 
sentence.  It  is  interesting  to  note  that  in  the  above  pairs  of  sentences,  the  subject 
NPs  in  both  sentences  denote  entities  which  are  eventual  recipients  or  experienc- 
ers  in  absolute  semantic  terms  yet  the  verbs  in  the  (a)  sentences  subcategorize 
for  NPs  that,  in  some  intuitive  sense,  are  volitional  participants  of  the  action 
while  the  NPs  in  the  (b)  sentences  indicate  an  involuntary  participation  on  the 
part  of  entities  they  denote. 

2.  THE  ASSIGNMENT  OF  NEE: 

Because  the  so-called  ergative  NP  is  marked  with  the  postposition  nee,  it 
lacks  the  property  of  verb  agreement  usually  associated  with  subject  NPs  in  the 
nominative  case.  But  the  ergative  NP  does  show  other  subject  properties  such  as 
clause  initial  position,  control  of  null  subjects  and  of  reflexive  NPs,  etc.  Given 
that  subjects  in  H-U  derive  the  syntactic  property'  of  subjecthood  by  occupying  a 
con  figuration  ally  defined  subject  position  in  the  phrase  structure  tree,  the  prob- 
lem remains  of  how  nee  is  assigned  to  subject  NPs  in  perfective  structures. 

Some  treatments  of  non-nominative  subjects  (Hermon  1981;  Davison 
1985)  suggest  that  these  NPs  originate  in  object  position  at  D-structure  and, 
after  receiving  case  at  S-structure,  move  to  a  con  figuration  ally  defined  subject 
position  in  the  derivation  of  LF.  Thus,  the  non-nominative  subjects  acquire  sub- 
jecthood at  LF  and  not  at  S-structure. 

Keeping  such  an  analysis  in  mind,  we  may  consider  the  following  represen- 
tation of  the  so  called  ergative  sentences  at  D-structure. 

(20)  S[  NP-e  INFL  VP[  NP-nee  V'[  NP  V]]] 

The  structural  representation  in  (20)  above  also  holds  at  S-structure.  Both 
^-role  and  case  is  assigned  at  D-structure  and  checked  at  S-structure  by  the 
appropriate  filters.  However,  the  ergative  NP  in  (20)  originates  inside  the  VP 
constituent  and  as  such  is  assigned  case  by  V.  If  the  so-called  ergative  NP 
behaves  like  other  subjects,  which  it  does,  it  has  to  move  out  to  a  structurally 
defined  subject  position.  Since  move-a  is  an  optional  rule  and,  at  the  same  time 
no  other  principle  appears  to  be  violated  by  the  ergative  NP  for  occupying  that 
sf>ecific  position,  there  seems  to  be  no  reason  why  the  NP  marked  with  nee 
should  be  moved  out  in  the  derivation  of  S-structure.  Such  a  movement,  there- 
fore, will  have  to  take  place  in  the  derivation  of  LF  and  the  ergative  NP,  parallel 
to  non-nominative  subject,  may  be  interpreted  as  subject  in  the  LF  component. 
We  may  illustrate  a  simplified  structural  representation  of  (20)  above,  at  the  LF 
level,  in  (21)  below. 


(21) 


INFL 


-  ^-role 


us 

/ 

nee       NP 

\ 

V 

he 

khana 

khayaa 

food 

ate 

The  structure  represented  in  (20)  suggests  the  possibility  of  deep  (or  syn- 
tactic) ergativity.  Since  there  is  no  independent  evidence  that  H-U  involve  deep 
ergativity  (see  e.g.  Kachru  and  Pandharipande  1976),  the  structural  representa- 
tion given  in  (20)  is  of  little  significance.  Moreover,  such  a  representation 
requires  the  subject  NP  to  appear  in  a  non-^-position  in  an  argument  chain.  This 
unnecessarily  complicates  the  grammar  since  no  such  subcategorization  is  needed 
for  subject  NPs  marked  with  nominative  case. 

An  alternative  analysis  may  appear  to  be  more  plausible  where  V  assigns 
nee  to  subject  NPs  under  theta-government  via  INFL.  For  the  purposes  of  such 
an  analysis,  I  shall  adopt  the  model  of  Case  theory  as  developed  in  Levin  & 
Massam  1985.  Given  below  is  the  summary  of  some  of  the  principles  and  condi- 
tions relevant  to  our  analysis. 

(22)  Z°  governor,  then  Z°  has  associated  with  it  C2, 
where  C  =  Abstract  Case.    (Levin  &  Massam  (3)) 

C^  is  to  be  viewed  as  a  feature  the  assignment  of  which  by  an  element  is 
subject  to  the  conditions  on  case  assignment. 

(23)  Conditions  on  Case  Assignment  (Levin  &  Massam  (4)) 

A.  Cjj  must  be  assigned. 

B.  C    (y  ^  x)  can  be  assigned  only  under  theta-government. 

^  (External  thetarrole  assigned  by  VP  via  INFL) 

C.  Case  is  assigned  only  under  government. 

Condition  A  is  relevant  for  nominative-accusative  systems  where  INFL 
very  often  assigns  case  to  NP/S.  However,  for  H-U,  where  inherent  case  is 
assigned  by  certain  verbs,  condition  B  is  also  relevant.  C  may  be  realised  in  a 
subject  NP,   marked   by  koo,   when  VP  assigns  the  dative/experiencer  role   as 
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external  theta-role  to  the  NP.  Elsewhere,  when  the  sentence  is  in  the  perfective 
aspect,  the  value  of  y  may  be  realised  as  nee  when  the  subject  NP  is  assigned  the 
external  S-role  of  agent.  Since  perfective  aspect  is  equally  crucial  to  the  assign- 
ment of  nee,  aspect  has  to  be  accommodated  within  the  structure  of  INFL. 

(24)  INFL   >   TENSE,   AGR,    ASP 

The  assignment  of  either  nominative  or  ergative  case  wjjl,  then,  depend  on  the 
value  of  ASP  which  may  be  stipulated  in  terms  of  a  [_,PERFECT]  feature.  If 
INFL  contains  [+ PERFECT],  it  will  assign  nee  to  the  subject  NP,  otherwise  the 
nominative  case  will  be  assigned. 

We  can  now  propose  an  alternative  structure  to  (20)  above  where  nee  ori- 
ginated inside  VP. 

(25)  S[NP-nee  I'[INFL   VP[NP  V])] 

The  structure  in  (25)  above  holds  both  at  D  and  S-structure.  NP-nee  originates 
in  subject  position  and  is  assigned  case  under  ^-government  by  VP  via  INFL. 
The  accusative  case  is  assigned  by  the  verb  and  its  realisation  is  marked  either  by 
koo  or  by  4>  depending  on  the  relevant  semantic  factors.  The  structure  in  (25) 
may  be  illustrated  as  below. 


(26) 


khayaa 
ate 


'The  boy  ate  an  apple. 


Another  important  assumption  by  Levin  &  Massam  is  the  introduction  of  a 
feature  [_CA]  as  a  property  of  all  lexical  governors.  The  feature  [+CA],  where 
CA  stands  for  Case  Assignment,  makes  it  obligatory  for  an  element  to  assign  a 
case  if  it  has  the  feature.  Such  an  assumption  is  crucial  for  the  assignment  of  nee 
in  intransitive  clauses.  Intransitive  verbs  generally  do  not  assign  case.  Yet,  there 
are  dialects  of  H-U  wherein  some  intransitive  verbs  would  appear  to  assign  nee 
to  their  single  argument. 
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(27) 


larkee  nee  cKnkaa 
boy  ERG  sneezed 
'The  boy  sneezed.' 


This  can  be  accounted  for  by  the  feature  [_1CA).  All  the  verbs  in  intransitive 
clauses  with  nee  have  the  feature  [+CA]  which  in  conjunction  with  [+ PER- 
FECT] in  INFL  causes  nee  to  be  assigned  to  the  subject.  The  structure  of  (27) 
above  may  appear  as  in  (28)  below. 


(28) 


At  this  point  the  only  problematic  case  that  needs  to  be  accounted  for  is 
the  existence  of  a  small  number  of  transitive  verbs  that  do  not  take  nee  despite 
the  perfective  aspect.  No  ready  solution  is  available  except  that  these  verbs 
should  be  treated  as  exceptions.  It  is  probably  the  case  that  for  all  such  verbs  the 
value  of  the  Abstract  Case  as  determined  by  (22)  is  [+ nominative]  which  is 
assigned  to  subject  NP,  despite  the  perfective  aspect,  by  virtue  of  the  presence  of 
the  feature  [+CA]. 

3.  CONCLUSION 


In  H-U,  the  assignment  of  nee  to  subject  NPs  in  perfective  aspect  has  been 
considered  problematic.  Within  the  GB  framework,  the  occurrence  of  nee  in  sub- 
ject position  can  be  accounted  for  if  NPs  marked  with  nee  are  treated  as 
instances  of  inherent  case  as  defined  by  some  general  principles  of  Theta  and 
Case  theories.  As  demonstrated  in  the  first  section  of  the  paper,  nee  appears  to 
mark  the  agentive  role  of  the  subject  NPs.  As  such,  it  can  justifiably  be  treated 
as  a  marker  of  inherent  case  which  is  assigned  under  ^-government  by  V  via 
INFL.  The  assumption  that  all  case  assigning  lexical  categories  have  a  feature 
[iCA]  also  accounts  for  the  few  intransitive  verbs  that  assign  nee  to  their  sub- 
jects. It  is  suggested  that  the  verbs  in  transitive  clauses  that  do  not  show  nee  are 
lexically  marked  for  the  assignment  of  nominative  case. 
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NOTES 


*  I  am  indebted  to  Alice  Davison,  Yamuna  Kachru,  and  James  Yoon  for 
their  very  helpful  comments  and  discussions.  They  are,  however,  not  responsible 
for  any  errors  and  shortcomings  that  may  still  remain.  I  am  also  greatful  to 
Adeel  Shaukat  for  his  syntactic  and  semantic  judgements. 

See  Kachru  and  Pandharipande  (above  cite)  for  examples  and  related  dis- 


2 

Following  common  practice,  I  shall  gloss  nee  as  ERG  (ergative)  and  kc 
as  D/A  (dative-accusative). 
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REMARKS  ON  DRAVIDIAN  COMPLEMENTATION* 
Sanford  B.  Steever 


The  morphological  and  lexical  bases  of  Dravldian  comple- 
mentation are  explored  by  proposing  a  series  of  rules  that 
govern  the  distribution  of  finite  predicates  in  the  Dravidian 
sentence.   The  distribution  of  two  verbs,  a-    'become'  and  en- 
'say'  is  shown  to  be  correlated  with  the  distribution  of  finite 
predicates.   They  owe  this  to  the  fact  that  they  are  negative 
absolute  exceptions  to  case-marking.   Certain  syntactic  changes 
in  North  Dravidian  are  shown  to  be  accomplished  by  increasing 
the  number  of  predicates  that  are  exceptions  to  case-marking, 
rather  than  by  blindly  imitating  Indo-Aryan  structures.   The 
hypothesis  of  brute-force  convergence  is  rejected  as  an  explan- 
ation for  these  syntactic  changes  in  favor  of  one  that  utilizes 
native  lexical,  morphological  and  syntactic  resources  in 
diachronic  syntax. 


0.   Introduction.  The  commonplace  observation  that  different  languages  may 
express  the  same  grammatical  concept  in  radically  different  ways  is  the 
foundation  of  many  language  typologies,  the  most  celebrated  of  which  is 
Sapir's.   According  to  a  somewhat  less  subtle  typology,  languages  may  be 
said  to  differ  according  as  they  encode  certain  grammatical  concepts  in 
morphology  or  syntax.   Close  to  the  center  of  Sapir's  typology,  the 
Dravidian  languages  rely  extensively  on  their  morphology  and  lexicon  to 
carry  out  those  grammatical  tasks  which  more  peripheral  languages,  such  as 
Chinese  or  English  would  entrust  to  their  syntax.   This  difference  is 
illustrated  here  with  the  use  of  morphology  and  lexicon  in  the  formation 
of  complex  sentences  in  the  Dravidian  languages.   Attention  is  focused  on 
the  crucial  roles  played  by  two  verbs  which  are  exceptions  to  the  rules 
of  case-assignment  in  the  construction  of  complex  sentences. 

Certain  conventions,  which  are  followed  in  this  paper,  are  noted  here. 
Dravidian  etyma  are  numbered  according  to  the  system  in  A  Dravidian 
etymological  dictionary,  2nd  edition,  and  are  identified  by  the  abbrevi- 
ation DEDR.   The  following  notations  are  used  in  the  interlinear  glosses 
of  the  examples:  abl  -  ablative;  ace  -  accusative;  adnp  -  adnominal 
participle;  cond  -  conditional;  cp  -  conjunctive  participle;  dat  - 
dative;  f  -  feminine;  fut  -  future;  gen  -  genitive;  h  -  honorific;  imp  - 
imperative;  inf  -  infinitive;  int  -  interrogative;  loc  -  locative;  m  - 
masculine;  n  -  neuter;  neg  -  negative;  nom  -  nominative;  npst  -  nonpast; 
obi  -  oblique;  perf  -perfect;  perm  -  permissive;  pi  -plural;  pres  - 
present;  pst  -  past;  s  -  singular;  soc  -  sociative;  vn  -  verbal  noun; 
1  -  first  person;  2  -  second  person;  3  -  third  person.  Morpheme  boundaries 
have  been  approximately  inserted  into  the  examples,  so  no  systematic  sig- 
nificance should  be  attached  to  their  placement  or  their  absence.   The 
transcription  of  the  Tamil  examples  follows  that  used  in  Steever  (1986). 
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The  Dravidian  languages  distinguish  just  two  parts  of  speech  according 
to  their  morphology  (note  1):  noun  and  verb,  both  of  which  may  occur  in 
simple  or  compound  forms.   Noun  and  verb  are  mutually  implicating  terms 
(note  2) ,  and  are  so  strongly  polarized  in  opposition  to  each  other  that 
their  bonding  has  effectively  inhibited  the  growth  of  new  morphological  and 
syntactic  categories  (note  3).   Nouns  are  typically  inflected  for  case,  a 
category  that  characterizes  the  relation  of  a  noun  to  its  predicate  within 
the  confines  of  a  single  clause  (note  4) .   Verbs  are  inflected  for  verbal 
categories  such  as  mood  and  tense.   Unlike  case,  a  verbal  category  can 
establish  a  grammatical  relation  outside  the  bounds  of  a  single  clause. 
Verbs  prove  to  be  the  fundamental  variable  in  the  Dravidian  sentence  in 
virtue  of  the  fact  that  they  combine  two  roles.   Not  only  do  they  encode 
valuble  grammatical  information  in  verbal  categories,  but  they  also  govern 
the  case-assignment  of  the  nouns  they  combine  with.   Particular  attention 
is  therefore  drawn  to  the  central  role  of  verbs  in  the  formation  of  the 
Dravidian  sentence. 

This  perfectly  balanced  and  tightly  knit  system  of  verb  and  noun  has  a 
direct  bearing  on  the  construction  of  the  complex  sentence.   The  Dravidian 
languages  lack  those  parts  of  speech  that  acquaintance  with  a  standard 
average  European  language  or  an  introductory  textbook  of  generative  lin- 
guistics would  lead  us  to  believe  are  indispensible  elements  of  the  complex 
sentence:  conjunctions  (and),  complementizers  (that)  and  adverbs  (during) . 
The  grammatical  functions  traditionally  associated  with  these  parts  of 
speech  are  assumed  by  the  morphology  of  nouns  and  verbs  in  the  Dravidian 
languages.   And  yet  the  basic  opposition  of  noun  and  verb  may  be  too  strong, 
for  the  morphology  can  be  shown  to  impose  severe  restrictions  on  the 
structure  and  variety  of  the  Dravidian  sentence.   It  stands  to  reason,  then, 
that  exceptions  to  the  rules  of  morphology  may  well  provide  the  grammatical 
means  to  circumvent  these  restrictions.   Two  Dravidian  verbs,  *£-  'become' 
(DEDR  333)  and  *en-  'say,  think'  (DEDR  868),  are  shown  to  be  negative 
absolute  exceptions  to  the  principles  of  case-assignment.   They  may  be  used 
in  this  capacity  to  get  around  the  morphological  constraints  on  the  structure 
of  the  complex  sentence. 

Section  1  discusses  the  basic  structure  of  simple  and  complex  sentences, 
concentrating  especially  on  the  distribution  of  finite  predicates  within 
these  structures.   A  rule  is  proposed  to  describe  the  distribution  of  finite 
and  nonfinite  predicates  in  sentences  of  any  internal  complexity.   Section  2 
notes  that  this  rule  raises  a  number  of  syntactic  and  pragmatic  problems  in 
the  formation  of  complex  sentences.   These  problems  are  overcome  with  the 
use  of  two  exceptions  to  case-assignment,  viz.,  *a-    'become'  and  *en-  'say, 
think'.   Section  3  then  introduces  other  predicates  which  may  on  occasion 
enter  this  elite  class  of  exceptions.   A  preliminary  semantic  classifi- 
cation of  such  predicates  is  formulated  in  light  of  these  examples.   Section 
4  discusses  the  impact  of  this  phenomenon  on  the  syntactic  development  of 
the  Dravidian  languages,  particularly  the  northern  subgroup.   The  possi- 
bility of  reconstructing  this  phenomenon  for  Proto-Dravidian  is  discussed, 
and  the  hypothesis  of  areal  convergence  with  Indo-Aryan  debated. 
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1.   Basic  Dimensions  of  the  Dravidlan  Sentence.   First,  a  few  words  about 
Dravidian  morphology  are  in  order.   It  is  transparent,  agglutinating  and 
predominantly  suffixal.   The  order  of  morphemes  in  the  word  is:  lexical 
root,  derivational  suffix  and  inflectional  suffix.   Noun  and  verb  are 
morphologically  distinguished  according;  to  the  grammatical  categories  they 
inflectionally  encode.   The  common  Dravidian  case  system  include  eight  cases: 
nominative,  accusative,  dative,  genitive,  locative,  sociative,  instrumental, 
and  ablative  (note  5) .   Nouns  may  be  additionally  marked  for  number  and 
gender.   Some  languages,  such  as  Gondi  and  old  Tamil,  can  inflect  predicate 
nominals  for  person,  e.g.,  the  suffix  -em  in  the  old  Tamil  predicate  nominal 
pentir-em  'we  women'  marks  agreement  with  a  first  person  plural  subject. 
Verbs  mark  verbal  categories  such  as  tense  and  mood.   A  verb  is  in  formal 
terms  finite  or  not,  depending  as  it  bears  inflections  for  tense  and  subject- 
predicate  agreement,  or  not.   The  finite  verb  appears  at  the  end  of  the 
sentence,  bringing  it  to  a  close.   All  other  verbs  in  the  sentence  must 
appear  in  a  nonfinite  form:  by  definition,  they  combine  with  a  following 
word,  and  so  do  not  close  the  sentence.  Nonfinite  verbs  come  in  two  varieties. 
The  first  included  all  the  forms  that  combine  with  a  following  noun  to  create 
relative  clauses  and  other  kinds  of  nominal  complements.   In  traditional  Tamil 
grammar  they  are  known  as  peyareccam  'verbs  that  anticipate  a  noun',  and 
include  the  adnominal  (relative)  participles.   I  deviate  here  from  traditional 
usages  in  including  verbal  nouns  under  the  heading  of  peyareccam,  they  differ 
from  adnominal  participle  and  head  noun  only  in  that  the  verbal  noun  morpho- 
logically incorporates  the  head  along  with  the  verb.   The  second  includes 
all  those  verb  forms  that  combine  with  a  following  verb.   Known  as  viNaiyeccam 
in  traditional  terminology,  they  include  such  forms  as  the  conjunctive 
participle,  the  infinitive  and  the  conditional.   It  is  important  to  remember 
that  a  nonfinite  verb  and  the  word  it  combines  with  may  stand  adjacent  or 
be  separated  by  intervening  material,  which  criterion  serves  to  distinguish 
them  from  mere  compounds. 

The  simple  sentence  is  a  single  clause  consisting  of  a  subject  and  a 
predicate.   The  subject  is  a  noun-phrase  that  is  inflected  for  the  nominative 
or,  in  certain  circumstances,  the  dative  case.   (Some  sentences  may  in  fact 
have  locative  subjects,  e.g.,  Kannada  nanna  hattira  hana  ide  'nearj  mej 
money3  is^',  i.e.,  'I  have  money';  however,  more  research  is  needed  to 
establish  whether  or  not  such  'locative  subjects'  do  in  fact  have  subject- 
coding  properties,  such  as  the  ability  to  antecede  a  reflexive  pronoun.) 
The  predicate  is  either  a  finite  verb  or  a  predicate  nominal.   From  the 
various  combinations  of  subjects  and  predicates  emerge  four  basic  sentence 
types:  (1)  nominative  subject  and  predicate  nominal,  (2)  nominative  subject 
and  finite  verb,  (3)  dative  subject  and  predicate  nominal,  and  (4)  dative 
subject  and  finite  verb.   The  four  basic  types  are  illustrated  here  with 
examples  from  Kannada  and  Konda. 

(1)    KANNADA 

a  avaru     adhyipakaru.  b  avaru     ba-nd-aru. 
he-h-nom  teacher-nom  he-h-nom  come-pst-3h 

'He  (is  a)  teacher.'  'He  came.' 
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(1)  KANNADA 

c  avar-ige  obba   maga. 

hs-h-dat  one-m  son-nom 

'He  has  a  son. ' 
d  avar-ige  tamilu     bar-at-te. 

he-h-dat  Tamil-nom  come-pres-3sn 

' He  knows  Tamil . ' 

(2)  KONDA 

a  map     sena  duramti       loku. 

we-nom  very  distance-abl  people-nom  'We  are  people  from  far  away, 
b  banz^raza        sur-t-an. 

barren  king-nom  see-pst-3sm 

'The  barren  king  saw.' 
c  aya   guruy-e»j  budi. 

that  guru-dat  wisdom-nom 

'That  guru  has  wisdom. ' 
d  lok-a«   kopam      va-n-ad. 

man-dat  anger-nom  come-npst-3sn 

'That  man  gets  angry.' 

Inspection  of  the  grammars  of  sister  languages  reveals  just  how  widespread 
and  basic  these  four  sentence  patterns  are. 

The  COMPLEX  sentence  consists  of  one  main  clause  at  the  extreme  right 
boundary  of  the  sentence,  and  one  or  more  dependent  clauses  preceding  it. 
Only  the  predicate  in  the  main  clause  is  finite:  it  occurs  rightmost  in  the 
constituent  structure,  commands  all  other  predicates  and  is  itself  commanded 
by  none.   The  predicate  in  each  dependent  clause  is  a  nonfinite  verb,  either 
a  peyareccam  or  viNaiyeccam,  according  to  the  demands  of  the  particular 
structure.   Examples  from  Tamil  and  Konda  are  used  below  to  illustrate  a 
variety  of  complex  sentences,  and  the  important  role  that  nonfinite  verbs 
play  in  their  formation. 

(3)  TAMIL 

a  mazai     pey-tu   veyil    ati-ttu  vaNavil      toNR-iy-atu. 

rain-nom  rain-cp  sun-nom  beat-cp  rainbow-nom  appear-pst-3sn 

'It  rained,  the  sun  beat  down  and  a  rainbow  appeared.' 
b  teyvattiN  arul   kitai-kkak  kavinar   unarcciyutaN  pat-iN-ar. 

god-gen    grace  get-inf     poet-nom  feeling-soc   sing-pst-3h 

'The  poet  sang  with  emotion  to  obtain  God's  grace,'  or 

'The  poet,  having  obtained  God's  grace,  sang  with  emotion.' 
c  makaN    poy  coNN-il    appa        ati-pp-ir. 

son-nom  lie  tell-cond  father-nom  beat-fut-3h 

'If  the  son  lies,  his  father  will  beat  him.'_ 
d  naN    mantiri       va-nt-a        ceyti  ket-t-eN. 

I-nom  minister-nom  come-pst-adnp  news   hear-pst-ls 

'I  heard  the  news  that  the  minister  had  come.' 
e  mantiri      va-nt-at-ai      avaN    par-tt-iN. 

minister-nom  come-pst-vn-acc  he-nom  see-pst-3sm 

'He  saw  that  the  minister  had  come. ' 
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(4)    KONDA  _     _ 

a  or   nend  vaz-i    darman  ki'a. 

one  day*   come-cp  help    do-imp 

' Come  one  day  and  help  us . ' 
b  anasi       un-de    bas-t-an. 

brother-nom  eat-inf   sit-pst-3sm 

'The  elder  brother  sat  down  to  eat.' 
c  embe   sur-t-i»ja  na£u      sil-ed. 

where  look-cond  town-nom  not  be-pst-3n 

'Wherever  they  looked,  there  was  no  town.' 
d  ru-n-i  guid-ed    so-R-ad. 

plough-npst-adnp  field-loc  go-pst-3sn 

'She  went  to  the  field  that  they  plough.' 
e  aya   bodel      sur-t-ika   id?ed. 

that  woman-nom  see-pst-vn  leave-neg-pst-3sn 

'That  woman  did  not  leave  alone  what  she  saw.' 

Use  of  the  conjunctive  participle  is  illustrated  in  (3a)  and  (4a) ;  the 
infinitive  in  (3b)  and  (4b) ;  the  conditional  in  (3c)  and  (4c) ;  the  adnom- 
inal  participle  in  (3d)  and  (4d) ;  and  in  (3e)  and  (4e)  the  verbal  noun, 
which  fuses  an  adnominal  participle  with  a  pronominal  head. 

The  distribution  of  finite  and  nonfinite  predicates  in  the  Dravidian 
sentence  leads  us  to  state  Rule  A  below,  a  first  approximation  to  the 
rule  that  governs  the  occurrence  of  finite  and  nonfinite  predicates. 

RULE  A  -  THE  DISTRIBUTION  OF  FINITE  PREDICATES 

i.    Each  Dravidian  sentence  has  exactly  one  finite  predicate. 

ii.   The  lone  finite  predicate  occurs  rightmost  in  the  surface 
constituent  structure,  commands  all  other  predicates  and 
is  itself  commanded  by  none. 

iii.   All  other  predicates  are  nonfinite. 

Inspection  of  example  (3a)  bears  out  the  predictions  of  Rule  A:  the  single 
finite  verb,  toNRiyatu  'it  appeared',  occurs  in  the  rightmost  clause  in  the 
surface  constituent  structure.   The  two  remaining  predicates,  peytu  'raining' 
and  atittu  'beating',  are  both  conjunctive  participles,  which  are  nonfinite 
verb  forms.   Note  that  Rule  A  does  not  distinguish  between  finite  verbs  and 
predicate  nominals:  both  are  treated  equally  as  finite  predicates.   Their 
distribution  is  identical:  predicate  nominals  occur  only  where  finite  verbs 
occur,  and  vice  versa.   In  essence.  Rule  A  defines  finiteness  as  a  syntactic 
property;  for  only  those  predicates  are  considered  finite  which  occur  in 
the  specially  designated  position  identified  in  the  second  clause  of  Rule  A. 
Finiteness  is  subsequently  interpreted  in  morphological  terms,  according  to 
the  lexical  class  of  the  predicate.   If  the  finite  predicate  is  a  verb,  it 
must  be  marked  for  tense  and  subject-predicate  agreement  (note  6).   If,  on 
the  other  hand,  it  is  a  noun,  it  must  be  marked  for  the  nominative  case 
and,  in  some  languages,  for  subject-predicate  agreement. 

Nouns  can  never  have  nonfinite  forms  according  the  rule  above.   Hence, 
predicate  nominals  can  never  appear  in  dependent  clauses.   The  third  clause 
of  Rule  A  properly  applies  only  to  nonfinite  verb  forms.  This  asymmetry 
between  nouns  and  verbs  gives  rise  to  some  problems,  as  we  shall  see  below. 
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2.   Multiple  Finte  Predicates  In  the  Dravidian  Sentence.   Despite  the  ingenuity 
with  which  nonfinite  verbs  are  used  to  construct  complex  sentences,  a  number  of 
syntactic  and  pragmatic  dilemmas  quickly  surface.   Rule  A  assigns  just  one 
finite  predicate  to  each  sentence,  no  matter  how  internally  complex  it  might 
be.   This  creates  numerical  and  structural  asymmetries  in  the  distribution  of 
finite  and  nonfinite  predicates.   Each  sentence  has  just  one  finite  predicate, 
but  may  have  indefinitely  many  nonfinite  predicates.   The  former  appears  only 
in  main  clauses,  the  latter  only  in  dependent  clauses.   Another  disparity 
emerges  in  the  distribution  of  predicates.   Predicate  nominals  have  only  finite 
forms;  so,  according  to  Rule  A,  they  may  never  appear  in  a  dependent  clause. 
It  follows  from  this  restriction  that  only  one-half  of  the  basic  clause-types, 
noted  above,  may  appear  in  a  dependent  clause:  those  with  a  predicate  nominal 
are  entirely  barred. 

The  syntactic  restrictions  imposed  by  Rule  A  lead  to  a  particularly  thorny 
pragmatic  constraint.   The  grammar  of  complex  sentences,  as  expressed  in  this 
rule,  provides  no  means  of  forming  direct  discourse.   Speaker  A  cannot  report 
what  speaker  B  said,  without  first  transforming  the  finite  predicate  in  speaker 
B's  speech  into  a  nonfinite  form,  embedded  under  a  verb  of  reporting  in 
speaker  A's  speech. 

(5)   TAMIL 

a  ni       va. 

you-nom  come- imp 
'You  come! ' 
b  avaN    eNNai  varac      coMaN. 

he-nom  I-acc  come-inf   tell-pst-3sm 
'He  told  me  to  come. ' 

Thus,  the  finite  imperative  verb- form  va  'come!'  in  (5a)  must  be  transformed 
into  the  nonfinite  infinitive  vara  'to  come',  because  the  verb  of  reporting, 
coNNaN  'he  told',  is  the  only  verb  that  Rule  A  will  designate  as  a  finite 
predicate.   By  the  same  token,  it  would  be  virtually  impossible  to  embed  a 
predicate  nominal  under  a  predicate  of  reporting.   This  consequence  of  Rule  A 
goes  against  the  general  belief  that  direct  discourse  is  both  less  marked  and 
more  common  than  indirect  discourse. 

To  solve  these  and  related  problems,  two  verbs  are  introduced  here  whose 
exceptional  behavior  allows  them  to  circumvent  in  a  clever,  yet  principled 
way  the  constraints  that  Rule  A  imposes  on  the  formation  of  complex  sentences 
in  constraining  the  distribution  of  finite  predicates.   They  are  *a^-  'become' 
and  *en-  'say,  think'.   They  share  a  common  property,  one  that  is  virtually 
unique  among  the  verbs  of  the  Dravidian  languages.   They  may  take  as  their 
direct  objects  expressions  of  any  syntactic  category  and  internal  complexity. 
Furthermore,  they  do  not  impose  on  these  objects  any  morphological  indication 
of  their  objecthood.   They  may  take  as  their  objects  interjections,  single 
words,  phrases,  sentences,  or  whole  chunks  of  discourse;  they  may  combine 
with  nouns  and  verbs  with  equal  facility.   They  do  not  require  their  direct 
objects  to  undergo  any  morphological  adjustment,  such  as  assuming  a  nonfinite 
verb-form  or  bearing  a  case-marker.   And,  in  fact,  they  cannot  do  so.   They 
are,  in  Lakoff's  terms,  negative  absolute  exceptions  to  case-assignment. 
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In  as  much  as  the  theory  of  case-assignment,  and  not  the  theory  of 
finiteness,  is  responsible  for  the  morphological  interpretation  of  direct 
objects,  the  verbs  *^-  'become'  and  *en-  'say,  think'  effectively  shield 
their  direct  objects  from  the  jurisdiction  of  Rule  A.   On  the  other  hand, 
since  these  two  verbs  are  negative  absolute  exceptions  to  case-assignment, 
they  need  not  subcategorize  objects  that  are  subject  to  case-assignment, 
viz.,  nouns.   This  frees  them  to  combine  with  expressions  of  any  syntactic 
category  and  any  morphological  shape.  What  is  most  relevant  for  our 
purposes,  however,  is  that  they  may  combine  with  a  finite  predicate. 

The  secret  of  their  success  in  overcoming  the  constraints  imposed  by 
Rule  A  derives  not  just  from  their  morphological  exceptionality,  but  also 
from  the  fact  that  they  are  morphologically  verbs.   As  such,  they  enjoy  all 
the  formal  and  morphological  privileges  of  that  class.   When  they  are 
inflected  as  verbs,  they  may  be  embedded  along  with  their  direct  objects  in 
a  larger  syntactic  pattern,  according  to  the  variety  of  complex  sentences 
discussed  above  in  Section  1 .   Although  the  inflected  forms  of  these  two 
verbs  often  appear  to  behave  like  complementizers  and  conjunctions,  they 
formally  belong  to  the  class  of  verbs.   And  it  is  only  on  this  assumption 
that  we  can  appeal  both  to  their  exceptionality  regarding  case-assignment 
and  to  their  ability  to  combine  with  nonfinite  forms  so  as  to  explain  how 
they  can  embed  a  finite  predicate  in  a  dependent  clause,  and  thus  over- 
come the  constraint  against  multiple  finite  predicates  in  the  sentence. 

The  following  examples  demonstrate  the  various  uses  to  which  reflexes 
of  *en-  'say,  think'  are  put  in  the  formation  of  complex  sentences  with 
multiple  finite  predicates. 

(6)  TAMIL 

a  naN    avaN    nallavaN      eN-Ru   niNai-kkiR-eN. 

I-nom  he-nom  good  man-nom  say-cp  think-pres-ls 

'I  think  that  he  is  a  good_man. ' 
b  atai    utai-tt-ay    eN-Ral    uNNaic   cummaka  vitamatteN. 

it-acc  break-pst-2s  say-cond  you-acc  alone    leave-fut-neg-ls 

'If  you  broke  it,  I  won't  leave  you  alone.' 
c  avaN    cettuppoNaN  eN-kiR-a       ceyti  ket-t-eN. 

he-nom  die-pst-3sm  say-pres-adnp  news   hear-pst-ls 

'I  heard  the  news  that  he  had  died.' 
d  avaN    iftke  va-nt-an      eN-patu     unmai      tiN. 

he-nom  here  come-pst-3sm  say-vn-nom  truth-nom  indeed 

'It  is  indeed  true  that  he  came  here.' 

(7)  KANNADA 

a  nannannu  nanna   padige     bittu-bidi  endalu     janaki. 

I-acc     I-gen   place-dat  leave-imp   say-pst-3sf  Janaki-nom 

'Janaki  said,  "Leave  me  alone."' 
b  emtiarinalli  dose   tinona  an-ta   mane   bittu    hora-t-a. 

MTR-loc       dosai  eat-Fiort  say-cp  house  leave-cp  start-pst-3sm 

'He  went  out  in  order  to  eat  dosai  at  the  MTR. ' 
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(8)  KONDA 

a   ma  buba        rey-n-a       i-R-an. 

my  father-nom  6eat-npst-ls  say-pst-3sm 

'My  father  said,  "I  will  beat  you."' 
b   u-n-a        i-Ri^a    anam      sil-ed. 

eat-npst-ls  say-cond  food-nom  not  be-pst-3sn 

'There  was  no  food  to  eat,"  lit.,  "There  was  no  food,  if  you  say 

"I  want  to  eat."' 

(9)  KUVI 

a   evasi   nehasi        inj-ihi  nanu   nam-i-Yi. 

he-nom  good  man-nom  say-cp   I-nom  believe-npst-ls 
'I  believe  that  he  is  a  good  man.' 
b   e     samomita  ro   mosali        anataki  tir-vi     man-j-i 
that  time-loc  one  crocodile-nom  wht      wander-cp  be-pst-2s 
inj-ihi  burhaTa      vec-c-e. 
say-cp   old  man-acc  ask.-pst-3sn 
'Then  the  crocodile  asked  the  old  man,  "Why  were  you  wandering?" 

(10)  KOLAMI 

a  an     sa-t-un     en-a    ar-t-an. 

I-nom   go-impf-ls  say-cp  cry-pst-ls 

'I  cried  because_(lit. ,  saying)  I  was  going  away.' 
b  amd     anu    va     en-a    it-t-an. 

he-nom  I-dat  come-imp  say-cp  tell-pst-3sm 

'He  told  me  to  come.' 

(11)  OLLARI 

a  raytul      tre  tre       in-ji   sindup-eti  say-d-ar. 
farmer-nom  tre  tre-onom  say-cp  call-cp     be-fut-3sm 
'The  farmer  would  call  out,  "Tre  tre"  (to  the  sheep).' 
b  endrika    bofjga      ke-n-ed     in-ji   adin    pakkan 
cfabs-nom  holes-acc  do-pst-3pn  say-cp  it-gen  side 
bokka     ke-d-an. 
hole-acc  do-npst-3sm 
'Claiming  that  crabs  dug  the  holes  (in  his  neighbor's  tank  bund), 
he  will  dig  a  hole  in  its  side.' 

Reflexes  of  *en-  'say,  think'  embed  predicate  nominals  in  (6a)  and  (9a); 
ordinary  finite  verbs  in  (6b),  (6c),  (8a),  (8b),  (9b),  (10a),  and  (lib); 
speech-act  verb-forms  in  (7a),  (7b)  and  (10b);  and  onomatopoeia  in  (11a). 
The  verb  *en-  may  itself  appear  in  a  number  of  different  forms:  as  a 
finite  verb  in  (7a)  and  (8a);  as  a  conditional  in  (6b)  and  (8b);  as  a 
verbal  noun  in  (6d) ;  as  an  adnominal  participle  in  (6c) ;  and  as  a  con- 
junctive participle  in  the  other  examples.   The  verb  has  its  full  lexical 
meaning  in  (7a),  (8a)  and  (lib);  and,  in  these  examples,  may  be  said  to 
be  "base-generated."  In  the  other  examples,  however,  it  seems  not  to 
contribute  a  lexical  meaning  to  the  meaning  of  the  whole  sentence; 
accordingly,  it  may  be  considered  to  be  "transformationally  inserted." 
Despite  the  individual  differences  to  be  found  among  these  sentences, 
all  share  the  common  property  of  embedding  a  finite  predicate  under  the 
verb  *en-. 

Reflexes  of  the  common  Dravidian  verb  *ar-    'become'  are  also  used  to 
serve  the  same  kinds  of  syntactic  functions,  as  the  set  of  examples 
overleaf  demonstrates. 
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(12)  TAMIL 

camaN  vanka    ventum       aN-al       vanku    all-atu 
goods  buy-lnf  want-fut-3sn  become-cond  buy-imp  become-neg-vn 

kataiyai   vit-tu    iRanku. 

store-acc  leave-cp  get  down-imp 
'Unless  you  are  going  to  buy  the  goods  you  want  to  buy,  leave 
the  store. ' 

(13)  KANNADA 

yaru        avana   edurige  aduttiralilla       ad-ar-u 

no  one-nom  he-obl  against  speak-pst  perf-neg  become-cond-and 

hengsara  mulaka   i     mltu  kaveriya  kivi  muttuttittu. 

women-gen  through  this  word  K.-gen    ear   reach-pst  perf-3sn 
'Though  no  one  has  spoken  directly  to  him  about  this  matter, 
word  reached  Kaveri's  ear  through  women's  gossip.' 

(14)  PENGO 

hi-d-n-ay       ac-i       hi-d-a. 

give-tp-fut-2s  become-cp  give-tp-imp 

'If  you  are  going  to  give  it  to  me,  give  it.' 

(15)  Pf^Jl 

in       nato  andar         pat-t-o-me-d-a       ag-i 
you-obl  what  confusion-nom  befall-pres  prf-3sn  become-cp 

in       bulkak-mo-t. 

you-nom  err-pres-2s 
'What  is  confusing  you  that  you  should  make  such  a  mistake?' 

These  four  examples  suffice  to  show  us  that  reflexes  of  the  verb  *a- 
'become'  can  be  used  to  embed  finite  predicates.   In  (12)  the  conditional 
form  aNal  'if  become'  embeds  the  finite  predicate  ventum  'be  wanted', 
while  the  negative  verbal  noun  allatu  'that  which  does'not  become',  based 
on  al-  'not  become'  the  negative  stem  variant  of  *a-  'become',  embeds 
the  finite  imperative  verb  v5nku  'buy!'   Further  examples  can  easily  be 
found  in  the  relevant  grammars  and  texts. 

These  examples  necessitate  a  revision  of  Rule  A  to  accommodate  the 
sentences  above  with  multiple  finite  predicates.   The  number  and  position 
of  finite  predicates  in  the  Dravidian  sentence  vary  directly  with  the  number 
and  position  of  instances  of  these  two  verbs  in  their  capacity  as  comple- 
mentizers.  They  are  called  FINITE  PREDICATE  EMBEDDERS,  or  FPEs  for  short. 

RULE  B  -  THE  DISTRIBUTION  OF  FINITE  PREDICATES,  REVISION 

i.     The  Dravidian  sentence  has  exactly  n+1  finite  predicates, 

where  n  equals  the  number  of  FPES  and  1  represents  the  finite 
predicate  in  the  main  clause. 

ii.    The  finite  predicate  in  the  main  clause  occurs  highest  and 
rightmost  in  the  constituent  structure,  commands  all  other 
predicates  and  is  itself  commanded  by  none.   Each  FPE  embeds 
a  finite  predicate  in  a  dependent  clause.   The  finite  predicate 
associated  with  a  FPE  occurs  highest  and  rightmost  in  the 
sentence  commanded  by  the  FPE,  commands  all  other  predicates  in 
that  sentence  and  is  itself  commanded  by  none. 


iii.   All  other  predicates  are  nonfinite. 
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This  rule  accurately  describes  the  distribution  of  finite  predicates  in  the 
Dravidian  sentence.  Using  the  various  forms  of  *a^-  'become'  and  *en-  'say, 
think',  a  sentence  can  embed  all  of  the  four  basic  sentence  types,  and  form 
direct  discourse.  Thus,  verbs  that  are  negative  absolute  exceptions  to  the 
principles  of  case-marking  can  take  finite  predicates  as  their  direct  objects 
and  embed  them  in  a  dependent  clause  (note  7) . 

3.   Further  Exceptions  to  Case-Assignment.   Rule  B  explicitly  states  what 
Dravidian  linguists  have  tacitly  understood  for  a  long  time.   What  has  not 
been  as  clearly  recognized  heretofore  is  that  many  other  predicates  may  on 
occasion  enter  the  elite  class  of  FPEs.   Although  the  identity  of  these 
lexemes  varies  from  one  language  to  the  next,  the  following  examples  indicate 
that  they  are  largely  drawn  from  predicates  of  perception,  cognition, 
communication,  and  comparison.   Note  also  that  both  nouns  and  verbs  can 
serve  as  FPEs. 

(16)  TAMIL 

a  ni       eNNa  cen-cu  iru-kka-num  teriyum-a? 

you-nom  what  do-cp   be-inf-must  know-fut-3sn-int 
'Do  you  know  what  you  must  do?'      _  _ 

b  anta  alu      rompa  nallavaN      matiri    iru-kk-aN 
that  man-nom  very   good  man-nom  likeness  be-pres-3sm 
par-tt-iy-i? 
see-pst-2s-int 
'Did  you  understand  how  good  that  man  is?' 
c   .  .  .  moy-tt-a        tumpi    pazam     cettu  .  .  .  (naRRinai  35.3) 
swarm-pst-adnp  bee-nom  fruit-nom  think  like-cp 
'Mistaking  the  swarming  bees  for  a  fruit  .  .  . ' 

(17)  TODA 

i  mox  i*  xis-s-k  nw'l't-k-in. 
this  boy-nom  what  do-pst-3s  see-fut-ls 
'I  will  see  what  this  boy  did.' 

(18)  KANNADA 

a  ninge    estu  usna      agide  gott-a? 

you-dat  how   heat-nom  become-pres  prf-3sn     knowledge-nom-int 

'Do  you  know  how  hot  it  has  become? ' 
b  nanu   hel-t-ini     kelu. 

I-nom  tell-pres-ls  hear- imp 

'Listen  to  what  I  am  saying.' 
c  pasarise   tumbi-gal   kuvalayangal       aral-du-ve 

spreading  bee-pl-nom  water  lily-pl-nom  open-pres-3np 
gettu  .  .  .  (pampa  bharata  5.63). 
think  like-cp 

'Mistaking  the  swarming  bees  for  a  blossoming  water  lily  ..." 

(19)  TELUGU 

a  ayanuku  kasta   dhairyam     niwu       ceppali. 

he-dat   little  courage-nom  you-nom    say-may 

'You  might  say  he  has  little  courage.' 
b  cep-t-anu    vinu. 

tell-fut-ls   hear-imp 

'Listen  to  what  I  am  going  to  tell  you.' 
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(20)  KONDA 

a  embe   so-R-a     ma-n-an     veR-si   sl-d-a. 

where  go-pst-tpe  be-npst-3sm  tell-cp  give-imp 

'Tell  me  where  he  has  gone.' 
b  nan     unri  mata  veR-n-a       ve-n-id-a? 

I-nom   one   word  tell-npst-ls  hear-npst-2s-int 

'Are  you  listening  to  what  I  am  telling  you? ' 

(21)  PARJI 

nito   ron     pat-t-o-me-d-a       cur-i   ver. 
which  crying  befall-pres  prf-3sn  see-cp  come-imp 
'See  which  woman  is  crying,  and  come  (back).' 

Observe  that  this  set  of  examples  includes  FPEs  that  are  morphologically 
nouns:  mitiri  'likeness,  manner'  in  (16b)  and  gottu  'knowledge'  in  (18a) 
are  predicates  that  both  embed  finite  predicates.   They  appear  to  have 
entered  this  class  of  FPEs  on  the  strength  of  their  semantic  similarity 
with  the  verbs  that  more  commonly  migrate  into  this  class.   These 
occasional  FPEs  embed  either  predicate  nominals,  as  in  (16a),  (16c), 
(I8c),  and  (19a),  or  finite  verbs,  as  in  the  remaining  examples  of  this 
set.   These  FPEs  may  themselves  appear  in  main  or  dependent  clauses. 
Although  the  majority  of  these  FPEs  contribute  their  full  lexical  meaning 
to  the  sentences  they  appear  in,  and  to  that  extent  fall  just  short  of 
functioning  are  pure  complementizers  (note  s) »  certain  FPEs  in  the  Havyaka 
dialect  of  Kannada  (Shastri  1971)  and  the  Ramnad  dialect  of  Tamil 
(Annamalai  1969:210)  have  lost  their  lexical  meanings  and  assumed  the 
role  of  complementizer  which  forms  of  *en-  'say,  think'  formerly  held. 

(22)  KANNADA  (HAVYAKA  DIALECT) 

a     idu  yen       isOry^  hel  kel-tu. 

this-nom     what     wonder-nom       say-cp     ask-pst-3sn 

'She  asked,    "What   is   this  wonder?'" 
b     tan>gu  9s t       kodi  hel-i       hel-t-yu  du. 

self-dat-and     some       give-imp     say-cp     say-pst-3sn     that-nom 

'She   said,  ^Give  me   some,    too."' 
c     sdtt  hag       bid-k>m-bitr  atu  hel-i 

die;^adnp     like     fall-rfl-leave-cp     become-pst-3sn     say-cp 
hagi  mad-tu. 

like  do-pst-3sn 

'She  was   lying  down  as   though  she  were  dead.' 

(23)  TAMIL  (RAMNAD  DIALECT) 

avaN    var-r-eN-Nu       coll-i  coN-N-aN. 
he-nom  come-pres-ls-prt  say-cp  say-pst-3sm 
'He  said  he  was  coming.' 

In  both  sets  of  examples,  a  verb  of  communication,  hel-  'say'  in  Havyaka 
Kannada  and  col-  'say'  in  Ramnad  Tamil,  has  replaced  the  now  obsolete 
forms  of  the  verb  *en-  'say',  anta  'saying'  in  Kannada  and  eNRu  'saying' 
in  Tamil  (note  9 ) ,  in  its  capacity  as  a  complementizer  of  finite  predicates. 
It  must  be  borne  in  mind  that  the  single  most  important  syntactic  property 
of  Dravidian  FPEs  is  not  whether  they  have  a  lexical  meaning,  but  whether 
they  can  take  finite  predicates  as  their  direct  objects.   It  is  enough  to 
show  for  now  that  such  predicates  may  temporarily  migrate  into  the  class  of 
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negative  absolute  exceptions  to  case-assignment,  and  so  behave  like  *a- 

' become'  and  *en-  'say,  think'.   Since  the  temporary  FPEs  vary  from  language 

to  language,  their  identity  must  be  verified  anew  for  each  language. 

The  verb  *a-  'become'  seems  to  be  a  metonym  for  all  of  the  predicates 
of  comparison  that  function  as  FPEs:  old  Tamil  cettu  'think  like',  Tamil 
pola  'resemble'  and  matiri  'likeness',  old  Kannada  gettu  'think  like',  etc. 
The  verb  *en-  'say,  think',  on  the  other  hand,  is  a  metonym  for  all  the 
predicates~of  perception,  cognition  and  communication  that  can  serve  as 
FPEs:  Tamil  colla  'tell',  parkka  'look  at',  teriya  'be  known',  ceyti  'news'; 
Kannada  hilu  'say',  kelu  'listen',  nodu  'look  at',  gottu  'knowledge',  nija 
'truth,  fact';  Telugu  ceppu  'say',  telusu  'be  known',  vinu  'listen',  nijam 
'truth';  Pengo  hur-  ' see ' ;  Konda  ven  'hear',  veR  'tell';  Parji  cur-  'look  at' 
etc.   Study  of  texts  will  almost  certainly  bring  to  light  additional  pre- 
dicates that  may  on  occasion  pass  into  the  set  of  FPEs.   With  the  aid  of 
these  temporary  FPEs,  many  putative  cases  of  parataxis  and  asyndeton,  which 
have  been  reported  in  the  Dravidian  languages,  may  be  given  a  more  satis- 
factory and  systematic  explanation. 

There  is  no  need  to  revise  Rule  B  to  accommodate  these  other  examples. 
As  long  as  the  grammar  of  each  language  lists  the  FPEs  it  sanctions,  this 
rule  will  continue  accurately  to  describe  the  distribution  of  finite  and 
nonfinite  predicates  in  the  Dravidian  sentence.   Since  this  system  and  this 
rule  are  so  pervasive  and  engrained  in  the  daughter  languages,  it  is  not  too 
outlandish  to  reconstruct  Rule  B  —  or  something  very_like  to  —  for  the 
Proto-Dravidian  era.   Both  Rule  B  and  the  two  verbs  *a-    'become'  and  *en- 
'say,  think',  in  their  capacities  as  complementizers,  seem  to  belong  to  the 
common  Dravidian  heritage. 

4.   Diffusion,  Convergence  and  Syntactic  Evolution.   Up  to  this  point,  all 
mention  of  the  North  Dravidian  languages  has  been  studiously  avoided.   The 
syntactic  patterns  in  this  subgroup  deviate  markedly  from  those  discussed 
in  the  earlier  sections  of  this  paper.   Yet  the  phenomenon  of  temporary  FPEs 
may  be  able  to  cast  new  light  on  one  facet  of  the  syntactic  evolution  of  the 
northern  subgroup.   These  languages  seem  to  have  developed  new  syntactic 
patterns  by  admitting  into  the  set  of  FPEs  virtually  any  predicate  that  sub- 
categorizes  a  sentential  complement.   The  influx  of  so  many  new  members  into 
this  set  would  have  cost  it  its  exceptionality,  which  would  have  lead  in 
turn  to  a  fundamental  reorganization  of  sentence  structure  around  the  dimen- 
sion of  finiteness.   Consider  the  following  Kurux  examples. 

(24)   KURUX 

a  as      bar 'on       bacas. 

he-nom  come-fut-ls  say-pst-3sm 

'He  said,  "I  will  come."' 
b  en     asra   nan-d-an    enghae  sannglas    bar'os. 

I-nom  desire  do-pres-ls  I-gen   friend-nom  come-fut-3s 

'I  hope  that  my  friend  will  come.' 

In  both  examples,  the  finite  verb  of  the  main  clause  combines  directly  with 
a  dependent  clause  containing  a  finite  verb,  without  the  intervention  of  a 
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FPE  acting  as  a  complementizer.   Note  that  example  (24b)  positions  the 
dependent  clause  after  the  main  clause  in  imitation  of  a  common  Indo-Ayran 
syntactic  pattern.   Yet,  if  the  syntactic  influence  were  exerted  on  North 
Dravidian  from  Indo-Aryan,  the  changes  exploited  the  Dravidian  device  of 
conjunctive-participle  postposing  to  do  so  (note  10).   The  loss  of  "comple- 
mentizers" in  this  subgroup  is  not  the  result  of  the  diffusion  of  syntactic 
patterns  from  one  language  family  to  another,  but  the  result  of  carrying  to 
its  logical  end  the  expansion  of  the  class  of  FPEs  until  it  has  lost  its 
status  of  exceptionality. 

This  instance  of  syntactic  development  in  North  Dravidian  may  be  viewed 
as  an  argument  against  an  overly  simple  theory  of  the  diffusion  of  grammatical 
features.   Even  though  this  theory  has  some  currency  in  South  Asian  linguis- 
tics, it  has  been  formulated  only  at  relatively  superficial  levels  of 
linguistic  structure,  often  with  fragmentary  data  and  cursory  analysis. 
What  Sapir  (1921:205-6)  says  about  diffusion  bears  repeating  here: 

Attempts  have  sometimes  been  made  to  explain  the  distribution 
of  these  fundamental  structural  features  by  the  theory  of  diffusion 
...  An  examination  of  such  cases,  however,  invariably  reveals  the 
significant  fact  that  they  are  but  superficial  additions  on  the 
morphological  kernel  of  the  language. 

What  is  said  here  about  morphological  drift  may  be  equally  well  said  about 
syntactic  development  in  Dravidian,  not  the  least  because  these  languages 
extensively  utilize  their  morphology  to  encode  grammatical  relations.   If 
Indo-Aryan  has  exerted  an  influence  on  the  development  of  Dravidian  syntax, 
it  has  done  so  at  a  remove.   These  changes  have  been  mediated,  however,  by 
Dravidian  grammatical  processes  operating  on  Dravidian  grammatical 
structures.   As  the  expansion  of  the  set  of  FPEs  shows,  the  actual  devel- 
opments have  followed  a  course  charted  by  Dravidian  forms  and  rules. 

This  brief  discussion  shows  us  how  ingeniously  the  Dravidian  languages 
employ  their  morphological  and  lexical  resources  to  create  complex  syntactic 
structures.   Two  sets  of  devices  have  figured  prominently  in  the  discussion. 
Nonfinite  verbs  are  used  widely  to  form  dependent  clauses,  which  are  ulti- 
mately linked  with  the  finite  predicate  in  the  main  clause  of  the  sentence. 
FPEs  are  used  to  embed  finite  predicates  in  dependent  clauses.   They  owe  their 
ability  to  do  this  to  the  fact  that  they  are  negative  absolute  exceptions  to 
case-assignment,  and  so  may  combine  with  a  direct  object  of  any  category 
without  first  having  it  undergo  a  morphological  change  to  mark  objecthood  or 
subordination.   Nonfinite  verbs  are  less  marked  than  FPEs:  while  all  verbs 
may  have  nonfinite  forms,  not  all  verbs  can  readily  be  exceptions  to  the 
principles  of  case-assignment.   The  syntactic  behavior  of  FPEs  presupposes 
the  syntactic  behavior  of  ordinary  verbs,  as  well  .   While  neither  morpho- 
logy nor  lexicon  can  shoulder  the  entire  burden  of  accounting  for  syntactic 
concepts  such  as  constituency  and  dependency,  they  nevetheless  contribute 
significantly  to  the  grammar  of  complex  sentences  in  Dravidian. 
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Further  research  may  well  corroborate  the  initial  observation  that  in  the 
Dravidian  languages  morphology  encodes  grammatical  relations  that  would  be 
encoded  by  syntactic  means  in  other  languages.   This  study  may  help  to  bring 
into  greater  relief  the  principles  that  govern  the  interaction  of  morphology 
and  syntax  as  a  function  of  a  language's  typological  character.   Such 
principles  would  then  contribute  to  the  study  of  language  typology  and 
universals.   It  is  perhaps  not  too  much  to  expect  that  genetically  unrelated, 
but  typologically  similar  languages,  such  as  Sinhala,  would  resort  to  devices 
similar  to  those  found  in  Dravidian  to  construct  complex  sentences.   The  pre- 
requisites for  such  a  language  appear  to  be  a  highly  limited  number  of  parts 
of  speech,  the  significant  use  of  finiteness  in  constituent  structure,  an 
elaborate  morphology,  and  determination  of  case-assignment  by  the  verb. 


*    The  author  wishes  to  thank  Bh.  Krishnamurti ,  James  Gair,  Hans  Hock, 
and  S.N.  Sridhar  for  their  perceptive  commends  on  earlier  versions  of  this 
paper.   Responsibility  for  all  errors  rests  solely  with  the  author. 

1.  See  Steever  (1986)  for  a  discussion  of  the  typological  features  of  the 
Dravidian  languages.   Although  individual  grammars  may  enumerate  many  more 
parts  of  speech,  they  can  generally  be  resolved  into  various  combinations  of 
nouns  and  verbs.   So-called  indeclinables  may  prove  to  be  defective  nouns  or 
verbs,  and  assigned  to  one  category  or  the  other  on  a  case-by-case  basis. 
There  is  also  a  set  of  clitic  particles,  e.g.,  *-o^  'or,  whether',  which 
play  an  important  role  in  the  syntax  of  complex  sentences  (see  also  note  7) . 
Clitics  are  more  marked  than  free  words  because  they  presuppose  the  existence 
of  the  words  to  which  they  are  cliticized,  although  the  converse  is  not 
necessarily  true. 

2.  The  opposition  of  verb  and  noun  constitutes  the  logically  minimal  system 
of  parts  of  speech.   No  language  can  have  only  one  part  of  speech,  just 
lexical  bases  that  are  indifferently  inflected  as  verbs  or  nouns.   As  Sapir 
(1921:119)  suggests,  the  two  must  be  defined  in  opposition  to  each  other. 

3.  New  categories,  innovated  from  within  or  borrowed  from  outside  Dravidian, 
are  drawn  into  the  orbit  of  either  nouns  or  verbs.   For  example,  the  Sanskrit 
privative  prefix  a^-,  ava-  is  reanalyzed  in  Tamil  as  the  noun  avam  'emptiness, 
void,  nullity',  with  the  stem  variants  ava-  and  £-.   See  Steever  1986  for  a 
fuller  discussion. 

4.  Case  is  assumed  to  be  different  from,  though  correlated  with,  thematic 
role.   Furthermore,  the  genitive  case  does  not  easily  fit  this  heuristic 
characterization  of  case  since  in  Dravidian  it  is  adnominal.   Case  is 
commonly  held  to  describe  a  relation  between  a  noun  and  a  verb,  but  the  geni- 
tive relates  two  nouns.   It  might  be  characterized  —  unsatisfactorily,  I 
think  —  as  a  case  that  relates  one  noun  to  a  verb  through  the  means  of 
another  noun. 
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Furthermore,  this  jury-rig  definition  applies  more  properly  to  the 
abstract,  rather  than  the  localistic,  cases.   The  latter  may  be  said  more 
properly  to  relate  a  noun  to  a  clause  or  sentence  as  a  whole,  rather  than 
just  to  the  verb. 

5.  The  first  four  cases  are  considered  to  be  abstract  or  nonlocalistic; 
the  second  four,  to  be  concrete  or  localistic.   The  shape  of  the  first 
four  is  relatively  easy  to  reconstruct  to  the  protolanguage;  that  of  the 
second  four  is  not.   The  concrete  cases  are  often  supplemented  by  and, 
ultimately  replaced  by,  postpositions  derived  from  independents  nouns  or 
verbs  in  nonfinite  forms. 

6.  A  language  such  as  Malayalam  lacks  personal  endings  to  mark  subject- 
predicate  agreement  on  verbs.  'Therefore,  it  raorpohologically  interprets 
the  syntactic  property  of  finiteness  is  a  different  way. 

7.  Besides  nonfinite  verb  forms  and  FPEs  belonging  to  the  class  of  verbs, 
a  number  of  postclitic  particles  are  also  used  to  form  complex  sentences. 
Clitics  such  as  *-o^   'whether,  or',  *e^-  'even,  and',  Kannada  -alia  'isn't  it?', 
and  Telugu  gada  'isn't  it?'.   Included  among  these  sentence  types  are 
dubitative,  correlative  and  appositive  constructions.   For  the  purpose  of 
describing  the  distribution  of  finite  predicates  in  the  Dravidian  sentence, 
these  clitics  are  to  be  treated  on  an  equal  footing  with  the  FPEs  in  Rule  B. 
No  new  category  of  finite  predicate  embedders  needs  to  be  devised;  we  need 
only  to  expand  the  set  of  FPEs  to  include  these  clitics.   (See  note  9,  where 
the  Tamil  verb  eN-  'say,  think'  becomes  the  postclitic  particle  -Nu  'ibid' 
without  losing  its  status  as  a  member  of  the  set  of  FPEs.) 

8.  What  is  essential  in  these  examples  with  reflexes  of  *en-  'say,  think' 
is  not  whether  they  have  a  full  lexical  meaning;  for  in  some  cases  they  do, 
in  others,  they  do  not.   What  is  germane  is  that  they  may  combine  with  finite 
predicates,  embedding  them  in  dependent  clauses.   It  must  be  borne  in  mind 
that  the  notion  of  complementizer,  as  it  applied  in  Dravidian,  is  an  epiphe- 
nomenal  function,  not  a  basic  formal  category  realized  in  a  particular  part 
of  speech. 

9.  The  form  -Nu  is  a  clitic  particle  that  is  formed  through  the  contraction 
of  the  conjunctive  participle  eN-Ru  'saying'.   In  the  present  example,  it 
could  be  analyzed  either  as  a  clitic  particle  that  embeds  a  finite  predicate 
(note  7) ,  or  as  a  boundary  marker  that  is  syncategorematically  introduced  into 
the  structure  of  a  matrix-complement  sentence. 

10.  Though  example  (24b)  exhibits  a  word-order  that  is  typical  of  modern 
Indo-Aryan,  not  Dravidian,  it  can  be  shown  (Steever  1987)  that  such 
constructions  developed  from  native  Dravidian  constructions  with  the  help 
of  a  common  rule  that  permits  the  postposing  of  certain  dependent  clauses 
rightward  over  the  finite  verb.   Examples  from  Tamil  and  Kannada  are  shown 
overleaf,  in  which  a  dependent  clause  with  a  conjunctive  participle  is 
postposed  rightward  over  the  finite  verb. 
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TAMIL 

(a)  avaN    oti     va-nt-aN. 
he-nom  run-cp  come-pst-3sm 
'He  came  running. ' 

(a')   avaN    va-nt-aN      oti. 

he-nom  come-pst-3sm  run-cp 
'He  came  running.' 

(b)  avaN    varu-kiR-eN  eN-Ru   coN-N-aN. 
he-nom  come-pres-ls  say-cp   tell-pst-3sm 
'He  said  he  was  coming. ' 

(b')   avaN    coN-N-aN      varu-kiR-eN   eN-Ru. 
he-nom  tell-pst-3sm  come-prcs-ls  say-cp 
'He  said  he  was  coming.' 
KANNADA 

(c)  avalu    nagutta    matad-id-alu. 
she-nom  laugh-cp   speak-pst-3sf 

'She  spoke  with  a  laugh  (lit.,  laughing).' 
(c')   avalu    matidi-id-alu   nagutta. 
she-nom  speak-pst-3sf   laugh-cp 
'She  spoke  with  a  laugh.' 

It  is  easy  to  see  that  the  rule  which  is  responsible  for  this  alternation 
could  be  exploited  in  the  production  of  the  now  dominant  word-order  found 
in  the  Kurux  examples  in  (24) . 

See  Hock  (1982,  1984)  for  arguments  in  favor  of  linguistic  conver- 
gence which  are  based  on  crosslinguistic  influence  mediated  through  the 
grammatical  structures  of  the  different  languages.  Although  I  do  not 
necessarily  agree  with  what  is  said  about  the  historical  development  of 
the  Dravidian  languages  in  particular,  the  methodology  is  sound  in  that 
it  insists  on  relating  linguistic  changes  to  linguistic  structures, 
whatever  the  ultimate  motivation  for  those  changes  may  be. 
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Reflexives  and  reciprocals  in  Dravidian  languages  are 
manifested  by  either  a  verbal  reflexive  (VR)  or  by  a  reduplicated 
structure  or  by  both.l  Amritavalli  (1984)  claims  that  the  verbal 
reflexive  'anaphorizes '  the  anaphor  in  some  cases.  In  this  paper, 
we  shall  show  that  both  -  the  reflexive  and  reciprocal  in 
Dravidian  languages  -  can  be  accounted  for  by  a  unitary  process  of 
case  copying  and  that  there  is  no  need  to  include  a  rule  which 
anaphorizes  the  anaphor  in  the  Grammar. 


1.1  The  aim  of  this  paper  is  to  argue  for  an  analysis  which  provides  a 
unitary  solution  to  account   for  reflexive  and  reciprocals  in  Dravidian 
without  adding  any  new  rules  to  the  Grammar.  Let  us  consider  sentences 
such  as  (1)  and  (2)  from  Kannada  (VR  stands  for  verbal  reflexive). 

(1)  nanu  nannannu   hodedu-kon-de 

I     I  +  ACC    hit   Vr'  AUX 
'I  hit  myself. ' 

(2)  ram  j  avanannu  )  hodedu-kon-da 
Ram  1  he  +  ACC  (  hit    Vr'  AUX 

tannannu2 \ 
_he  +  ACCj' 
Ram  hit  himself.'    (Amritavalli, 1984) 

Notice  that  there  is  no  overt  reflexive  pronoun  as  such  in  these  sentences 
and  there  is  a  verbal  reflexive  kon/kol  occurring  in  the  verb.  The  question 
that  arises  is  :  How  is  the  coreference  established  in  such  sentences? 
According  to  Amritavalli  (1984:3),  it  is  the  verbal  reflexive  kon/kol  that 
'anaphorizes'  pronouns  in  some  way  exempting  them  from  the  requirement  of 
'Disjoint  Reference'.  Her  claim  is  that  anaphors  in  Kannada  and  other 
Dravidian  languages  and  also  in  English  have  a  'bipartite  structure'  :  one 
is  a  simple  pronoun  and  the  other,  the  anaphoric  part,  is  due  to  a  process 
which  fills  in  'values  for  an  abstract  agreement  matrix  AG'   (ibid:l).3  That 
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is,  in  the  verbal  reflexive  kon/kol  anaphorization  process,  only  an  abstract 
feature  matrix  is  copied  to  the  second  identical  occurrence  of  the  noun 
phrase  and  Amritavalli  claims  that  an  abstract  anaphorization  rule  should  be 
included  in  the  grammar.  Such  an  anaphorization  rule  in  her  approach, 
however,  is  not  required  in  those  cases  where  an  anaphor  occurs  in 
reduplicated  form  in  sentences  such  as  (3)  containing  a  dative  subject  and 
in  sentences,  such  as  (4)  containing  reciprocals. 

Ka  (3)  mohan-ige   tann-a-  mele   tana- ge  nambike  ilia 

Mohan  DAT  self  GEN  on   self  DAT  trust   is  not 

anta  gopal  helidanu 

that  Gopal  said 

'Gopal  said  that  Mohan  does  not  hit  himself.' 

(Amritavalli,  1984) 


Ka  (4)  avaru-,^  abbru-anna  obbaru-^   hodedaru4 


they     someone+ACC  someone 
"They  hit  each  other.' 


hit 


Notice  that  the  second  part  of  the  anaphor  agrees  with  the  subject  (the 
dative  in  sentence  (3)  and  ^  in  sentence  (4)).  According  to  Amritavalli,  it 
is  the  reduplication  of  the  anaphor  together  with  the  case  agreement  that  is 
crucial  for  establishing  corefernce  in  sentences  such  as  (3)  and  (4). 

Subbarao  and  Saxena  (1984)  propose  that  ref lexivization  in  Dravidian 
languages  is  manifested  by  a  bound  anaphor  and  the  verbal  reflexive  kon/kol. 
The  constitution  of  the  bound  pronoun,  they  argue,  is  a  reduplicated  form 
consisting  of  a  pronominal  object  and  a  pronominal  copy  of  the  subject.  They 
argued  that  the  bound  anaphor  occurs  in  a  reduplicated  form  in  all  the 
constructions  containing  a  bound  anaphor,  namely,  the  reflexive  sentences 
containing  the  verbal  reflexive  kon/kol,  the  dative  subject  construction  and 
the  reciprocals . 5 

We   shall   show  that  Subbarao  and  Saxena 's  analysis  which 

provides  a  unitary  explanation  to  account  for  reflexives  and  reciprocals 
should  be  preferred  to  Amritavalli 's  analysis  which  gives  two  different 
explanations  for  reflexives  and  reciprocals  and  also  requires  an  additional 
anaphorization  rule  to  be  included  in  the  grammar.  Data  in  this  paper  will 
be  drawn  mainly  from  Telugu.  Crucial  data  divergent  from  Telugu  will  be 
provided  from  Kannada,  Tamil  and  Malayalam  to  substantiate  our  analysis. 
Section  1.2  of  this  paper  deals  with  reflexives.  Section  1.3  deals  with 
reciprocals.  And  section  1.4  evaluates  the  proposals  concerning  reflexives 
and  reciprocals  with  a  view  to  choosing  an  explanatorily  adequate  analysis. 

1.2  It  is  important  to  mention  at  the  outset  that  there  is  no  reflexive 
pronoun  as  such  in  Dravidian.  The  reflexive  meaning  is  imparted  either  by  a 


123 


reduplicated  bound  anaphor  together  with  a  verbal  reflexive  such  as  kon/kol 
or,  solely  by  a  reduplicated  bound  anaphor  in  the  dative  subject 
construction.  Consider  the  following  sentences.  (The  bound  anaphor  and  the 
verbal  reflexive  are  underscored) . 


Te  (5)  nannu  nenu  kottu  -  komi-  anu 
I  +ACC  I  hit'  VR  AUX 
•I  hit  myself. ' 

Te  (6)  wadu  tanani  tanu  kottu  -  konn  -5.du 
he'    he  +ACC  he    hit    VR     AUX 
'He  hit  himself. ' 

Ka  (7)  nannannu  nanu  hodedu 
I  +ACC    I      hit 
"I  hit  myself. ' 

Ka  (8)  ram  avanannu  avanu  hodedu 
Ram  he  +  ACC  he    hit 
'Ram  hit  himself. ' 

Ta  (9)  ennai  nan  aditt-i  -  kon  -  den 
I+ACC  I  hit  VR'  AUX 
' I  hit  himself. ' 

Ta  (10)  kamala  avalai  avala  aditt-i  -  kon  -  dal 
Kamala  she+ACC  she  hit  vF"  AUX 
'Kamala  hit  herself. ' 

Ma  (11)  nan  yenne  tann-e6   adiccu 
I    I+ACC  1+  EMPH   hit 
'I  hit  myself. ' 

Ma  (12)  ram  tanne  tann-e     adiccu 
Ram  he+ACC  he+EMPH   hit 
'Ram  hit  himself. ' 

A  close  look  at  the  data  reveals  that  Telugu,  Kannada,  Tamil  and 
Malayalam  have  a  reduplicated  occurrence  of  the  pronoun  and  all  the 
languages  except  Malayalam  have  a  verbal  reflexive  kon/kol.  Further,  in 
Malayalam  the  second  occurrence  of  the  reduplicated  pronoun'  in  all  three 
persons  is  tanne. 


kon 

VR' 

-  de 

AUX 

Jdu  - 

kon  - 
VR" 

da 
Aux 
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According  to  the  Base  approach  for  reflexives  proposed  in  Subbarao  and 
Saxena  (1984),  a  remote  representation  of  sentences  (5)  and  (6)  is  as  in 
(5. a)  and  (6. a)  respectively. 

(5. a)  nenu   nannu  kott  +  [SELF]?  +  AUX 
I      I+ACC   hit 


(6. a)  kamala  tana  +  ni    kott  +  [SELF]  +  AUX 
Kamala  she    ACC    hit 

The  feature  [SELF]  is  realized  as  the  verbal  reflexive  kon.  In  (5. a) 
nenu  which  is  the  logical  subject,  is  copied  into  the  object  position  to  the 
right  of  the  object  pronoun,  and  then  the  subject  is  optionally  deleted. 

In  sentence  (6. a)  tanu8 ,  a  third  person  bound  pronominal  form  of  the 
subject,  is  copied  into  the  object  position  to  the  right  of  the  object 
pronoun.  It  should  be  emphasized  here  that  the  subject  is  normally  deleted 
only  in  first  and  second  persons,  as  they  have  unique  reference  but  not  in 
third  person. 

Notice  further  that  the  second  occurrence  of  the  reduplicated  bound 
anaphor  can  be  optionally  deleted  yielding  a  sentence  such  as  (13)  from 
sentence  (5)  in  Telugu.  Such  optional  deletion  is  possible  because  the 
verbal  reflexive  imparts  the  reflexive  meaning.  As  we  shall  see  such  a 
deletion  is  not  possible  in  sentences  with  the  dative  subject  where  a  verbal 
reflexive  is  not  present. 

Te  (13)  nannu     kottu  -  konn  -  anu 
I  +ACC    hit'    VR     AUX 

Evidence  in  support  of  our  claim  that  the  subject  in  the  first  and  the 
second  persons  is  deleted  comes  from  sentences  in  which  the  subject  is 
followed  by  markers  such  as  aiti  'as  for',  kabatti  'because',  mStram  'as 
for'  and  e,  emphatic  marker. 

Te  (14)  nenu  aiti    nannu  nenu  progudu-  ko-  lenu 

I     as  for  I  +ACC  I    praise   VR  cannot 
'As  for  me,  I  cannot  praise  myself.' 

Te  (15)  miru  kabatti   mimmalni  miru  wimarsincu  -  ko 
you   because   you  +ACC  you   criticize    VR 

galugu  tunnaru 
are  able  to 
'Because  it  is  you,  you  can  criticize  yourself. ' 
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Let  us  now  deal  with  Amritavalli's  analysis  of  reflexives.  She 
considers  sentences  such  as  (1)  and  (2)  from  Kannada  (repeated  here  for 
conveneince)  which  are  synonymous  to  sentences  (7)  and  (8)  above. 


Ka  (1)  nanu  nannannu   hodedu  -  kon  -  de 

I    I  +  ACC    hit      VR"   AUX 
'I  hit  myself. ' 


hodedu  -  kon  -  da 
hit      VR'    AUX 


Ka  (2)  ram  Tavanannu 
Ram  \he  +ACC 

j  tannannu 
l^he  +ACC^ 
"Ram  hit  himself. ' 

According  to  Amritavalli,  kon/kol ,  the  verbal  reflexive,  "anaphorizes" 
pronouns  in  some  way,  exempting  them  from  the  requirement  of  Disjoint 
Reference'  (1984:3)  and  'the  accusative  pronoun  nannannu  is  the  realization 
of  a  feature  matrix  which  is  copied  from  the  matrix  of  the  subject' 
(1984:25).  Such  'copying'  of  the  feature  values  from  an  antencedent  NP  such 
as  nanu  'I'  to  the  second  identical  NP,  nannannu  'I+ACC,  according  to 
Amritavalli,  is  the  process  of  anaphorization  which  in  fact  is  triggered  by 
the  verbal  reflexive  kon/kol.  The  point  which  we  intend  to  draw  attention  to 
is  that  under  this  analysis,  only  an  abstract  feature  matrix  is  copied,  but 
not  any  specific  case  marking.  We  shall  show  later  that  this  abstract 
copying  as  opposed  to  copying  of  an  actual  case  marking  crucially  affects 
the  choice  of  an  explanatory  adequate  analysis. 

Now,  sentences  such  as  (1)  and  (2)  correspond  to  Telugu  sentences  (16) 
and  (17). 

Te  (16)  nenu  nannu  kottu  -  konn  -  anu 
I  I  +ACC  hit'  VR  AUX 
'I  hit  myself. ' 

Te  (17)  wadu  tanani  kottu  -  konn  -  adu 
he'  he  +ACC  hit'  VR  AUX 
'He  hit  himself. ' 

Notice  that  sentence  (16)  is  derived  from  (5)  by  means  of  a  rule  which  moves 
the  second  part  of  the  bound  pronoun  to  sentence  initial  position.  Thus,  the 
word  order  in  sentence  (1)  from  Kannada  and  (16)  from  Telugu,  according  to 
our  analysis,  is  marked  as  opposed  to  the  unmarked  word  order  in  sentences 
such  as  (5)  from  Telugu,  (7)  from  Kannada,  and  (9)  from  Tamil. 

It  should  be  pointed  out  that  in  our  analysis  the  repeated  occurrence 
of  the  identical  pronoun  nannu  nenu  'I+ACC  -  I',  tanani  tanu  'he+ACC  -  he' 
in  Telugu  has  two  characterstics  which  deserve  to  be  mentioned: 
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(i)  These  bound  pronouns  are  not  reflexive  pronouns.  They  have 
a  reduplicated  structure  and  the  second  part  of  the  bound  pronoun  is 
derived  by  means  of  a  copying  rule  which  copies  a  pronominal  form  of  the 
subject. 

(ii)  By  virtue  of  being  a  copy  of  the  subject,  the  second 
bound  pronoun  carries  the  case  marking  of  the  subject.  That  is,  the  case  of 
the  subject  is  copied  onto  the  second  part  of  the  bound  pronoun. 

Evidence  in  support  of  our  claim  that  the  case  marking  of  the  subject 
is  copied  onto  the  bound  pronoun  comes  from  the  dative  subject  construction. 
Consider  sentences  (18)  and  (19)  from  Telugu. 

Te  (18)  namTda  naku    kopam   waccindi 
I  +on   I+DAT   anger   came 
'I  got  angry  on  myself. ' 

Te  (19)  kamala  ki   tanamida  tanaki  kopam  waccindi 
Kamala  DAT   she+on   she+DAT  anger  came 
'Kamala  got  angry  on  herself. ' 

According  to  our  analysis,  naku  "I+DAT'  in  sentence  (18)  is  a  copy  of 
the  subject  naku  which  has  the  dative  case  marking  and  naku,  the  subject,  is 
subsequently  deleted9.  In  sentence  (19)  tanaki  'she+DAT'  is  a  pronominal 
copy  of  kamala  which  therefore  has  a  dative  case  marking.  This  agreement  in 
case  marking  between  the  pronoun  and  its  subject  NP  clearly  supports  our 
hypothesis  that  case  copying  takes  place. 

Notice  further  that  the  verbal  reflexive  does  not  occur  in  sentences 
with  a  dative  subject  in  Telugu,  Tamil  and  Kannada.  In  such  sentences  the 
occurrence  of  the  nonreduplicated  pronoun  yieldsungramraatical  sentences  in 
the  intended  sense.  Such  ungrammaticality  is  due  to  disjoint  reference  of 
the  pronoun  as  the  verbal  reflexive  and/or  the  reduplicated  structure  which 
imparts  reflexive  meaning  are  not  present  in  the  sentence. 

The  absence  of  the  verbal  reflexive  in  sentences  with  a  dative  subject 
is  due  to  the  fact  that  the  predicates  that  take  the  dative  subject  are  all 
intransitive  and  therefore  the  coreferent  NP  in  such  cases  is  not 
syntactically  in  the  direct  object  relation.  The  verbal  reflexive,  according 
to  our  analysis,  occurs  only  in  those  cases  where  the  coreferent  NP  is  in 
direct  object  relation  (grammatical)  with  the  predicate. 

It  is  worth  digressing  here  to  discuss  a  couple  of  sentences  from 
Telugu  in  which  it  appears  that  kon,  the  verbal  reflexive,  occurs  in 
stuctures  which  look  like  they  are  dative  subject  construction.  The  folowing 
sentences  are  illustrative. 
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Te  (21)  wadiki  papam  cuttu  -   kon  -  dilO 
he+DAT  sin    encircle  VR    AUX 
'Sin  encircled  on  him.' 


Te  (22)  wadiki  bhayam   paptu  -  kon  -  di 
he+DAT  fear     catch   VR    AUX 
'Fear  caught  hold  of  him. ' 

However,  we  believe  that  sentences  (21)  and  (22)  are  not  true  dative  sunject 
constructions.  Our  reason  is  as  follows. 

Dative  subject  structures  do  not  permit  a  conjuctive  participial 
construction  in  which  the  grammatical  subject,  such  as  kopam  'anger'  in 
sentence  (18)  above  can  be  the  subject  of  both  clauses,  and  where  the 
experiencer  is  the  object  of  the  matrix  sentence.  This  is  shown  by  the 
ungrammaticality  of  sentences  (23)  and  (24) . 

Te  (23)l*wadini  I  kopam   wacci        tinestundi 

\  he+ACC  /  anger   having  come   is  eating  up 

/*wadiki  \ 
VJie+DATj 

'Anger,  having  come  tohira,  is  eating  him  up.' 

Te  (24)(*wadini  /  noppi   wacci       citrawadha  cestundi 
\  he+ACC  I  pain    having  come  killing     is  doing 

)  *wadiki  \ 
^he+DAT  J 

'Pain,  having  come  to  him,  is  almost  killing  him.' 

On  the  other  hand,  sentences  such  as  (21)  and  (22)  can  occur  in  a  conjuctive 
participial  construction  if  the  logical  subject  occurs  in  the  accusative  and 
not  in  the  dative  case  as  in  sentence  (25)  and  (26)  below. 

Te  (25)[wadini  ) papam  cuttu     koni  pidistundi 

1  he+ACC  /sin    encircle  VR    is  bothering 

I  *wadiki\ 
V^he+DATj 
"Sin  having  encircle  him,  is  bothering  him.' 

Te  (26)  wadini   bhayam  pattu  koni  picciwadni  cestundi 
he+ACC   fear    catch  VR   crazy   '    is  making 
'Fear,  having  caught  hold  of  him,  is  making  him  crazy. ' 
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The   accusative  case  marking,   in  sentences   (25)  and  (26)  is  permissible 
because  the  NP  he  is  in  direct  object  relation  with  the  matrix  predicate. 

Kannada  and  Tamil  too  do  not  permit  kon/kol ,   the  verbal   reflexive,  in 
the  dative  subject  construction.  Sentence  (27)  from  Tamil  is  illustrative. 

Ta  (27)  avalikkt  jaladosam   (  puditt-i  wittidi   j 
he  +ACC    cold  '     \  catch   drop+AUX  (. 


*puditt*  kondadi 
catch   VR+AUX  . 


'He  caught  a  cold. 


1.3      We  shall  now  consider  the  analysis  of  reciprocals  in  Dravidian.  Let 
us  consider  the  following  sentences  from  Telugu  and  Kannada  (the  reciprocal 
is  underscored) . 

Te  (28)  wallu-^  okalla  ni  okallu->^  kottu-konn-aru 
they  someone+ACC  someone  hit  VR  AUX 
'They  hit  each  other.' 

Ka  (29)  avaruvS"  obbar-anna  obbaru--0'  hodedaru 
they  someone+ACC  someone  hit  -PST 
'They  hit  each  other.'     (Amritavalli.  1984) 

As  can  be  seen,  the  reciprocal  contains  a  reduplicated  structure,  the 
first  part  containing  the  usual  object  case  marking,  which  in  fact  is  a  copy 
of  the  nominative  case  marking  of  the  subject. 

Note  that  the  VR  is  obligatory  in  Telugu  reciprocal  constructions 
whereas  in  Kannada,  Tamil  and  Malayalam,  it  does  not  occur  at  all. 

Evidence  in  support  of  the  claim  that  the  case  marking  of  the  second 
part  of  the  reciprocal  contains  the  case  of  the  subject  was  provided  by 
Amritavalli  (1984)  from  the  dative  subject  constructions  in  Kannada  and 
Malayalam.  The  following  sentences  are  illustrative  (the  dative  is 
underscored) . 

Te  (30)  wallaki   okallu  antell   okalla  ki   padadu 

they+DAT  someone  say+COND  someone  DAT  does  not  suit 
'They  do  not  like  each  other. ' 

Ka  (31)  avar-ige  obbar-  anna  kandre  obbar-ige  aguvadilla 
they  DAT  someone  ACC  someone  DAT  happenNEG 

'They  do  not  like  each  other. '12   (Amritavalli, 1984) 
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Notice  that  the  subject  contains  the  dative  case  marking  and  so  does 
the  second  part  of  the  reciprocal. 

Thus,  according  to  Amritavalli ,   there  are   '...two  major  instances  of 
anaphorization  in  Dravidian:  anaphorization  by  Case  Agreement  and  kollu- 
anaphorization.    The   former   process   is   illustrated  in  the  Kantiada 
reciprocal. ...  The  latter  process  (i.e.  kollu  anaphorization)  is  illustrated 
by  data  by  ref lexivization  of  tanu  and  the  personal  pronouns'  (1984:18). 

It  should  be  pointed  out  that  under  Amritavalli 's  analysis,  the 
reduplicated  structure  of  the  reciprocal  is  similar  in  nature  to  the 
reduplicated  structure  of  the  reflexive  in  the  dative  subject  construction. 
Although  Amritavalli  does  not  explicitly  state,  it  is  this  reduplication 
together  with  the  case  agreement  of  the  anaphor  with  the  subject  that 
prevents  disjoint  reference  in  reflexive  sentences  containing  dative 
subjects  and  reciprocals. 

Though  Subbarao  and  Saxena  (1984)  do  not  deal  with  reciprocals,  an 
analysis  of  reciprocals  consistent  with  the  analysis  of  reflexives  proposed 
therein,  would  be  as  follows:  Reciprocals  in  Dravidian,  just  as  the 
reflexive  constructions,  have  a  reduplicated  structure.  The  second  part  of 
the  reciprocal  just  as  the  second  part  of  the  bound  anaphor  carries  the  case 
marking  of  the  subject  and  this  case  marking  is  copied  onto  it  by  a  case 
copying  process. 

To  summarize,  under  Amritavalli 's  analysis,  the  reciprocal  contains  a 
bipartite  structure: 

a  reduplicated  structure  +  an  agreement  matrix  (case) 

Under  Subbarao  and  Saxena 's  analysis,  the  reciprocal  contains  a  structure 
which  is  as  follows: 

a  reduplicated  structure  +  a  case  copy  of  the  subjectl3 

Under  Subbarao  and  Saxena 's  analysis,  all  the  subtypes  of  reflexives  and 
reciprocals  are  treated  identically;  that  is,  they  all  contain  a 
reduplicated  structure  and  case  copying  takes  place  in  all  of  them. 
Amritavalli  posits  a  case  agreement  marker  only  for  dative  subject 
reflexives  and  reciprocals.  In  sentences  with  a  reflexive  with  a  nominative 
subject,  kollu- anaphor izatin  takes  place  which  'somehow'  prevents  disjoint 
reference  of  the  bound  pronoun  and  the  bound  pronoun  is  the  'realization  of 
a  feature  matrix  which  is  COPIED  from  the  matrix  of  the  subject'  (emphasis 
added]  (Amritavalli,  1984:25). 

1.4  In  this  section,  we  shall  attempt  to  evaluate  the  proposals  of 
Amritavalli  (1984)  and  Subbarao  and  Saxena  (1984)  with  a  view  to  choosing  a 
descriptively  and   explanatorily  adequate   analysis.   In   view  of  the 
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similarities  between  the  structures  of  reflexives  and  reciprocals,  an 
analysis  which  provides  a  homogeneous  solution  to  account  for  the  facts 
should  be  preferred  to  one  which  treats  these  as  unrelated  or  partially 
related  phenomena.  Moreover,  we  will  prefer  an  analysis  which  can  capture 
most  of  the  similarities  that  exist  in  all  the  four  Dravidian  lanRuages 
under  consideration. 

1.4.1  Let  us  first  consider  the  case  of  the  identical  occurrence  of  the 
bound  pronoun. 

Under  Subbarao  and  Saxena's  analysis,  this  pronoun  always  is  in  a 
reduplicated  form  whether  the  subject  is  in  the  nominative  or  in  the  dative 
case.  The  case  marking  of  the  subject  is  invariably  copied  onto  the  second 
part  of  the  reduplicated  structure  by  a  copying  process. 

Under  Amritavalli's  analysis,  in  sentences  with  a  nominative  subject, 
the  identical  NP  is  a  simple  pronoun  with  object  case  marking  only.  The 
reduplicated  structure  occurs  only  in  sentences  with  a  dative  subject  where 
the  verbal  reflexive  does  not  occur.  Only  in  such  cases,  the  pronoun  is 
reduplicated  to  avoid  disjoint  reference.  Thus,  the  occurrence  or 
nonoccurrence  of  reduplicated  structures  crucially  depends  on  the  absence  or 
presence  of  the  verbal  reflexive.  Thus,  there  are  two  different  processes  of 
anaphorization  in  Amritavalli's  analysis: 

(i)  The  kollu- anaphorization  process;  and 

(ii)  Anaphorization  by  case  agreement. 

In  the  kollu- anaphorization  process,  only  an  abstract  feature  matrix  is 
copied  onto  the  bound  pronoun  from  the  matrix  of  the  subject,  whereas  in  the 
dative  subject  construction,  anaphorization  by  real  case  agreement  with  the 
subject  takes  place.  Notice  that  there  is  no  motivation  to  include  a  rule 
which  copies  an  abstract  case  marking  only  in  some  cases  as  opposed  to  a 
rule  which  actually  copies  the  case  marking. 

In  Subbarao  and  Saxena's  analysis,  reduplicated  structures  are  posited 
for  both  the  nominative  and  the  dative  subject  constructions,  thus  treating 
these  two  constructions  similarly.  Further,  the  second  part  of  the  bound 
anaphor  always  carries  the  case  marking  of  the  subject,  irrespective  of 
whether  it  is  in  the  nominative  or  the  dative  case.  Moreover,  in  Subbarao 
and  Saxena's  analysis,  the  problem  of  disjoint  reference  of  the  bound 
pronoun  does  not  arise,  because  of  the  reduplicated  structure  posited 
irrespective  of  the  fact  whether  the  verbal  reflexive  occurs  or  not. 

Further,  Amritavalli's  analysis  treats  differently  sentences  with  a 
nominative  subject  in  Telugu,  Kannada  and  Tamil  on  one  hand  and  Malayalam  on 
the  other,   the  former  three  languages  having  nonreduplicated  pronouns  (see 
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sentences  (1)  and  (2)  above)  and  the  latter  having  structures  which  are 
reduplicated  (see  sentences  (i|)  and  (12))-  Thus,  under  Amritavalli' s 
analysis,  Malayalam  differs  from  the  other  three  Dravidian  languages  in  two 
aspects; 

(i)  The  presence  of  reduplicated  structures  in  sentences  with  a 
nominative  subject,  and 

(ii)  The  absence  of  the  verbal  reflexive. 

Under  Subbarao  and  Saxena's  analysis,  Malayalam  differs  from  Telugu, 
Tamil  and  Kannada  only  with  regard  to  one  aspect,  namely,  the  absence  of  the 
verbal  reflexive. 

It  should  however  be  mentioned  that  both  analyses  would  have  to  treat 
the  second  occurrence  of  the  reduplicated  structure,  namely,  tanne  in 
Malayalam,  as  an  exception  with  regard  to  case  agreement  or  copying. 

Though  not  explicitly  stated,  it  is  implicit  in  Amritavalli ' s  analysis 
that: 

(i)  in  the  three  Dravidian  languages  except  Malayalam  a  nonduplicated 
bound  pronoun  occurs  when  the  verbal  reflexive  is  present,  and 

(ii)  in  such  cases  only,  abstract  case  agreement  takes  place. 

That  is,  there  is  no  single  unified  process  of  copying  of  case.  Further, 
case  agreement  is  sensitive  to  the  presence  or  absence  of  the  verbal 
reflexive  in  Amritavalli 's  analysis  whereas  it  is  not  so  in  Subbarao  and 
Saxena's  analysis. 

1.4.2  The  crucial  question  that  arises  now  is  this:  Is  there  any 
evidence  to  show  that  there  is  a  single  process  of  case  agreement  or  copying 
which  does  not  depend  on  the  presence  or  absence  of  the  verbal  reflexive?  It 
appears  that  there  is . 

Independent  evidence  for  assuming  a  single  process  of  case  agreement  or 
copying  is  found  in  the  reciprocal  constructions.  As  mentioned  earlier,  in 
all  the  four  languages,  the  second  part  of  the  reciprocal  carries  the  case 
marking  of  the  subject.  However,  the  presence  of  the  verbal  reflexive  is 
obligatory  only  in  Telugu  in  reciprocal  constructions  containing  a 
nominative  subject.  The  verbal  reflexive  does  not  occur  in  the  dative 
subject  construction  of  Telugu  or  any  of  the  other  three  languages. 

Both  Amritavalli 's  and  Subbarao 's  analyses  can  account  for  case  marking 
by  either  an  anaphorization  process  or  case  copying  process  respectively. 

However,  if  we  extend  Amritavalli 's  analysis   to  the  Telugu  reciprocal 
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constructions,  a  problem  arises  with  regard  to  the  nature  of  anaphorization 
in  such  constructions,  because  the  reciprocal  is  a  reduplicated  structure 
and  the  verbal  reflexive  also  occurs.  We  must  therefore  ask  which  of  the 
following  alternatives  we  should  choose  to  account  for  this  situation? 

(i)   Anaphorization  by  Case  Agreement 

(ii)  Anaphorization  due  to  the  verbal  reflexive  which  only 
copies  an  abstract  case  agreement  marker 

(iii)  Anaphorization  by  Case  Agreement  as  well  as  due  to  the 
verbal  reflexive. 

Choosing  alternative  (i)  would  imply  that  the  verbal  reflexive,  though 
overtly  present,  has  no  role  in  the  anaphorization  process.  Choosing 
alternative  (ii)  would  yield  the  expected  result.  Choosing  alternative  (iii) 
would  imply  that  actual  copying  does  not  depend  on  the  presence  or  absence 
of  the  verbal  reflexive  or  reduplicated  structures. 

Subbarao  and  Saxena's  analysis,  on  the  other  hand,  does  not  encouter 
this  problem  because  (i)  reduplicated  structures  are  posited  in  reflexives 
as  well  as  reciprocals  and  (ii)  case  copying  takes  place  in  both  the 
constructions  whether  the  subject  is  in  the  nominative  or  dative. 

To  conclude,  the  above  discussion  shows  that  positing  reduplicated 
structures  for  reflexives  as  well  as  reciprocals  and  including  a  case 
copying  process  in  Dravidian  languages  Cexcept  in  one  single  case  in 
Malayalam)  would  adequately  account  for  the  facts. 


*The  present  address  of  the  author  is  Dept.  of  Laniguistics ,  University 
of  Oregon. 

1  This  paper  is  based  on  the  findings  of  the  Univesity  Grants 
Commission  project  on  'A  Study  of  Syntactic  Typology,  Convergence  and 
Language  Universals  in  South  Asian  languages.'  We  are  thankful  to  Professors 
Bh.  Krishnamurti ,  Jayaseelan,  Alice  Davison  and  Hans  Hock  for  their  valuable 
comments  and  to  Dr.  K.  Vasanta  for  data  on  Tamil,  to  Mr.  T.  Satyanath  for 
data  on  Kannada,  to  Mrs.  Usha  Pillai  and  Ms.  Malini  Nair  for  data  on 
Malayalam.  An  earlier  version  of  this  paper  was  presented  at  a  seminar  on 
Explanations  in  Linguistics  at  the  Central  Institute  of  English  and  Foreign 
Languages,  December  1985. 

2  tan  in  tannannu  "he  +  ACC  is  a  bound  anaphor.  A  bound  anaphor  is  one 
which  cannot  occur  in  a  sentence  unless  it  has  an  antecedent.  The 
ungrammaticality  of  sentence  (ii)  in  contrast  to  sentence  (i)  is 
illustrative. 
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Te  (i)  kamala  i    sangati  ceppi       tanu       skul   ki 
Kamala  this  news     having  said  she (bound)  school  to 
wellipoyindi 
is  gone 
'Having  told  this  news,  Kamala  went  to  school.' 

Te  (ii)  *tanu       i    sangati  ceppi       kamala  skul  ki 
she (bound)  this  news    having  said  Kamala  school 

wellipoyindi 
is  gone 

As  we  shall  see  later,  reduplicated  occurrences  of  a  pronoun  too  should  be 
treated  as  bound  anaphors  as  they  can  occur  if  and  only  if  they  have  an 
antecedent. 

3  AG,  according  to  Amritavalli,  is  a  'label  for  a  schematic  complex  of 
unspecified  features  whose  values  must  be  filled  in  by  a  rule  that  refers  to 
some  NP  in  the  sentences'  (Amritavalli,  1984:12). 

4  Notice  that  there  is  no  verbal  reflexive  present  in  sentences  (3)  and 

5  Subbarao  and  Saxena  (1984)  does  not  deal  with  reciprocal 
constructions.  In  Dravidian  languages  reciprocals  too  have  a  redulicated 
structure. 

6  EMPH  stands  for  emphatic  marker.  Mohanan  (1981)  and  Amritavalli 
(1984)  claim  that  the  repeated  occurrence  of  tanne  is  in  the  accusative 
case.  We  treat  e  as  an  emphatic  marker. 

7  According  to  the  Base  approach  [SELF]  should  be  posited  in  the  base 
to  indicate  the  reflexive  meaning.  Since  this  proposal  is  made  on  the  basis 
of  data  from  a  wide  range  of  languages,  it  does  not  have  the  inherent 
defects  of  an  analysis  such  as  Helke's  (Helke,  1979)  which  is  based  on  the 
data  from  English  alone.  For  further  details  concerning  the  Base  approach 
see  Subbarao  (1983)  and  Saxena  (1984). 

8  We  treat  tan  as  a  bound  pronoun  and  not  as  a  reflexive  pronoun  and  so 
does  Amritavalli  (1984). 

9  In  order  to  prove  our  contention  that  the  subject  is  deleted  in  the 
first  and  second  persons,  we  earlier  provided  evidence  from  sentences  such 
as  (9)  and  (10)  having  aite  'as  for'. 

10  We  are  thankful  to  Professor  Bh.  Krishnamurti  for  pointing  out  these 
sentences. 
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11  ante,  the  conditional  form  of  the  verb  say,  in  Telugu,  is  labelled 
as  Emotive 'Predicate  Linker  (EPL)  which  occurs  with  the  logical  object  in 
sentences  such  as  (iii)  and  (iv)  containing  psychological  predicates- 

Te  (iii)  ri5ku   sinemalu   ante   istam 
I+DAT  movies     EPL    liking 
' I  like  movies. ' 

Te  (iv)  kamala  ki   sinerai  ^ktarz   ante  ciraku 

Kamala  DAT  movie   actors   EPL   irritation 
'Kamala  can  not  stand  movie  actors.' 

12  Amritavalli  (1984)  rightly  points  out  that  a  postposition  may  occur 
in  between  the  two  parts  of  the  reciprocal.  The  following  sentence  is 
illustrative. 

Ka  (v)  avar-ige  obbar-amele  obbar-ige   nambige  ilia 
they -DAT  someone -on   someone -DAT  trust   not 
'They  don't  trust  each  other.' 

Lehman  (1985)  too  presents  similar  data  from  Tamil.  However,  in  Telugu  even 
a  verb  in  the  conditional  form  or  the  conjuctive  participial  (CP)  form  can 
occur  in  between  the  two  parts  of  the  reciprocal.  For  example, 

Te  (vi)  walla-ki  okalla-ni   cusi okallaki 

they+DAT  someone+ACC  having  seen  somedne+DAT 

orwalenitanam   ekkuwa 

intolerance    too  much 

'They  are  jealous  of  each  other.' 

13  It  is  important  to  mention  here  that  it  is  only  the  case  marker  and 
nothing  else  is  copied  onto  the  second  part  of  the  reciprocal  as  illustrated 
in  the  following  sentence. 

Te  (vii)  kamala  ki  sita  ki  okalla-  ki  okalla  ki  padadu 
Kamala  DAT  Sita  DAT  someone  DAT  someorieDAT  suitNEG 
'Kamala  and  Sita  do  not  like  each  other. ' 

Notice  that  conjunction  in  Telugu  is  manifested  by  lengthening  the  final 
vowel  of  each  of  the  conjuncts.  This  length  cannot  be  copied  onto  the  dative 
of  the  second  part  of  the  reciprocal.  However,  both  the  case  markings  of  the 
reciprocal  can  be  lengthened.    _  _         _ 

Te  (viii)  kamala  ki  sita  ki  okalla  ki  okalla  ki   padadu 
Kamal  DAT   Sita  DAT  someone  DAT  someone  DAT  suit  NEG 
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It  is  interesting  to  note  that  the  predicate  pad  'suit'   not  only  takes  a 
dative  with  a  subject  but  also  with  an  object. 
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The  PaAcatantra  and  Aesop's  Fables:    A  comparison  of  rhetorical  structure  In 
classical  Indian  and  western  literature^ 


Sarah  Tsiang  and  Albert  Watanabe 


In  this  paper  the  rhetorical  structure  of  fables  in  the  Sanskrit  Paftcalantra  and  Greel<  fables 
of  Aesop  are  compared  with  regard  to  nominal  vs.  verbal  style,  circular  vs.  linear  organization 
and  cumulative  vs.  descending  manner  of  presentation.   We  observe  that  the  rhetorical 
structure  of  Sanskrit  and  Greek  texts  in  this  genre  differ  significantly  in  these  three  areas.  Our 
conclusions  based  on  the  analysis  of  classical  texts  support  the  conclusions  of  studies  in 
which  modern  texts  from  these  two  traditions  have  been  compared. 


1.0    Introduction 

Recent  soctolingulstic  studies  have  shown  that  there  are  considerable  differences  in  the 
organizatton  of  discourse  in  texts  written  in  the  literary  traditions  of  different  cultures  (Kaplan  1966. 
Clyne  1982,  Pandharipande  1982,  Y.  Kachru  1983).  While  current  research  is  primarily  based  on  the 
analysis  of  modern  texts,  we  propose  to  investigate  discourse  organization  in  texts  written  in  two 
classical  Indo-European  languages:  Sanskrit  and  Greek.  The  two  traditions  differ  even  in  respect  to 
what  indigenous  theories  of  rhetoric  argue  should  be  emphasized  in  a  text.  Thus  the  obsen/ation  in 
(1)  from  Dan  din's  KSvy  adarsa  contrasts  sharply  with  the  advice  of  CXiintilian  in  (2): 

(1)  ojah  samSsabhDyastvam  etad  gadyasya  jTvitam  ( 1 .80) 

The  frequent  use  of  compounds  [gives]  power  to  prose;  it  [is  its]  life-force.' 

(2)  Verbo  sensum  cludere  multo,  si  compositio  patiatur,  optimum  est;  in  verbis  enim  sermonis 
vis  est.  (Institutio  oratoria  9.4.26) 

'If  the  arrangement  [of  words]  allows,  it  is  by  far  best  to  end  a  sentence  with  a  verts;  for  the 
power  of  speech  is  in  vertis.' 

While  the  issue  of  style  in  Sanskrit  has  been  considered  in  previous  studies  (e.g.  Jacob!  1903, 
Renou  1956),  a  detailed  study  of  how  the  rhetorical  structure  of  Sanskrit  texts  compares  to  the 
rhetorical  structure  of  texts  written  in  other  languages  has  not  yet  been  made.  The  only  cross- 
linguistic  comparisons  of  rhetorical  structure  of  which  we  are  aware  are  Wells  1960  and  Nakamura 
1964.  However,  their  discussions  are  more  oriented  toward  the  formulation  of  very  broad 
generalizations  based  on  cursory  surveys  of  texts  rather  than  detailed  studies  of  selected  texts. 

To  make  a  beginning  in  examining  the  similarities  and  differences  between  the  organization  of 
discourse  in  classical  Indian  and  western  writing,  we  propose  to  investigate  rhetorical  structure  in  the 
Sanskrit  and  Greek  fable  literature.  Specifically,  we  will  examine  fables  from  the  Pancatantra  and 
Aesop's  Fables  which  describe  similar  characters  and  events  (thus  minimizing  the  role  of  content  as  a 
determining  factor  for  rhetorical  structure). 

In  examining  these  fables  we  will  consider  the  following  areas:  (1)  nominal  vs.  vert^al  style,  i.e. 
whether  the  Sanskrit  style  is  more  nominal  than  the  Greek  style  (regarding  Sanskrit  style,  see  Jacobi 
1903,  Wells  1960);  (2)  circular  vs.  linear  organization,  i.e.  whether  the  discourse  organization  of  the 
Indian  fable  is  more  circular  than  that  of  the  Greek  fable  (regarding  drcuiarity  in  the  modem  Indian 
tradition,  see  Pandharipande  1982,  Y.  Kachm  1983;  on  western  styles,  see  Kaplan  1966. 
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B.  Kachm  1986);  (3)  cumulative  vs.  descending  manner  of  presentation,  i.e.  whether  important 
information  in  the  fables  is  presented  at  the  beginning  (as  in  Aesop)  or  gradually  (as  in  the  Paftcatantra). 
These  three  areas  will  be  discussed  in  later  sections  following  a  brief  introduction  to  the  texts.  We  will 
conclude  with  a  discussion  of  our  findings  and  a  suggestion  concerning  methodology  for  further 
research. 

2.0     The  Texts 

The  PaPicatantra  and  Aesop's  Fables  are  both  major  fable  collections  in  their  respective  traditions. 
The  fables  in  the  Pahcatantra  are  linked  to  each  other  in  such  a  way  that  one  story  introduces  the  next 
within  an  encompassing  frame-story.  Aesop's  fables  are  unconnected  and  organized  alphabetically. 
While  many  versions  of  each  collection  exist,  we  selected  for  our  inquiry  PDrnabhadra's  version  of  the 
Pahcatantra  and  the  Augustana  recension  of  Aesop's  Fables.  These  versions  were  selected  for  the 
following  reasons:  (1)  They  are  both  primary  versions  of  the  texts;  (2)  they  are  both  prose  works;  and  (3) 
they  contain  the  largest  number  of  fables  with  similar  themes. 

As  far  as  the  question  of  influence  between  the  two  fable  traditions  is  concerned,  we  make  the 
following  observation.  While  previous  research  has  traced  the  migration  of  fables  from  India  to  the 
western  world  through  a  series  of  translations  (Hertel  1914),  this  migration  occurred  after  the  formation 
of  the  Greek  tradition.  Therefore  we  consider  similarities  between  the  Paftcatantra  and  Aesop's  Fables 
to  be  limited  to  the  level  of  peregrinatory  motifs  and  loci  communes. 

Below  we  list  the  five  corresponding  fables  which  we  selected  for  comparison: 

(3)  Paftcatantra  AeSQP'S  Fables 

1  a)  Crows  and  Serpent  (P  1 .5)2  Eagle  and  Fox  (A  1 ) 

b)  Heron,  Fishes,  and  Crab  (PI  .6) 

2)  Two  Geese  and  Tortoise  (P  1 .16)  Turtle  and  Eagle  (A  259) 

3)  Gold-giving  Serpent  (P  3.6)  Farmer  and  Snake  (A  51) 

4)  Ass  in  Tiger-skin  (P  4.7)  Ass  in  Lion's  Skin  (A  199) 

5)  Adultress  Tricked  by  Paramour  (P  4.8)  Dog  Carrying  l^eat  (A  136) 

3.0      The  Investigation 

We  understand  rhetorical  structure  to  mean  the  'method  of  presentation'  or  the  way  in  which  a  story 
is  told,  i.e.  how  the  discourse  organization  and  the  lexical  elements  used  by  a  writer  emphasize  different 
facets  of  the  story  and  guide  the  reader  toward  a  particular  conclusion  (in  the  case  of  fables,  the  moral). 
Thus  rhetorical  structure  is  stylistic  in  so  far  as  the  telling  of  stories  in  a  particular  way  is  a  matter  of  choice. 
For  example,  by  looking  at  corresponding  lines  in  two  versions  of  the  same  Pahcatantra  fable,  we  can 
see  a  high  degreee  of  stylistic  variation  with  respect  to  the  inclusion  of  detail,  though  t)oth  fables 
belong  to  the  same  tradition: 

(4)  atha  sa  tarribakagrfvarTi  samadaya  sanaih  sanaistajjalasayamasasada  (Kale  1.7) 
then   he   it      heron-neck   <go>  gradually        that  pond<set  back> 

Then  he  having  taken  the  heron-neck  slowly  set  back  to  that  pond.' 

(5)  kuirrako  'pi  grhnva   bakagrrvSmutpalan5lavad3k5sagamana- 
crab        so  <seize>     heron-neck    totus-stemlike  air-going 
prasSdhitacihnamargo     matsy  antikameva  pray  St  (TantrSkhy  ayika  1 .5) 
subdued-sign-conveyor  fish    near      so  <set  off> 

'So  the  crab,  being  the  sign-conveyor  of  the  subdued  bird,  having  seized  the  totus-stalk 
heron-neck,  set  off  for  the  presence  of  the  fish.' 

Similar  stylistic  variation  can  be  found  in  different  versions  of  Aesop's  fables,  as  can  be  seen  in  the 
following  corresponding  lines: 


(6)  GeSrgou     paidaophis   herpusas  apekteinen.  (A  51) 
of  farmer      son    snake  <creep>   <kill> 

The  snake,  having  crept  up,  killed  the  farmer's  son.' 

(7)  Ophis  georgou     paida  dexasentSi    podi  parakrgma      nekron  edeixen.  (A51)3 
snake  of  farmer    son     <bite>inthe     foot  immediately     corpse  <render> 

*A  snake,  biting  the  farmer's  son  in  the  foot,  immediately  rendered  [him]  a  corpse.' 

As  we  can  see  from  these  examples,  both  languages  have  the  lexical  resources  available  to 
them  for  conveying  the  essentia!  meaning,  as  in  (4)  and  (6),  and  for  embellishing  this  meaning,  as 
in  (5)  and  (7).    Moreover,  both  languages  use  case  inflection  in  the  nominal  system  to  show  the 
relationships  between  subjects  and  objects  to  each  other  and  to  the  action  described.   A  rich  verbal 
morphology  is  found  in  tx}th  languages,  as  we  can  see  for  example  in  the  frequent  use  of  the 
absolutive  (e.g.  grhitvS  'having  seized"  and  herpusas  'having  crept'  in  the  examples  above).   What  is 
of  interest  to  us  in  this  investigation  is  how  the  two  languages  use  similar  linguistic  resources  to  relate 
similar  stories  in  what  we  will  argue  to  be  dissimilar  ways,  that  is,  each  language  exhibits  a  different 
rhetorical  structure.  In  the  following  sections  we  will  discuss  in  detail  the  three  topics  in  rhetorical 
structure  enumerated  above. 

3.1      Nominal  vs.  Verbal  Style 

Sanskrit  texts  have  been  characterized  as  being  written  in  the  nominal  style  in  contrast  to  the  verbal 
style  found  in  writings  in  western  languages  such  as  Greek  (Jacobi  1903,  Wells  1966).  Nominal  style 
can  be  defined  as  a  writing  style  in  which  nominal  forms  are  preferred  to  verbal  fornis.  Characteristics  of 
nominal  style  include  the  following  traits. 

First,  finite  verbs  are  not  found  as  frequently  in  texts  written  in  the  nominal  style.  In  place  of  such 
verbs  one  finds  'nominalizations',  for  example  verbal  nouns  or  adjectives,  and  absolutives  used  to 
present  accompanying  actions.  As  a  result  of  these  nominalizations,  the  number  of  clauses  in  a 
nominal-style  text  is  reduced.   Another  characteristic  of  the  nominal  style  is  the  greater  frequency  with 
which  vert)S  (especially  the  copula)  are  omitted.   The  nominal  style  Is  contrasted  with  the  vertsal  style  in 
the  examples  bek)w: 

(8)  John  made  a  beginning  on  his  thesis,  (verbal  noun) 

(9)  John  began  his  thesis. 

(10)  It  is  the  done  thing,  (verbal  adjective) 

(11)  They  do  such  things. 

(12)  The  liar  endangers  his  reputation. 

(13)  He  who  lies  endangers  his  reputation,  (two  clauses) 

(14)  Having  applied  for  the  job  John  ate  lunch,  (absolutive) 

(15)  John  applied  for  the  job  and  then  ate  lunch. 

( 1 6)  sarpa  h  k  r  §  iia  h  (copula  deletion) 
snake    black 

'The  snake  [is]  black.' 

In  additton  to  these  general  characteristics  of  nominal  style,  Sanskrit  morphiology  permits  another 
method  of  nominalization  in  its  rules  for  compound  formation.  In  Sanskrit,  an  uninflected  verb  can  form 
part  of  a  nominal  compound,  as  in  the  adjectival  compound  shown  in  example  (17).  Here  the  participle 
nyasta  (ni-  -^as )  is  uninflected.  The  use  of  such  compounds  decreases  the  number  of  inflected  verbs 
in  a  text. 

(17)  (a)ntahpuramjal8sannanyastakanakasDtramukt5h5ravastr8bharariar7i    (P1.5) 

jala-  Ssanna-nyasta-kanakasQtra-  mukt 5h 5ra-vastra- Bbharanam 
harem  water  near  <set>    gold  necklace  pearl  doth    ornament 

Ihe  harem  having  set  down  near  the  water  gold  necklaces,  ckJthes  and  ornaments.' 
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Characteristics  of  nominal  style  were  very  much  in  evidence  in  the  Sanskrit  fables  which  we 
examined.  In  particular,  we  found  the  frequent  use  of  verbal  nouns  such  as  'ikSsagamana  'bird*, 
literally  *air-going'  (Tantrakhyayika  1.5  );  danfasamdamsena  (inst.)  "by  means  of  a  biting  together  of  the 
teeth'  (P  1.16).    Participles  frequently  functioned  as  veibal  adjectives  and  predicate  adjectives  as  in  the 
following  examples: 

(18)  kako  bhrsam  duhkhaparnSr^go  (P  1.5) 

part, 
crow   much     grief    <envelop>  body 
"the  crow,  having  a  body  enveloped  with  very  much  grief 

(19)  yavad  ahammatsyadanenapustah      suhitas  ca(P1.6) 

part.  part, 

as  long  I  fish  <eat>     <nourish>    <satisfy>and 

'So  long  as  I  by  eating  fish  was  nourished  and  well  satisfied...' 

We  also  found  extensive  use  of  absolutives,  often  appearing  in  chains  as  in  (20).  In  this  example,  the 
important  events  of  the  story  are  related  using  absolutives.  Only  the  less  significant  action  (namely  that 
of  speaking)  is  related  using  a  finite  vertj  (ylvadf. 

(20)  ity    avadharya  kuto  'pi       ksTramyScitvS  saravenikspyavalmTkSntikam 

abs.  abs.  abs. 

thus<ponder>  somewhere     milk     <ask>     dish  <put>        ant-hill  near 
upagamyovSca.  (P3.6) 
upagamya-  uvaca 
abs.  pert. 

<approach><say> 

'Thus  having  pondered,  having  asked  for  milk  from  somewhere,  having  put  it  in  a  dish, 
approaching  the  ant-hill,  he  said:' 

This  example  also  illustrates  the  reduction  in  the  number  of  clauses  resulting  from  the  use  of 
absolutive  constmctions.  Had  all  of  the  vert^s  been  finite,  the  sentence  would  contain  five  clauses 
rather  than  one  clause.  Examples  of  vertjal  deletion  were  also  in  evidence,  as  in  examples  (21 )  and 
(22): 

(21)  mamakhalumaunavratamadyaprabhrtiySvad    SkSsagamanam  (P1.16) 
my      now  silence-vow     now  on  as  long    sky-wandering 

'Now  [there  is  ]  my  vow  of  silence  as  long  as  our  sky-wandering  [is  going  on].' 

(22)  nQnamdrnarapQrno  'yam  valmlka:  (P3.6) 
surely    gold-coin  filled  this  anthill 

'Surely  this  anthill  [is]  filled  with  gold-coins.' 

We  also  encountered  the  frequent  use  of  compounds  (see  e.g.  (4)  ,(5),  and  (17)  above),  including 
those  with  uninflected  verbal  constituent  elements. 

The  Greek  fables,  on  the  other  hand,  showed  fewer  of  the  characteristics  of  nominal  style.   In 
these  fables  there  was  only  a  single  participle  used  as  a  vertsal  noun  (hoi  SdikSmenoi,  1hose  suffering 
from  injustice',  A  1 ).    No  omission  of  the  copula  occurred.   Compounds  found  in  the  Greek  fables 
were  for  the  most  part  verbal  and  consisted  of  verbs  joined  to  prepositions  as  in  (23): 

(23)  pnoserkhomai  (A  259)  anabaino  (A  1 ) 
towafd-<come>  up-<go> 
'approach'  'go  up' 

But  there  were  a  number  of  verbal  adjectives  (e.g.  ho  hupokeimenos  thamnos.  the  lying  under  (the 
tree)  bush',  A  1;  hS parakeimenS petra,  'the  lying  near  rock',  A  51).  Thus  in  these  categories  (except 
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for  the  vertDal  adjective)  we  can  see  that  the  Sanskrit  shows  more  of  a  nominal  style.  But  it  should  be 
noted  that  all  of  these  constructions  are  possible  in  Greek  and  that  the  Greek  fables  are  significantly 
less  detailed  than  their  Sanskrit  counterparts  (see  betow). 

Of  the  characteristics  of  nominal  style,  the  use  of  non-finite  verbs  to  convey  actions  in  Greek 
offered  a  clear  parallel  to  Sanskrit,  as  we  can  see  in  the  following  exanple  containing  four  participles 
and  only  one  finite  verb: 

(24)  exelthouses  de  pote    autgs  epi  nomgn  ho  aetos  aporSn       trophSskataptas      eis 
part.  part.  part. 

gen,  absolute 

<go  off>  once      it      for  food  eagle<be  at  ioss>  food     <  fly  down>     Into 

tonthamnonkaitagennSmataanarpSsas        meta  tSnhautou  neottSn 
part, 
bush  young         <snatchup>        with         own      young 

katethoinSsato.  (A1) 
aorist 
<feast> 

'It  [the  fox]  once  having  gone  off  for  food,  the  eagle,  being  at  a  k)ss  for  food,  flying  down 
Into  the  bush  and  snatching  up  the  young  [of  the  fox],  feasted  with  its  young.' 

In  this  sentence,  the  only  finite  verb  is  katathoinSsato  which  appears  only  after  many  actions  have 
been  described  in  the  preceding  chain  of  participle  constructions.  However,  such  examples  of 
'chaining'  in  the  Greek  fables  do  not  seem  to  be  strictly  parallel  to  the  chains  of  participles  found  in 
Sanskrit.    First,  clauses  in  the  Greek  fables  usually  do  not  contain  more  than  one  participle  per  clause. 
At  times,  these  participles  do  not  necessarily  lead  to  a  reduction  in  the  number  of  clauses  as  we  can 
see  in  example  (25).   In  this  example  we  have  three  participles  (two  in  one  clause)  and  four  clauses: 

(25)  hodeepitoutSideipnopathSsaspelekun  anelabekal      paragenomenos 

part.  part, 

he       at    this    <tobegrieved>    axe       <seize>  and  <tobenear> 
eis  ton  phSleon  autou  heistSkei         paratSroumenos,  hopSs, 

part, 
to  hole        of  It  <to  stand>      <towatch>  In  order  that 

anexi§l,  euthe5s  auton  patax§i. 

if  <togoout>      invnecfiately        it     <toslrike> 

'And  he  [the  farmer]  being  grieved  at  this  seized  an  axe  and  going  to  the  hole  of  it  [the 
snake]  stood  watching,  in  order  that,  if  the  snake  should  come  out,  he  might  immediately 
strike  it.' 

Furthermore,  in  the  Greek  fables  brevity  rather  than  style  may  be  the  motivating  factor  for  the 
strings  of  participles.  The  Greek  fables  are  considerably  more  concise  than  the  Sanskrit  fables  (e.g.  in 
'Crows  and  Serpent*  the  Sanskrit  fable  is  roughly  350  words  long  while  the  corresponding  Greek  fable 
is  less  than  200  words  tong;  'GokJ-giving  Serpent'  has  about  200  words  in  Sanskrit  compared  with  100 
in  Greek).  Thus  these  strings  of  participles  in  the  Greek  may  have  been  used  to  make  the  sentences 
more  concise  and  need  not  t)e  taken  as  a  confirmation  of  stylistic  choice.  With  this  adjustment ,  and 
keeping  in  mind  the  fact  that  verbal  adjectives  fonned  from  participles  do  occur  in  the  Greek,  we 
conclude  that  the  Sanskrit  fables  exhibit  a  nominal  style  while  a  vert)al  style  is  characteristic  of  the 
Greek  fables. 

3.2      Circular  vs.  Linear  Organization 

In  the  area  of  textual  organizatton,  we  found  that  the  Greek  fables  proceed  in  a  linear  fashion:  in 
general  each  begins  with  a  line  introducing  the  characters  and  describing  a  situation.  The  narration  of 
events  follows  and  the  story  concludes  with  a  moral.  On  the  other  hand,  the  Sanskrit  fables  show  a 
circular  organization:  The  moral  is  introduced  at  the  start  (or  more  precisely  at  the  end  of  the  preceding 
story,  which  serves  as  an  introduction  to  the  present  one).  The  story  then  follows  and  the  moral  may 
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only  be  referred  to  at  the  end  (only  the  first  words  being  reiterated).   The  reader  is  thus  at  times 
compelled  to  return  to  the  beginning  of  the  story  to  recollect  the  moral. 

To  illustrate  this  difference  in  organization,  we  may  look  at  the  different  ways  in  which  Greek  and 
Sanskrit  present  stories  based  on  the  same  theme  --  the  unrewarded  abandoning  of  what  one  has,  in 
the  hopes  of  gaining  something  better.    In  the  Greek  fable,  entitled  'Dog  Carrying  Meat'  (A  136).  the 
first  line  immediately  introduces  the  dog  and  describes  the  background  for  the  story: 

(26)  Ku  on  kreas  ekhousa  potamon  diebaine. 

'A  dog  with  a  piece  of  meat  was  crossing  the  river.* 

The  story  continues  by  telling  how  the  dog  drops  the  piece  of  meat  which  it  has  in  pursuit  of  another 
larger  piece  (in  reality,  a  reflection  in  the  water).  The  fable  then  concludes  with  the  nrwral  which 
explicitly  states  what  the  reader  is  to  understand  from  the  story: 

(27)  Pros  andra  pleonekten  ho  logos  eukairos. 
'For  the  greedy  man  the  story  is  appropriate.' 

In  contrast,  the  moral  of  the  corresponding  Sanskrit  fable  (entitled  'Adultress  Tricked  by 
Paramour")  first  appears  at  the  end  of  the  preceding  fable  (P  4.7):^ 

(28)  ySdrsammamapandityamtadrsamdvigunamtava 
n  abh Qj  j  aro  na  bhart  a  ca  jate  ti  s  t hasi  nagnike  (P  4.8) 

'Just  as  my  wisdom  so  is  your  cleverness.  Nought  has  come  of  [your]  lover  or  husband.  In 
the  water  you  stand,  o  naked  one.' 

The  fable  proper  begins  as  follows: 

(29)  kasmimscid  adhisthane  halikadampatfprativasatah  sma 
'In  a  certain  place  lived  a  ploughman  and  his  wife.' 

In  this  first  line,  we  are  only  introduced  to  the  ploughman  and  his  wife  whom  we  later  learn  is  much 
younger  and  very  unfulfilled.  The  wife  meets  a  young  man  and  mns  off  with  him,  taking  her 
husband's  money  with  her.  But  he  eventually  abandons  her  on  the  other  side  of  a  river,  taking  the 
money  as  well  as  her  clothes.  While  sitting  on  the  river-bank,  the  woman  sees  a  jackal  carrying  a  piece 
of  meat.  The  jackal,  seeing  a  stranded  fish,  drops  the  meat  to  pursue  the  fish,  only  to  have  the  fish 
escape  and  a  vulture  steal  the  meat.  The  woman  chides  the  jackal  for  losing  both  the  fish  and  the 
meat.  The  jackal  in  turn  rebukes  her  for  the  loss  of  her  husband's  money  (and  perhaps  her  husband) 
and  her  lover,  quoting  the  moral  (see  (28)  above). 

The  presentation  of  this  fable  is  circular  in  two  ways.   Not  only  does  the  moral  begin  and  end  the 
narrative,  txjt  also  the  jackal's  story  of  loss  brings  the  tale  back  to  the  woman's  story  of  loss.    In 
contrast,  the  organization  of  the  Greek  fable  proceeds  in  a  linear  manner.   All  of  the  major  characters 
are  introduced  at  the  beginning  of  the  story  and  the  narration  of  events  leads  directly  to  the  moral. 

Furthermore,  the  Greek  fables  relate  only  those  events  which  are  essential  to  the  story.    In 
consequence,  these  fables  tend  to  be  short  and  to  have  simple  plots.    By  contrast,  in  'Adultress 
Tricked  by  Parannouf',  the  stories  of  the  ploughman's  wife  and  the  jackal  are  artfully  juxtaposed.  In 
general,  the  Sanskrit  fables  show  a  greater  tendency  to  digress  and  to  spin  out  the  stories  in  elatwrate 
detail.  A  prime  example  of  digression  is  found  in  'Heron,  Fishes,  and  Crab',  in  which  the  water- 
creatures  fear  for  their  lives  because  their  lake  is  going  dry.  In  Kale's  version  (1 .7),  there  are  eight  lines 
(including  three  slokas)  devoted  to  explaining  the  astronomical  situation  causing  the  drought.  A  good 
compact  example  of  the  arrvsunt  of  detail  found  in  Sanskrit  fables  can  be  seen  in  the  precise 
description  of  the  sizes  of  fish  which  the  heron  consumes: 
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(30)  bhaksayitvS  bahOn  matsySn  uttamSdhamamadhyamSn 
atilautySd  bakah  kascin  mrtah  karkatakagrahSt  (P  1 .5) 

'A  heron,  having  eaten  many  fish,  large  (uttama)  small  (adhama)  and  medium  (madhyama), 
his  greed  never  satisfied,  because  of  the  grasping  of  a  crab,  died." 

The  degree  of  detail  as  well  as  the  number  of  digressions  in  the  Sanskrit  fables  stand  in  marked 
contrast  to  the  straighttonward  relation  of  facts  in  the  Greek  fables.  These  two  features  illustrating 
deviation  from  the  point  are  further  examples  of  circularity  in  the  Sanskrit  fables. 

3.3      Cumulative  vs.  Descending  Presentation 

In  ternns  of  nnanner  of  presentation,  we  found  that  the  Greek  fables  show  a  descending  style  of 
presentation  in  so  far  as  important  information  (e.g.  characters  and  events  essential  to  the  story)  is 
presented  at  once  with  supporting  information  being  introduced  later.  On  the  other  hand,  the 
Sanskrit  fables  show  an  ascending  or  cumulative  style  of  presentation  since  Important  information  is 
not  immediately,  but  is  rather  gradually  introduced,  as  the  story  unfolds. 

The  emphasis  on  the  direct  introduction  of  characters  and  events  in  the  Greek  fables  can  be  seen 
for  example  in  the  fable  entitled  'Farmer  and  Snake'  (A  51 ).  In  the  first  line,  all  of  the  major  characters 
of  the  fable  are  introduced  and  an  important  event  is  described: 

(31)  GeSrgou     paidaophis  herpusas  apekteinen.  (A51) 
of  fanner     son     snake  <creep>  <kill> 

The  snake  having  crept  up  killed  the  fanner's  son.' 

The  remainder  of  the  fable  relates  how  the  farmer  in  revenge  attempts  to  kill  the  snake  with  an  axe,  but 
missing  the  snake  splits  a  rock  in  half.  The  farmer  then  attempts  to  reconcile  himself  with  the  snake. 
The  fable  ends  with  the  snake's  reply  that  no  reconciliation  is  possible,  and  with  the  statement  of  the 
moral: 

(32)  ho  k>gos  d§loi,  hoti  hai  megalai  ekhthrai  ou  raidias  tas  metallagas  ekhousi.  (A  51) 
The  story  shows  that  great  hatreds  do  not  have  easy  reconciliations.* 

Thus  we  can  see  that  the  Greek  fable  proceeds  in  a  descending  manner  in  which  the  major  characters 
and  the  situation  are  introduced  at  the  beginning.  The  remainder  of  the  fable  describes  the 
resolution  of  this  situation  and  gives  the  statement  of  the  moral. 

In  the  corresponding  Sanskrit  fable,  the  theme  of  a  fanner's  son  being  killed  by  a  snake  is  also 
present,  though  the  events  narrated  do  not  center  around  this  event.    Rather,  the  story  reads  like  a 
desaiption  of  the  farmer's  experiences  during  a  particular  period  of  his  life.  The  first  line  of  the 
Sanskrit  fable  introduces  only  the  farmer  (the  brahman)  and  does  not  give  any  further  background  for 
the  story: 

(33)  asti    kasmimscidadhisthSneko'pibrihmanah   (P3.6) 
<be>  certain  place  brahman 

There  was  in  a  certain  place  a  certain  brahman.' 

The  next  two  lines  talk  about  the  unproductiveness  of  his  farmwork.  and  his  falling  asleep  under  a 
tree.   The  snake  is  only  very  casually  introduced  in  the  fourth  line: 

(34)  anatklDre  valmrkopari  prasSritabrhatphatatoparn  bhTsariam  bhujarngamarn  drstv8 
cJntaySm      5sa  (P3.6) 

'Having  seen,  not  far  away,  above  an  anthill,  a  frightening  serpent  having  a  great  swelling 
hood  extended,  he  reflected:' 

In  the  following  lines,  the  relationship  that  devek>ps  between  the  farmer  and  the  snake  is  detailed. 
Eager  to  become  more  successful,  the  farmer  decides  to  honor  the  snake  (whom  he  perceives  as  a 
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divinity),  and  daily  presents  the  snake  with  milk.    In  return,  he  finds  one  gold  dinar  each  day  at  the  hole 
of  the  snake.  It  is  only  in  the  fifteenth  line  that  the  farmer's  son  is  introduced  into  the  narrative,  and 
again,  the  introduction  is  not  direct  but  rather  incidental: 

(35)  athaikasmin  divase  valmTke  ksfranayanaya  putram  nirupya  brahmano gramam  jagama. 
"Then  one  day,  the  brahman  went  to  the  village,  having  appointed  his  son  for  taking  the 
milk  to  the  ant-hill.'   (P  3.6) 

The  son,  who  also  receives  a  dinar  for  his  milk-bringing,  concludes  that  the  home  of  the  snake  must 
be  filled  with  dinars,  and  he  attempts  to  kill  the  snake.    But  the  snake  lashes  back,  killing  the  tx>y.   The 
boy's  cremation,  and  the  farmer's  reaction  on  his  return  are  described,  and  then  the  moral  is  given: 

(36)  bhQtan  yo  nSnugrhnSti  grhnati  saranagatarn 
bhOtarthas  tasya  nasyanti  hamsSh  padmavane  yathS  (P 3.6) 

'He  who  does  not  treat  well  his  creatures  and  abuses  the  one  coming  to  him  for  refuge, 
his  possessions  perish  just  as  the  geese  in  the  lotus  forest.' 

As  can  be  seen  in  the  gradual  and  casual  introduction  of  characters,  the  Sanskrit  story  unfolds, 
showing  a  cumulative  style.  The  Greek  approach  is  much  more  direct.  The  actions  follow  one  after 
another  in  direct  consequence  of  each  other. 

These  two  approaches,  descending  and  cumulative  presentation,  are  also  reflected  in  the 
syntactic  constructions  used  in  the  fables.  While  both  languages  have  great  flexibility  in  word  order, 
we  found  that  the  subject  is  usually  fronted  in  the  first  lines  of  the  Greek  fables,  while  the  subject 
appears  near  the  end  of  the  sentence  in  the  first  lines  of  the  Sanskrit  fables;  cf.  e.g.  the  examples  in 
(31)  and  (33).  However,  a  broader  study  of  the  syntactic  correlates  is  necessary.  For  example,  it  may 
be  that  the  non-fronting  in  Sanskrit  follows  from  the  formulaic  beginning  astikasmimscicl(ci.  'once 
upon  a  time')  of  many  Sanskrit  first  lines.  But  the  evidence  which  we  have  collected  so  far  points 
toward  descending  presentation  in  Greek  and  cumulative  presentation  in  Sanskrit. 

4.0     Discussion 

Thus  in  the  areas  considered,  the  Sanskrit  fables  tend  to  be  more  'nominal'  than  their  Greek 
counterparts;  they  show  a  circular  organization;  and  their  presentation  of  information  is  cumulative.  In 
contrast,  the  Greek  fables  tend  to  be  more  'verbal',  exhibiting  a  linear  organization  and  a  descending 
manner  of  presentation.  While  these  conclusions  are  only  preliminary  to  an  investigation  of  rhetorical 
stmcture,  they  are  a  beginning  and,  based  upon  these  findings,  we  make  the  following  observations. 

First,  it  is  interesting,  and  even  ironic,  that  in  telling  a  story  Sanskrit  uses  the  nominal  style,  while  in 
stating  facts  Greek  uses  a  verbal  style.  This  finding  should  be  considered  in  light  of  Jacobi's 
observation  (Jacobi  1903;  see  also  Hock  1975)  that  the  nominal  style  is  more  closely  associated  with 
scientific  and  abstract  thought. 

Second,  it  is  clear  that  rhetorical  organization  is  independent  of  content,  since  the  fables  which 
we  compared  narrate  similar  events;  nor  does  the  purpose  of  presentation  affect  rhetorical 
organization,  since  fables  are  the  vehicle  for  the  expression  of  a  moral.  We  conclude  that  the  different 
rhetorical  structures  used  in  the  fables  are  the  result  of  their  being  written  according  to  the  norms  of 
their  respective  traditions  and  cultures.  In  this  view,  our  findings  offer  evidence  in  support  of  current 
research  claims  about  the  linearity  and  directness  of  western  thought  and  the  circularity  and 
digressiveness  of  'oriental'  thought. 

Finally,  in  considering  to  what  degree  a  written  text  uses  a  nominal  or  verbal  style,  we  would 
suggest  the  introduction  of  some  quantifiable  means  of  assessing  the  noun-like  or  verb-like  qualities 
of  lexical  items  which  are  not  so  much  nouns  or  verbs  per  se,  but  nominalizations  and  verbal 
derivatives  (such  as  participles  and  absolutives).  While  the  precise  formulation  of  this  quantifiable 
system  requires  further  consideration,  as  a  beginning  we  would  suggest  the  introduction  of  some 
ranking  scheme  which  could  be  applied  in  a  stylistic  analysis  based  on  whether  a  lexical  item  is  more 
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like  a  noun  or  verb  in  the  features  for  which  it  inflects.   We  suggest  for  example  the  following  ranking 
scheme: 

(37)       Noun  =  5 

Nominalization  =  4 
Participle  =  3 
Absolutive  =  2 
Finite  Verb  =  1 

Once  the  lexical  elements  of  a  text  have  been  ranked,  a  low-scoring  text  can  be  considered  as 
showing  a  more  verbal  style;  a  high-scoring  text  can  be  considered  as  showing  a  more  nominal  style. 

NOTES 

^Our  research  benefited  greatly  from  discussions  with  Professors  L.  Zgusta,  M.  Marcovich  and 
especially  R.  Pandharipande.  We  are  also  grateful  to  Professor  H.  Kahane  for  his  detailed  comments 
on  an  earlier  draft  of  this  paper.  Above  all  we  are  indebted  to  Professor  H.  l-lock  for  his  assistance  and 
encouragement  throughout  our  research. 

^References  beginning  with  £  are  found  in  HERTEL  1908;  references  beginning  with  A  are 
found  in  HAUSRATH  1962. 

^From  the  Vindobonensis  recension,  a  version  of  the  fable  based  on  the  Augustana  recension 
(version  II  in  HAUSRATH  1962). 

^There  is  a  slight  rewording  (of  two  words)  between  the  first  and  second  appearance  of  the 
verses. 
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INTERACTIONAL  SOCIOLINGUISTICS  AND  GENDER 
DIFFERENTIATION  IN  NORTH  INDIAN  SPEECH 


Tamara  M.  Valentine 


This  paper  explores  how  using  an  interpretive 
approach  to  language  can  help  to  account  for  both  the 
shared  grammatical  knowledge  and  the  differences  in 
communicative  styles  in  terms  of  female-male  gender 
differences.   The  paper  examines  the  continuous 
verbal  interaction  between  male  and  female  speakers 
in  conversations  and  explores  each  speaker's 
recurrent  speech  patterns  and  verbal  strategies.   The 
analysis  is  based  on  spoken  data  and  literary-based 
contexts  to  illustrate:  first,  a  differential  use  of 
language  strategies  and  speech  patterns  by  male  and 
female  speakers  in  cross-sex  conversation;  and 
second,  that  a  shared  knowledge  of  language, 
background  assumptions,  and  contextualizatlon 
conventions  are  essential  in  order  to  initiate 
negotiation  for  a  successful  and  effective  verbal 
interaction.   When  sociolinguistic  rules  are  not 
shared  by  speakers,  then  they  must  exercise  a  great 
deal  of  verbal  work,  and  often  the  result  is 
communication  breakdown. 

Conversational  Cooperative  Principle 

The  theoretical  principle  on  which  this  analysis  is  based 
is  that  of  Gumperz's  (1982a)  conversational  cooperation.  That 
is,  a  conversation  is  seen  as  an  on-going  process  of 
cooperation  and  negotiation,  a  continuous  interacting  and 
sharing  between  speaker-listener.   It  is  a  process  by  which 
participants  assess  each  other's  intentions  and  on  which 
responses  are  based.   However,  as  one  knows,  in  order  for  two 
participants  to  negotiate,  it  is  essential  that  they  first  lay 
some  groundwork. 

In  conversation,  in  order  for  negotiative  management  to 
begin,  a  certain  amount  of  shared  expectations  and  actions  need 
to  be  signalled.   These  signals  are  known  as  contextualizatlon 
conventions  and  are  defined  as  "constellations  of  surface 
features  of  message  form  [which]  are  the  means  by  which 
speakers  signal  and  listeners  interpret  what  the  activity  is, 
how  semantic  content  is  to  be  understood  and  how  each  sentence 
relates  to  what  precedes  or  follows."  These  cues  are  acquired 
as  a  result  of  a  speaker's  communicative  experience  which  is 
based  on  her/his  social  interpersonal  networking  position 
(role,  status,  etc.)   The  individual  acquires  different  cues 
depending  on  her/his  cultural  background,  gender,  social 
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status,  and  other  factors  in  a  lifetime.   Once  acquired, 
contextualization  conventions  operate  automatically  and  are 
habitually  used  and  perceived.   Therefore,  contextualization 
conventions  must  be  examined  beyond  the  traditional  linguistic 
analysis  and  studied  in  process  and  in  context  rather  than  in 
the  abstract. 

Inclusive  under  these  verbal  cues  are  prosodic  features, 
style  switching,  and  verbal  strategies  (i.e.  formulaic 
expressions,  lexical  and  syntactic  options,  conversational 
openings  and  closings,  topic  initiation,  flow,  and  shift,  etc.) 
which  carry  conversational  information  and  indicate  discoursal 
expectations.   When  these  interpretive  processes  are 
understood,  taken  for  granted,  and  go  unnoticed  between 
partners,  then  conversation  goes  on  successfully  as  planned. 

Even  though  all  speakers  have  the  ability  to  produce 
grammatical  sentences,  linguistic  competence  is  not  enough  for 
an  effective  conversation;   communicative  competence  must  be 
taken  into  consideration.   For  example,  one  speaker's 
knowledge  of  the  contextualization  conventions  and  her/his 
networking  relations  may  differ  from  those  of  another  speaker. 
Often,  due  to  one  participant  being  unaware  of  the  functions 
and  the  meaningfulness  of  the  relevant  cues  which  are 
exhibited  by  another  speaker,  mis judgments ,  misunderstandings, 
and  the  failure  to  communicate  occur.   Such  miscommunication 
has  been  illustrated  in  a  number  of  studies  on  conversation 
between  participants  of  different  linguistic  and  cultural 
backgrounds  (Heath  1983,  Gumperz  1982a,  Mishra  1982,  Tannen 
1982,  Young  1982).   This  observation  can  be  extended  to  the 
concepts  of  cline  of  proficiency  and  the  notions  of 
intelligibility,  interpretability ,  and  comprehensibility  in 
the  non-native  varieties  of  English  around  the  world  (see 
Kachru  1981,  Smith  ed.  1981,  Smith  and  Nelson  1985). 

This  study  operates  within  a  framework  which  assumes  that 
variability  in  linguistic  structure  is  a  systematic  feature  of 
language.   Gender  is  seen  as  one  of  several  social 
parameters.   It,  like  other  social  variables,  is  participant 
information  which  can  be  obtained  and  conveyed  through 
communication  (Cook-Gumperz  1981,  Dittmar  and  von  Stutterhelm 

1981,  Dodderidge  and  Simonet  1981,  French  1981,  Gumperz  1982a, 
Gumperz  and  Cook-Gumperz  1982,  Hansell  and  Seabrook  Ajirotutu 

1982,  Keim  1981,  Maltz  and  Borker  1982).  Gender  refers  to  the 
speech  marker  which  identifies  the  speaker/user  as  a  member  of 
a  particular  cultural  group,  marks  the  message  identifiable 
with  the  particular  group,  and  determines  an  individual's  role 
in  a  verbal  interaction.   Gender  is  communicated  through 
language  as  it  is  perceived  in  particular  contexts.   Hence 
within  this  approach  gender  is  seen  as  a  symbol  which  is 
communicatively  produced  and  has  meaning  in  conversation.  The 
question  arises,  then,  to  what  extent  are  the  social  knowledge 
and  linguistic  signs  and  functions  shared  by  female-male 
speakers  and  how  are  they  signalled  by  conversationalists. 
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Ob'iervatlons  and  Data 

This  paper  examines  typical  natural  instances  of  speech 
events  which  on  the  surface  have  all  the  features  of  the  nort 
of  turn-taking,  rules  of  question-answering,  conventions  of 
topic  initiation  and  success,  topic  flow  and  shift,  etc. 
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Analysis 

Given  the  above  observations  about  language  use  in  verbal 
interactions,  let  us  now  consider  a  few  examples  of  two-speaker 
conversations,  i.e.  female  and  male  Indian  Hindi  speakers,  and 
ask  the  following  questions:   1.   What  are  the  participants 
doing?   and  2.  How  are  they  accomplishing  it? 

Oral  Hindi  exchanges 

In  the  first  actual  verbal  exchange  below  (I),  two  native 
speakers  of  Hindi,  speaker  M,  the  husband,  and  speaker  F,  the 
wife,  are  conversing  with  each  other.   A  native  speaker  of 
Hindi  might  not  necessarily  find  the  conversation  out  of  the 
ordinary  nor  have  any  difficulties  in  getting  a  general  idea  of 
what  is  happening.   I  want  to  suggest,  however,  with  the  help 
of  the  interpretive  approach,  that  despite  the  conversation's 
simpleness  and  ordinariness  on  the  surface,  there  are  a  number 
of  peculiarities  of  which  only  speakers  M  and  F  are  aware.   And 
only  because  they  share  certain  background  knowledge  and 
communicative  skills  do  they  achieve  effective  communication 
with  each  other. 


M;   bartan  bhii  to  dhone  the. 

The  utensils  also  had  to  be  washed. 
F:   bartan  dho  diye  the  ra'af  ne  raat  ke  shyaam  ko. 

(I)  had  washed  the  night  utensils  in  the  evening, 
M:   ...abhii  ke  saare  bartan  bhii  to  hal  ... 

...There  are  all  the  Just  used  utensils  also... 
F;   are  abhii  ke  rahane  do,  phir  dhul  Jaayege. 

(pause) 
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Okay,  let  the  Just  now  ones  remain,  they  will  get 
washed  later.   (pause)   Hey,  don't  open  the  tap, 
its  making  noise. 

In  episode  (I)  above,  Hindi  speaker  M  the  husband  is 
preparing  tea  in  the  kitchen.   He  notices  that  there  are  dirty 
utensils/vessels  in  the  sink.   He  informs  his  wife,  Hindi 
speaker  F,  that  bartan  bhii  to  dhone  the  "the  utensils  (also) 
had  to  be  washed'  (1).  F's  response  to  M  is  bartan  dho  diye  the 
m'aT  ne  raat  ke  shyaam  ko  "I  had  washed  the  utensils  from  the 
(previous)  night  (just)  this  evening'  (2).  Rather  than 
explicitly  referring  to  those  utensils,  it  appears  that  speaker 
F  does  not  acknowledge  what  speaker  M  has  said  at  all,  but 
leaves  much  unsaid.  Yet,  if  we  look  closely  there  is  a  relation 
between  utterances  (1)  and  (2)  when  we  assume  that  speaker  M's 
intention  is  to  inform  his  wife  that  the  utensils  have  not  been 
taken  care  of  yet,  and  they  really  should  be  washed. 

From  another  perspective  it  may  be  that  the  female  speaker 
F  is  not  aware  of  the  dirty  vessels  in  the  sink.   If  we  look  at 
the  internal  evidence  it  is  more  likely  that  she  does  know 
about  the  dirty  utensils  to  which  M  is  referring.   In  utterance 
(2)  F  is  informing  M  of  some  vessels  she  had  already  washed. 
But  underlying  this  utterance  is  an  indirect  signalling  that 
she  does  not  feel  like  washing  more  utensils  at  this  moment. 
Speaker  M  somewhat  alters  the  conversation  as  is  evident  by  his 
uttering  abhii  ke  saare  bartan  bhii  to  h^^f  "There  are  all  the 
just  used  utensils  also'  (in  (3)).   He  does  not  treat  F's 
remark  (in  2)  as  irrelevant  but  clarifies  and  rephrases  his 
first  utterance  (1)  making  it  known  to  F  that  there  are  freshly 
used  vessels  which  need  to  be  washed.  And  in  fact,  the  above 
assumptions  are  correct  because  female  speaker  F  finally 
directly  acknowledges  male  speaker  M's  intention  and  replies 
are  abhii  ke  rahane  do,  phir  dhul  jaay'ege  "Okay,  let  them  stay, 
they'll  get  washed  later'  ( «l ) . 

There  are  a  number  of  points  which  need  to  be  discussed 
about  verbal  exchanges  in  general,  and  this  actual  Hindi 
exchange  in  particular.  First,  a  single  utterance  can  often  be 
understood  in  a  number  of  different  ways.   So,  one  may  ask,  how 
does  the  listener  recognize  the  speaker's  intentionality  and 
understand  the  appropriate,  expected  meaningfulness  of  each 
utterance?  Although  a  dialogue  is  made  up  of  single  utterances 
each  carrying  information,  a  listener  rarely  treats  each 
individual  word  or  sentence  separately.   For  if  a  language 
user  were  to  approach  conversation  in  this  manner  the 
utterances  would  seem  disjointed  and  not  coherent.   Therefore, 
the  intended  meaning  to  be  conveyed  can  only  be  relevant  and 
understandable  if  the  interaction  is  taken  as  a  single  whole 
and  each  sentence  relates  to  what  precedes  or  follows. 

Second,  such  an  exchange  as  (I)  will  have  more  meaning  or 
significance  to  those  participants  who  share  the  expectations 
about  how  the  intentions  are  to  be  communicated.  The 
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interlocutors  base  their  responses  on  what  linguistic  and 
sociocultural  knowledge  they  share  and  on  what  is  happening  at 
the  time  of  the  interaction.   With  this  information  they  can 
negotiate  minimally  and  accomplish  the  goal  of  effective 
communication  (i.e.   M's  informing  F  of  the  dirty  utensils). 

Speakers  M  and  F  are  relying  on  some  shared  understanding, 
i.e.  role,  status,  rights/duties  in  the  home,  daily  routines, 
expectations  of  home  activities,  etc.  That  the  utensils  are 
usually  washed  by  female  speaker  F  in  this  family  household 
makes  it  easy  for  her  to  recognize  that  her  husband's 
utterances  (in  1  and  3)  are  indirect  requests  for  her  to  clean 
the  recently  dirtied  utensils.  Speaker  F's  reply  (in  U )  implies 
that  they  will  be  done  later  (by  her),  but  to  leave  them  in  the 
kitchen  sink  for  now.   This  utterance  confirms  the 
interpretation.  To  reach  this  goal  the  interlocutors  are 
relying  on  indirect  inferences,  which  have  been  built  upon  past 
communicative  experiences  of  their  family  habits. 

Third,  from  this  actual  on-going  conversation  we  are  able 
to  recognize  that  the  two  participants  are  verbally  cooperating 
and  negotiating  with  each  other.  That  the  two  participants  have 
seen  beyond  the  surface  forms,  and  that  the  indirect  messages 
are  understood,  are  evidenced  in  the  successful  outcome,  i.e. 
female  speaker  F  appropriately  replies  are  abhii  ke  rahane  do, 
phir  dhul  Jaayege  'Okay,  let  those  utensils  remain,  they  will 
get  washed  later'  (1). 

Fourth,  in  addition  to  noting  the  Interpersonal  behavior 
between  speakers  and  the  simple  surface  features  of  mutuality 
of  conversation,  other  linguistic  skills  contribute  to  the 
successful  outcome.   As  has  been  pointed  out  by  a  number  of 
scholars  in  language  and  gender  research  (Bennett  1982,  Eaklns 
and  Eaklns  1976,  Edelsky  1981,  Flshman  1978,  1980,  1983. 
Leet-Pellegrini  1980,  Sattel  1983.  West  1979.  West  and 
Zimmerman  1983.  Zimmerman  and  West  1975.  to  name  a  few),  the 
dlscoursal  work  done  by  male  and  female  speakers  in  American 
English  cross-sex  conversations  is  accomplished  differently. 
Verbal  strategies  which  are  available  to  all  speakers  are 
motivated,  understood,  and  sometimes  interpreted  differently. 


Similarly,  in  previous  non-Western  studies  (Valentine  1985. 
1986),  It  is  revealed  that  when  female  and  male  Hindi  or  Indian 
English  speakers  verbally  interact  they  adopt  a  differential 
use  of  dlscoursal  strategies  and  skills.  Speech  patterns  such 
as  initiation  and  maintenance  of  discourse  topic  with  question 
and  statement  forms,  topic  flow  and  shift,  and  organizational 
linguistic  devices,  i.e.  acknowledgments,  formulaic 
expressions,  and  speaker  selection  patterns  are  varlantly  used 
by  male  and  female  Hindi  speakers. 

Returning  to  example  (I)  above,  it  is  observed  that  the 
use  of  indirect  statements  to  function  as  requests  or 
inquiries  Is  a  verbal  skill  held  by  the  male  Hindi  speaker 
(see  utterances  1  and  3).   Even  though  the  husband's  (M's) 
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statements  do  not  require  a  reply  from  his  wife  (F),  she 
Offers  one.  It  appears  that  speaker  F  has  interpreted  speaker 
M's  intention  oorreotly  for  M's  inquiry  is  answered  by  F  (4) 
when  she  tells  him  to  leave  the  utensils  and  she'll  attend  to 
them  later.  As  we  have  seen  above,  speaker  M  is  not  only 
informing  speaker  F  that  there  are  dirty  utensils  in  the  sink, 
but  also  implying  that  sinoe  she  holds  the  role  of  dishwasher 
In  the  house  they  be  washed  by  her. 

Similarly,  the  use  of  forms  of  acknowledgments  which  show 
explicit  agreement  or  negation  of  the  previous  speaker's 
utterance  is  a  common  strategy  adopted  by  Indian  female 
speakers.  In  example  (I)  female  Hindi  speaker  F's  use  of  the 
Hindi  acknowledgment  are  (1)  is  simply  showing  explicit 
agreement  with  male  speaker  M's  utterance,  letting  him  know 
that  she  has  heard  and  understood  the  intention  of  his 
indirect  requests.   This  type  of  acknowledgment  is  one  way  of 
signalling  a  shared  knowledge  of  the  situation  on  the  part  of 
F,  the  female  speaker.   This  must  indeed  be  the  case  because 
after  acknowledging  M's  remarks  with  are  (line  M)  and  telling 
him  the  dishes  will  be  washed  later,  she  initiates  her  next 
utterance  with  an  exclamation,  are,  and  switches  the  topic. 


In  addition 
organizational  d 
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to  these  features,  the  female  speaker's  use  of 
evices  such  as  boundary  markers,  fillers,  and 
are  observed  in  other  actual  Hindi  exchanges 
below  between  a  husband  and  a  wife  (speakers  M 
ely)  the  differential  use  of  such  devices  is 

this  particular  household,  the  normal  routine 
er  usually  attends  the  children's  school  for 
ter  he  has  returned  from  the  school,  the  wife, 
her  husband,  speaker  M,  what  was  happening 
e  refreshments,  who  was  there,  etc.  The  male 
ds  to  each  question  appropriately.   As  he 
ts  at  the  school,  female  speaker  F  develops 
conversation  by  encouraging  her  husband's  talk 
e  amount  of  regulating  markers  such  as  hra,  hlia 
acchaa  "good',  and  question  forms  at 
nated  places  in  her  husband's  speech.   The 
hat  the  female  speaker  F  is  setting  the  rhythm 
tion  is  correct  because  the  male  speaker 
's  questions  appropriately  on  time  and 
rate  the  happenings  at  the  school. 


M: 


inke  k 
("Did 
h^a,  m 
("Yes, 


Mrs.  K 
nahtr 
("Mrs. 
was  ta 


laas  me  gaye  the  bacco  ke? 

you  go  to  their  class,  the  children's?') 

sff  pahale  Amraag  ke  klaas  me  gayaa 

I  first  went  to  Amrag's  school.') 

hm 
to  wah'aai 
achorski  thii.   apne  par  koii  matlab  khaas  baat 
huuii  unse. 

Kachorski  was  there.   But  nothing  in  particular 
Iked  about.') 
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F:  hm 

M:  Amraag  Singh  dlkhaa  rahe  the 

apnaa  klaa^ruuro 

("Amraag  Singh  wa'*  showing  his  classroom.') 
F:  hm 

M:  ki  kahaa  kah^a  baiThte  hai 

("where  everyone  sits') 
F:  hm 

H:  paac  cenTer  hal  unke  to  hafte  ke  paac  din  matlab  they 

go  to  different  centers. 

("They  have  five  centers,  five  days  a  week,  this 

means ' ) 
F:  haaT  haa       ^^ 

M:  kahii  pe  Maths  karte 

hai  kahlf  pe  ek  compuuTar  paRaa  huaa  hai  kahii  pe 

kitaabe  paRii  huii  hat 

("Somewhere  they  do  Math,  a  computer  is  somewhere,  and 
books  are  lying.  '  ) 
F:  hm 
M:  aur  kyaa  thaa...? 

("And  what...') 
F:  ...aur  anne  ke  yah'aa  kyaa  hai? 

...and  did  you  go  to  Anne's? 
M:  Anne  ke  yahaff"  to  der^ho  gayii  thii  ab  to  jaate  kyJ^kii 

pahale  Amraag  ke  yah'aa'  gayaa  thaa 

("It  was  late  to  go  to  Anne's  because  first  I  had  gone 

to  Amraag  '  s  .  '  ) 
F:  hm 

M:  tils  minaT  to 

wah'aa'  lag  gaye. 

("Thirty  minutes  were  taken  up  there.') 
F :  acchaa 

("Okay.  ') 
M:  phir  waapas  Jaa  rahe  the... 

("When  I  was  going  back...') 
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When  an  Indian  male  and  female  speaker  interact,  then, 
either  the  two  participants  have  the  same  set  of  verbal  skills 
to  interact,  i.e.  hold  the  same  s«ts  of  discoursal  strategies 


suoh  a 

strate 

inform 

as  I  h 

that  e 

the  me 

use  of 

That  i 

a  s  s  u  m  p 

commmu 

amount 

sociol 

commun 

dlstin 

conver 

effort 

adapt 

flexib 

attltu 


s  those  11 
gles  are  u 
ation,  and 
ave  shown 
ach  speake 
anlngfulne 

the  other 
s  there  Is 
tlons.  Sin 
nlcatlng  w 

of  sharin 
ingulstlc 
Icatlve  pa 
ct  socloll 
satlon  It 

to  be  awa 
their  own 
lllty  (Gum 
de  In  conv 


sted  above 
sed  depend 

social  eo 
above.  Wh 
r  Is  somew 
s  s  ,  p  u  r  p  0  s 
's  skills/ 

a  sharing 
ce  males  a 
1th  each  o 
g  of  the  k 
rules.  In 
tterns  are 
nguistie  d 
is  essenti 
re  of  the 
strategies 
perz  1982a 
ersation, 


,  or  dif 
Ing  on  s 
ntext . 
at  is  mO 
hat,  eve 
e,  funct 
strategi 

Of  ling 
nd  femal 
ther  dal 
nowledge 

Other  w 

governe 
Iscoursa 
al  that 
Other  pa 

to  achi 
).  When 
miscommu 


ferent  soo 
hared  know 
The  former 
re  probabl 
n  if  unccn 
ion,  and  1 
es  for  ver 
uistic  and 
es  are  con 
ly,  there 

Of  each  0 
ords,  alth 
d  and  dire 
1  strategi 
each  parti 
rtlclpant  • 
eve  commun 

speakers 
nlcation  1 


lolingulstic 
ledge,  participant 

is  highly  unlikely 
e,  however,  is 
sciously,  aware  of 
ntention  of  the 
bal  communication. 

sociocultural 
s  t  a  n  1 1  y 
must  be  some 
t  h  e  r  »  s 

ough  cross-sex 
cted  by  the 
es,  for  a  fuller 
cipant  make  an 
s  skills  and  to 
icatlve 
adopt  this 
s  less  likely. 


Conversational  Exchanges  from  Literary  Texts. 

Episodes  extracted  from  written  Hindi  and  Indian  English 
fiction  writing  similarly  show  some  cases  where  the  male  and 
female  interlocutors  share  expectations  about  how  the 
intentions  are  to  be  communicated  and  where  communicative  goals 
are  met. 
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III. 
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Vishwa,  rummaging  through  his  drawer,  sayss,  "I 
forgot  to  pick  up  my  shirt  at  the  laundry.   Now  I  have 
no  clean  shirts.  I  wanted  to  take  a  shower." 

"aap  Jaakar  nahaalye.   maT  kamiiz  nikaale  detli  h'u vT.  • 
kiran  ne  uThte  hue  kahaa.   uThkar  apnii  almaarii 
kholii  aur  ek  ghulii  kamiiz  nikaalkar  vishwa  ko  pakraa 
dii.   wakt-bewakt  ke  liye  hameshaa  kuch  kapRe  apnii 
almaarii  me  rakhtii  hai. 

"GO  take  a  shower.   I'll  get  you  a  shirt",  Kiran 
said  getting  up.   Getting  up  she  opened  the  closet  and 
taking  out  a  clean  shirt  handed  it  to  Vishwa.   For 
such  times  she  always  kept  some  clothes  in  the  closet. 


In  example  ( 
J  Anand  (ig**?: 
11-defined.  Th 
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IV)  from  the  English  fiction  writing  of  Mulk 
12),  The  untouchable,  the  context  is  more 
le  brother,  Bakkha,  enters  the  house  after  a 

his  sister,  Sohini,  is  trying  to  light  a 
ha  disappointedly  finds  there  is  no  water  in 
(line  1)  nor  in  the  pitcher  (line  3)  he 
ne  7).  His  sister  Interprets  her  brother's 
e  3  and  his  exclamation  in  line  7  as  meaning 
to  get  the  water,  would  you  get  it'?   That  the 
as  seen  beyond  the  surface  meaning  of  oh  is 
the  shared  understanding  between  brother  and 
erpersonal  networks.   She  is  able  to  perceive 
>e  contextualization  cues  which  are  influenced 
mmunicative  habits,  i.e.  the  family  chores, 
ter,  etc.   The  brother's  verbal  and  nonverbal 
clamation  oh  I  (line  7)  lead  the  female 
to  the  actual  meaning  of  the  male  speaker's 
her  decision  to  meekly  offer  to  get  the  water 
t  this  interpretation  is  correct  is  verified 
rmal  uttering  of  the  affirmative  Hindi 
cchaa  *okay'  (line  11). 


fo 


He  put  the  earthen  pan  over  the  little  stove. 

*There  is  no  water  in  that,'  his  sister  said. 

*I  will  get  some  water  from  the  pitcher,'  he  said, 

as  he  casually  made  towards  the  corner. 

'There  is  no  water  in  the  pitcher  either,'  she 
swered. 

*0h!'  he  exclaimed  under  his  breath,  tired  and 
asperated.  And  for  a  moment  he  stood  defeated  where 
had  bent  down  to  the  pitcher. 

"I  shall  go  and  get  some  water,'  said  Sohini 
ekly. 

''Acha,'  agreed  Bakkha  without  any  show  of 
rmality . 


As  the  above  examples  intimate,  even  before  the  interaction 
takes  place  the  participants  infer  what  the  interaction  is 
about  and  what  is  expected  of  them.   All  extralinguistic  cues 
(such  as  interpersonal  networking  patterns,  family 
expectations,  gender  roles,  etc.)  and  contextualization 
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eonventlonn  (such  as  indirect  requests,  question-statement 
forms,  minimal  responses,  etc.)  are  considered  before  making  a 
response.   But  once  involved  in  the  verbal  exchange,  the 
responses  or  negotiating  procedure  relate  to  what  the  speaker 
intends,  (e.g.  getting  a  clean  shirt  or  washing  the  dishes)  and 
how  the  listener  interprets  the  utterances  and  responds  (by 
getting  the  shirt  or  washing  the  dishes.) 


VV;      ana      vvi;      ot:i.uw,     wneie     uue     i/U     a     xcii^k     ui      ^iiai   J-nf,     aiiu 

negotiating  between  participants,  the  utterances  are 
Interpreted  literally.  Because  there  is  a  lack  of 
conversational  cooperation  the  situations  become  extremely 
awkward,  and  misunderstanding  and  miscommunication  occur. 


In  example  (V)  from  the  Indian  English  writer  Romen  Basu's 
short  story  A  gift  of  lOve  (ig?**)  an  impatient  female  speaker 
purposely  understands  the  literal  meaning  of  the  question  "Do 
you  have  courses  in  marine  engineering?"  addressed  to  her  by  a 
male  speaker.   And  in  example  (VI)  from  Bhabendranath  Saikia's 
short  story  The  cavern,  (I98I)  a  female  speaker  understands 
the  routine  expression  "I'll  meet  you  some  other  time"  to  mean 
*we  will  meet  another  time,  let's  set  a  date'. 

V. 

"Now,  what  is  it  you  wish  to  register  for?"   she  asked 

impatiently. 

"Do  you  have  courses  in  marine  engineering?" 
"I  don't  have  anything.   I  only  work  here. " 

...I  had  more  questions  to  ask  but,  feeling  discouraged, 

kept  silent. 

VI. 

"It's  very  late  now,   I'll  meet  you  some  other  time." 
The  words  tumbled  out  of  Mr.  Prener's  lips. 

"Will  you,  really?"   Mary's  eyes  brightened.   "Thank  you 
very  much  sir.   Thank  you  very  much." 

In  these  examples,  although  the  participants  may  agree  on 
the  Overall  purpose  of  the  interaction,  there  are  differences 
in  the  expectations  and  rights,  as  well  as  in  the 
interpretation  of  the  discoursal  strategies  involved.  When  the 
interlocutors  notice  the  relevant  cues,  understand  the 
speaker's  choice  of  strategy,  and  are  aware  of  their  intentions 
the  interpretive  processes  are  taken  for  granted  and  the 
conversation  is  created,  developed,  and  sustained.  Such  an 
awareness  helps  to  establish  personal  contact  and  to  negotiate 


157 


the  proper  Interpretations,  However,  in  passages  (V)  and  (VI) 
above  one  participant  is  not  willing  to  adapt  or  to  anhieve 
communicative  flexibility.   One  speaker  reacts  differently  to 
the  cue  than  expected  and  the  outcome  is  misinterpretation  and 
misunderstanding. 

If  we  turn  our  attention  to  the  Hindi  exchange  (VII)  below, 
an  excerpt  from  Upendranath  Ashk's  (1976)  tauliye ,  we  may  get 
the  Impression  that  the  conversation  between  the  male  and 
female  speakers  is  successful,  otherwise  it  would  have 
collapsed  long  ago.   But  after  a  closer  look,  we  see  that  the 
conversation  is  not  developing  as  smoothly  as  it  could. 

In  this  episode  the  female  participant  Madhu  (M)  and  the 
male  participant  Vasant  (V)  are  discussing  the  subject  of 
appropriate  social  behavior.  However,  if  we  look  at  the 
dialogue  closely  we  find  that  the  male  Hindi  speaker  (V)  does 
not  consistently  and  appropriately  interpret  the  female  Hindi 
speaker's  (M)  verbal  strategies.   Although  the  conversation 
progresses,  there  is  a  non-sharing  of  soclollnguistic  knowledge 
between  the  Interlocutors,  and  the  interpretations  become 
misdirected,  discoursal  strategies  come  into  conflict, 
frustration  erupts,  and  miscommunlcation  seems  inevitable. 

VII.  ^  ^  . ^ 

1.  V:    ...-  maT  Jab  ha'staa  huuT,  Jll  khol  kar  haTstaa 

htju'  aur  Isllllye  uushil  aur  nimmO.  .  . 

When  I  laugh,  I  laugh  openly  and  that's  why  Ushl 

and  Nlmmo. . . 

2.  M;    aapne  phir  uushil  aur  nlmmo  kii  baat  cheRli. 

mujhe  h^snaa  buraa  nahTf  lagtaa.   par 
samay-kusamay  kaa  bhll  dhyaan  honaa  caahlye.  us 
din  paarTll  me  aate  hil  uushil  ne  mere  kaan  par 
cuTakll  le  111  aur  nlmmo  ne  merll  'aa'khe  band  kar 
111.   koli  samay  thaa  us  tarah  ke  h?sii.mazaak 
kaa.   muJhe  h'Ssli-mazaak  se  nafrat  nahTT, 
badtamllzll  se  nafrat  hai. 

Again  you  bring  up  Ushl  and  Nlmmo.   I  don't  hate 
laughing.  But  keep  in  mind  the  time  and  the 
place.   At  a  party  one  day ,  Ushl  covered  my  ears 
my  eyes.   It  wasn't  the  time 
I  don't  hate  laughing,  but  I  do 


and  Nlmmo  covered 

for  such  a  Joke. 

hate  bad  manners. 

3. 

V: 

uushil .  . . 
Ushl... 

t|. 

M: 

parle  sir  kii  bad 

badtamilz  hai.   madan  kl  warS-gaaTh 
ke  din  wo  sab  aaye  the.   nlmmo  itnli  cancal  hai 
par  wo  to  balTh  gayli  ek  taraf  ye  nawaabzaadll 
aa  balThii  mere  samne  salNDal  samet,  t'Sa'ge 
pasaare  aur  wo  uske  gande  salNDal  -  merll  saaRli 
ke  bilkul  nazdlik  aa  gayel  aap  is  badtamllzll  ko 
shauk  se  pasand  kare,  mai  ise  hargiz  bardaasht 
nahtr  kar  saktli.   Jise  balThne,  uThne,  bolne  kaa 
salllkaa  nahTT,  wo  aadmil  nahTf  -  pashu  hai 
On  the  other  hand,  she  has  bad  manners.   They  all 
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came  to  my  birthday  party.   Nimmo  i>\    usually 
restless  but  she  sat  on  one  side,   but  the 
princess  sat  right  In  front  of  me  with  her 
sandals,  sprawling  and  those  dirty  sandals  -  they 
came  so  close  to  my  sari.   You  may  like  this  bad 
behavior,  but  I  can't  approve  of  it.   One  who 
doesn't  know  how  to  sit,  stand,  and  talk  is  not 


human  -  he/she  is  an  animarT 


J,       V:  pashu I   to  tum  mujhe  pashu  samajhtil  ho? 
tum  aadmli  kii  sahaj  bhaawanaacT  ko  nirmam 
warjnaa'S  kii  beRlyo  rt'e  se  bliadh  kar  raknaa 
caahtli  ho... 

Animal !  Then  you  consider  me  an  animal?   You  want 
to  keep  a  person's  natural  behavior  tied  up  in 
chains  of  rules. . . 

To  support  the  argument  that  communication  is  breaking  down 
in  this  female-male  speaker  exchange  the  following  details  are 
given.   First,  in  turn  (1),  after  talking  about  'laughter',  a 
new  discourse  topic  is  abruptly  introduced  by  the  male  speaker, 
Vasant  (V),  that  of  their  cousins  Uushii  aur  Nimmo  'Ushi  and 
Nimrao'.   The  female  speaker,  Madhu  (M),  acknowledges  and 
affirms  this  shift  and  repeats  in  her  turn  (2)  aapne  phir 
uushii  aur  nimmo  kii  baat  cheRli  "again  you  bring  up  Ushi  and 
Nimmo'.  She  then  resumes  her  talk  on  another  subject.   She 
begins  to  narrate  an  event  which  she  overtly  prefaces  with  us 
din  paarTli  me  *at  a  party  one  day'.  Vasant,  however,  does  not 
notice  or  maybe  does  not  understand  her  linguistic  marker  of 
extended  talk  for  he  abruptly  intervenes  with  a  new  subject  in 
(3)»  uushii .  Although  it  may  appear  that  the  female  speaker 
Madhu  has  completed  her  turn,  she  has  not  given  up  the  floor 
but  tries  again  to  continue  her  talk;  this  time  proceeding 
with  the  male  speaker's  topic  of  uushii  .  Moreover,  it  is 
Obvious  that  Vasant  does  not  interpret  the  progressive 
development  of  Madhu's  topic  "manners  and  appropriate 
behavior'  (2  and  H)    because  he  ignores  the  progression  and 
only  focusses  on  a  part  of  her  turn.  It  is  apparent  that  he 
has  misinterpreted  Madhu's  strategy  because  he  only  focusses 
on  Madhu's  final  statement  (in  »>),  jlse  balThne,  uThne,  bolne 
kaa  saalikaa  nahTr,  wo  aadmli  nahit  -  pashu  hai   one  who 
doesn't  know  how  to  act  is  not  human  -  but  an  animal'.   He 
interprets  Madhu's  development  of  topic  as  a  personal  attack, 
hence  shifting  the  topic  in  (5)  to  pashu!  to  tum  mujhe  pashu 
samajhtil  ho?  "Animal!   So  you  think  I  am  an  animal?'   As  a 
result,  the  communication  appears  irreparable  and 
communication  seems  to  be  hindered. 

Conclusion 
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flociocultural  assumptions  and  contextual! zatlon  conventions 
may  differ  according  to  speaker  as  well.   When  the  rules  are 
not  shared  or  no  effort  is  made  toward  communicative 
flexibility  the  conversation  may  break  down  and  conversational 
cooperation  may  not  be  achieved. 

Understanding  the  strategies  and  patterns  in  cross-sex 
conversations  can  offer  insight  and  help  clarify  some  of  the 
many  problems  of  discourse  in  societies  today.  Such  research 
can  show  how  the  social  diversity  and  variation,  not  only  in 
terms  of  varied  gender,  but  ethnicity,  age,  social  class,  etc. 
affect  conversational  management  and  can  provide  insight  into 
the  processes  of  discourse  in  heterogeneous  speech 
communities,  i.e.   communities  of  varied  gender,  age,  and 
social  status  as  well  as  of  different  languages. 

NOTES 


Firth  (1957)  and  Malinowski  (1932)  similarly  shared  the 
view  of  including  the  entire  cultural  and  social  setting  in 
which  participants  interacted  and  looked  beyond  the  individual 
speaker.   Their  use  of  context  Of  situation  included  a 
description  of  the  regularities  of  social  life,  typical  roles 
and  situations.  That  is,  once  a  speaker  enters  a  conversation, 
the  interaction  is  in  a  sense  already  determined.   Speakers 
come  into  a  conversation  with  a  certain  set  of  social  roles, 
habits,  styles,  etc.  and  are  not  actually  free  to  do  or  say 
whatever  they  want.   The  interpretive  approach  enlarges  the 
scope  of  rules  governing  language  use  to  include  communicative 
acts,  events  and  appropriateness.   From  this  perspective  the 
speaker's  extensive  knowledge  of  language  use  underlies 
effective  communciatlon . 

2 

The  main  reason  the  term  gender  is  used  in  this  paper  as 

opposed  to  the  term  sex  is  that  the  former  has  meaning  in 

terms  of  culture  whereas  the  latter  is  basically  a  biological 

attribute. 

^The  Indian  couple  were  visiting  scholars  from  Tehrl 
Garhwal,  Uttar  Pradesh.   The  wife  was  a  Post-Doctorate  in  the 
Department  of  Botany  and  the  husband  was  a  Post-Doctorate  in 
the  Department  of  Chemistry.   They  were  in  their  late-30's,  had 
two  grammar  school-aged  children  (a  boy  and  a  girl),  and  lived 
for  two  years  in  graduate  student  housing  at  the  University  Of 
Illinois,  Urbana-Champaign.  Their  native  language  was  Hindi, 
which  they  spoke  in  the  home,  and  among  family  members  and 
friends.  In  all  other  verbal  interactions,  they  spoke  English. 

The  couple  was  asked  to  tape  record  their  conversations 
during  various  times  of  the  day  in  their  home.   The  chunks  of 
data  in  question  were  those  taped  after  the  family  dinner  when 
the  husband  and  wife  usually  sat  down  together  and  had  tea. 
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The  interpretation  of  meaning  was  checked  with  other 
speakers  of  similar  social  and  linguistic  backgrounds. 

The  English  text  of  Upendranath  Ashk's  tauliye  is  fror 
Towels ,  translated  by  Edward  S.  Haynes  and  Romesh  Shonek. 
Indian  Literature  (1981)  21:3.65-85. 
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THE  GOVERNMENT  AND  BINDING  ANALYSIS  OF  NEPALI  EOUi  AND 
SUBJECT-RAISING    CLAUSES* 

William  D.  Wallace 


The  principles  of  the  Govemment  and  Binding  (GB)  framework  treat  distinctly 
clauses  which  are  traditionally  labeled  EQUI  and  subject-raising  clauses.  In  this  paper,  I 
shall  examine  the  GB  analysis  of  such  clauses  and  discuss  the  analysis  with  data  from 
Nepali.    First,  I  shall  describe  Nepali  EQUI  and  subject-raising  constructions.  In  so 
doing,  we  shall  observe  three  distinctions  between  these  constructions  which  are 
idiosyncratic  to  the  grammar  of  Nepali.  I  shall  then  apply  the  GB  principles  to  Nepali 
EQUI  and  subject-raising  clauses. 

The  principal  focus  of  the  paper  will  be  to  determine  whether  the  GB  analysis 
con-ectly  predicts  the  distinctions  Nepali  makes  between  EQUI  and  subject-raising 
constructions.  We  shall  see  that  the  GB  analysis  does  generally  make  the  correct 
predictions  and  provides  further  insights  into  the  grammar  of  Nepali.  However,  the 
basic  analysis  of  these  constructions  fails  to  predict  accurately  one  feature  of  Nepali 
EQUI  constructions,  indk:ating  that  further  research  is  needed  on  the  theory  and  on 
Nepali  EQUI  and  subject-raising  constructk)ns. 


I.    Nepali  EQUI  and  subject-raising  constructions 

Among  the  complement  structures  of  Nepali  are  two  which  require  the  oblk^ue  infinitive  in 
-Xia-^  The  complement  structure  illustrated  in  (1)  and  (2)  represents  the  Nepali  EQUI  con- 
structk)n.2  The  class  of  EQUI  verbs  includes  ihalnu  1o  start',  saknu  1o  be  able",  cahanu  1o  want*, 
and  kbOJDU  10  try*.  All  the  verbs  in  this  class  appear  in  similar  syntactk:  stnx^tures,  and  t  shall  use 
only  one  or  two  for  purposes  of  illustration.^ 

(1)  (a)   ma       bhdkJekhi       aDDaHna    jana       thalchu      (0149) 

l-nom.   tomorrow-from   offica-in         go-inf.     start-prs.lS 
'I  shall  start  going  to  the  offfce  tonwrrow' 
(b)  ranvle     gau-ma    mal  sama      thalyo    (0149) 

Ram-erg    village-ln   thing-obj.  movo-inf.  star1-pst-3MS 
'Ram  has  started  to  move  his  things  to  the  village' 

(2)  (a)   yo   khola    ko  tama      sakcha  ra  (0140) 

this  river-ob]  who-nom  cross-inf  can-prs-3MS  Q 
"Who  can  possibly  cross  this  riverT^ 
(b)  mai-le  tyo  oTW        paDhna  sakina        (0 140) 
l-«rg     that    letter-obj  read-inf    can-pst-neg-1  S 
'I  coukJ  not  read  that  letter* 
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The  other  complement  structure  is  formed  with  the  oblique  infinitive  plus  lagnu.  This 
structure,  illustrated  in  (3),  I  shall  refer  to  as  the  Nepali  subject-raising  construction.^  IflQOU  is  the 
only  verb  of  this  subject-raising  class. 

(3)  (a)   buwa         kata     jana    lagnubhayo 

father-nom  where  go-inf    begin-hon-pst 
"Where  was  father  going?" 
(b)  ram  at>a  bhat  khana  lagyo 

Ram-nom  now    rice     eat-inf  ljegin-pst-3MS 
•Ram  was  beginning  to  eat  his  meal* 

Syntactically,  the  EQUI  and  subject-raising  structures  appear  very  similar.  Both  require  an 
infinitive  complement  with  subject-verb  agreement  indicated  on  the  matrix  verb;  cf.  (1-3).  Both 
stmctures  permit  intransitive  (1a,  2a,  3a)  and  transitive  (lb,  2b,  3b)  complement  clauses.  Both 
allow  complements  formed  from  passive  clauses  (4-5)  and  dative-subject  clauses  (6- 7). 5 

(4)  (a)   us-ki      svasni      Dakina    saktina 

him-gen  wife-nom  call-ps-inf  can-prs-neg-3FS 
'His  wife  cannot  be  summoned' 
(b)  patrapatrikaharu-ma  akalajhakala  nepali  adhunik  katha-ka 

magazine-pl-ln  by-flts-and-starls  Nepali    modern     story-gen 

maulika  svarupharu  dekhina     thaleka       hun   (SK11) 
original    form-pl-nom    see-ps-inf  start-pp-MP  be-prs-3MP 
The  original  forms  of  modern  Nepali  short  stories  have  started  to  appear 
sporadically  in  magazines' 

(5)  (a)   sviTar-ko       agaDi-pachaRi-ko  paTa  joDine      lageko  thiyo   (SK  210) 

sweater-gen   front-back-gen  side     join-ps-inf  begin-pp-MS  be-pst-3MS 

"The  front  and  back  sides  of  the  sweater  were  beginning  to  be  put  together* 
(b)  utaraheko     ghar-agaDi-ko     kauDa-ma  hariyo  cana         poll        utsah-sath 
be-vlsible-pp   house-front-gen    sun-in  green   chick-pea  burn-cp  enthusiam-with 

"horha"  tayar  garina      lageko         thiyo  (SK  53) 
horha       ready  do-ps-inf  begin-pp-MS  be-pst-3MS 

'In  front  of  the  visible  houses,  "horhas"  were  beginning  to  be  readied  with 
enthusiam  by  baking  the  green  chick  peas  in  the  sun" 

(6)  (a)   ma-lai  bhok      lagna  thalcha 

me-dat  hunger  feei-inf  start-prs-3MS 
'I  am  starting  to  get  hungry' 
(b)  ma-lai    rames-ko      ghrina   lagna  khojyo 

me-dat    Ramesh-gen  hate        feel-inf  try-pst-3MS 
'I  tried  to  hate  Ramesh' 

(7)  (a)   ma-lai  bhok      lagna  lagyo 

me-dat  hunger  feel-inf  begin-pst-3MS 
'I  was  t>eginning  to  get  hungry' 
(b)  keTi-lai  jaro    huna    lagyo 

girl-dat     fever  be-inf    begin-pst-3MS 
'The  girl  was  beginning  to  get  a  fever* 

Alttiough  the  Nepali  EQUi  and  subject-raising  constructions  have  these  similarities,  they  also 
have  several  syntactic  differences.  As  shown  above,  both  EQUI  verbs  and  lagnu  may  appear  with 


infinitive  complements,  iagnu  also  p>ermlts  finite  complements,  as  in  (8).  In  this  construction,  no 
suface  subject  of  lloou  appears. 

(8)  (a)   kohi  ae  jaslo  usAai    lagyo  (SK99) 

some-nom  come-pp  as        her-dat  ^eel-pst-3MS 
"It  seemed  to  her  as  though  someone  had  come* 
(b)  ma-lai  mai-le  kahf  bhul  bhane       jasto  lagena  (SK191) 

me-dat  l-erg      some  mistake-obj  say-pst-1S  as       feel-pst-neg-3MS 
'It  didnl  seem  to  me  as  though  I  had  said  something  wrong' 

Similarly,  Iagnu  may  appear  in  noncomplement  structures  with  (9a)  and  without  {9b)  a 
grammatical  subject.^  The  EQUI  verbs,  on  the  other  hand,  always  require  grammatical  subjects; 
cf.  (10).^  Thus,  Iagnu  does  not  require  a  grammatical  subject,  whille  EQUI  verbs  generally  do  in 
their  basic  forms.  And  this  distinction  carries  over  to  the  complement  constmctions  each  verb 
class  allows. 

(9)  (a)   hami     ghar    lagyau 

we-nom  house  set  out-pst-1  P 
"We  set  out  toward  home' 
(b)  ranvlai    bhok    lagcha 

Ram-dat  hunger  feel-prs-3MS 
'Ram  is  hungry* 

(10)  (a)   hami-le  kam     thalyau 

we-erg     jotKJbj  start-pst-3MP 
'We  started  the  job' 
(b)  *kohi  ae  jasto  thalyo 

some    come-pp   as        start-pst-3MS 

••There  started  as  though  someone  had  come'  (Cf.  (1-2).) 

Among  the  examples  cited  thus  far,  we  can  see  another  difference  between  EQUI  and 
subject-raising  constructions  in  Nepali.  When  used  in  noncomplement  stmctures,  EQUI  verbs 
require  ergative  matldng  on  their  surface  subjects  (11a),  while  Iagnu  requires  nominative  marking 
(lib).  This  suggests  that  the  basic  forms  of  EQUi  verbs  all  subcategorize  direct  objects  and  are 
treated  by  Nepali  speakers  as  transitive  verbs.  Iagnu.  however,  is  treated  as  an  intransitive  verb 
in  noncomplement  stmctures. 

(11)  (a)   ram-le     (*-0)       kam      thalyo 

Ram-erg.  (*-nom)    job-obj.    stan-pst-3MS 
'Ram  started  the  job' 
(b)  ram-0     (*-le)    ghar      lagyo 

Ram-nom  C-erg)  house    set  out-pst-3MS 
'Ram  set  out  toward  home' 

In  the  complement  stmctures  we  have  examined,  somewhat  different  nxjrphological  princip- 
les apply  when  the  matrix  verb  is  in  one  of  the  perfective  tenses:  Both  complement  constmc- 
tions require  nominative  marking  for  the  matrix  subject  when  the  complement  infinitive  is  intransi- 
tive (12)  or  passive  (13).  Note  that  in  general,  boVh  intransitive  (14a)  and  transitive  (14b)  senten- 
ces require  nominative-marked  subjects.  However,  when  the  complement  infinitive  is  transitive, 
then  the  morphology  of  EQUI  and  subject-raising  matrix  subjects  differs.  When  the  EQUI  comple- 
ment is  transitive,  then  the  matrix  subject  requires  ergative  marking  (15a).  But  when  the  subject- 
raising  complement  is  transitive,  then  the  matrix  subject  must  be  marked  as  nominative  (15b). 
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(12)  (a)   ram-0      (*-le)     ghar     jana    thalyo 

Ram-nom  (*-erg)     house   go-inf   start-pst-3MS 
•Ram  started  to  go  home' 
(b)  ram-0      (*-le)    ghar     jana    lagyo 

Ram-nom  (*-erg)    house  go-inf   begin-pst-3MS 
■Ram  was  beginning  to  go  home' 

(13)  (a)   keTl-0  (*-le)     Dakina     sakina 

glrl-nom  (*-erg)  call-ps-Inf  can-pst-neg-3FS 
"The  girl  could  not  be  sumnfwned' 
(b)  kam-0  (*-le)    garina      lagyo 

job-nom  (*-erg)  do-ps-inf  begin-pst-3MS 
'The  job  was  beginning  to  be  done' 

(14)  (a)   hami     ghar    gayau 

we-nom  house  go-pst-3MP 
'We  went  home" 
(b)  keTi       Dakii 

girl-nom  call-ps-pst-3FS 
The  girl  was  summoned 

(15)  (a)   ram-le    (*-0)      kam  gama  thalyo 

Ram-erg  (*-nom)  work   do-inf   start-pst-3MS 
'Ram  started  to  do  the  job' 
(b)  ram-0     (*-le)     kam  gama  lagyo 

Ram-nom  (*-erg)  work  do-inf   start-pst-3MS 
'Ram  was  beginning  to  do  the  job' 

Thus,  the  EQUI  construction  is  'transitive'  or  'Intransitive'  for  the  Nepali  subject-marking  mles 
depending  on  the  transitivity  of  the  complement  Infinitive.  In  the  subject-raising  constmction,  on 
the  other  hand,  lagnu  is  always  considered  'intransitive'  for  the  subject-marking  mles.^ 

A  third  distinction  is  found  between  EQUI  and  subject-raising  constructions:  Nepali  offers 
both  'intransitive'  and  Iransitive'  dative-subject  clauses,  as  illustrated  in  (16).®  The  former  (as  in 
(16a))  include  only  an  experiencer  NP  (ram)  and  a  sensation  NP  (bboh).  The  latter  (cf.  (16b))  offer 
an  experiencer  NP  (lam),  a  sensation  NP  (man),  and  a  patient  NP  (ti  mancheham^  which  controls 
verb  agreement. 

(16)  (a)   ranvlai     bhok-lagyo 

Ram-dat    hunger-feel-pst-3MS 
•Ram  got  hungry' 
(b)  ram-lai      ti         mancheham  man-pardainan 

■Ram-dat    those  man-pl-nom      mlnd-befall-prs-neg-3MP 
'Ram  doesn't  like  those  men' 

As  illustrated  above,  both  the  EQUI  (6a)  and  subject-raising  (7a)  construction  permit 
intransitive  dative-subject  clauses  as  complements.  EQUI  verbs  also  permit  transitive  dative- 
subject  clauses  to  appear  as  complements  (17).  Such  dative-subject  clauses,  however,  may  not 
appear  as  complements  in  the  subject-raising  construction  (18).  But,  both  EQUI  and  subject- 
raising  constmctions  allow  similar  dative-subject  clauses  when  the  'patient  NP'  is  in  a 
prepositional  phrase,  as  illustrated  in  (19).  Thus,  in  the  EQUI  constructions,  any  dative-subject 
clause  type  is  permissible.  But  the  subject-raising  construction  does  not  permit  complements 
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where  the  dative-subject  "patient*  would  control  verb  agreement  in  the  corresponding  finite 
clause. 

(17)  (a)    ram-lai      yi         kitapharu     man-pama      thaleka       chan 

Ram-dat    these  book-pl-nom  mind-bofall-inf  start-pp-MP  bo-prs-3MP 
■Ram  has  started  to  like  these  books' 
(b)   ama-lai  dherai      okhati  cahina  thalyo 

mom-dat  much-EM  medcine-nom  need-inf  start-pst-3MS 
'Mother  has  started  to  need  much  medicine' 

(18)  (a)   'ram-lai     yi        kitapharu  man-pama     I  age 

Ram-dat      these  book-pl        mind-befall-inf  begin-pst-3MP 
(b)  'ama-lai  dherai      okhati  cahina  lagyo 

mom-dat    much-EM  medicine-nom  need-inf  begin-pst-3MS 

(19)  (a)   ma-lai  rames-ko  ghrina-lagna  khojyo 

me-dat  Ramesh-gen         hate-feel-inf     try-pst-SMS 
'I  tried  to  hate  Ramesh' 
(b)  ma-lai  tyo  kitap-ko  yad-huna       lagyo 

me-dat  that  book-gen  memory-be-inf  begin-pst-3MS 
'I  was  Ijeginning  to  remember  that  book' 

In  an  analysis  of  EQUI  and  subject-raising  constructions,  we  want  to  account  for  the 
similarities  in  the  two  constructions:  Both  require  infinitival  complements  and  matrix  subjects  that 
control  verb  agreement.  And  we  want  to  account  for  distinctions  between  the  two  constmctions: 
(1)  subject-raising  vetbs  permit  finite  complements  where  no  matrix  subject  appears,  and 
nontinite  complements  where  a  matrix  subject  does  appear,  and  the  EQUi  constructions  only 
altow  the  latter  form;  (2)  where  the  complement  clause  is  transitive,  then  the  EQUI  constmction  is 
treated  as  transitive  for  subject-marldng  rules  while  the  subject-raising  construction  is  treated  as 
intransitive;  and  (3)  while  both  constmctions  allow  intransitive  dative-subject  clause  comple- 
ments, only  the  EQUI  construction  permits  the  transitive  dative-subject  clause  complement. 

II.  A  review  of  Government  &  Binding  principles^  ° 

Nepali  complement  structures  involve  various  GB  principles  of  grammar.  To  simplify  the 
discussion  of  EQUI  and  subject-raising  constructions,  I  shall  first  outline  the  basics  of  the  GB 
framework  on  which  the  foltowing  analysis  is  based.  Let  us  assume,  for  the  purposes  of  this 
section,  that  the  complement  structure  (20)  represents  both  EQUI  and  subject-raising  clauses. 

(20)  [NPnom]  -t-INFL  S  (S  [  NP*  -INFL  VP  [  (NPobj]  V-na]  ]  ]  V  (-t-AGR,  +TNS) 

The  categorial  rules  of  the  base  (21-22)  account  for  the  familiar  clause  stmctures  of  Nepali. 
Rule  (21)  accounts  for  the  SOV  clause  structure  of  Nepali.  Rule  (22)  accounts  for  the  expansion 
of  this  pattern  with  complement  clauses. 

(21)  (a)   S      -»  NP   INFL  VP.  where  INFL  includes  the  features  I±AgreementI,  I±Tense] 
(b)  VP    ■*  (NP)  V 


(22)  (a)    NP    -»  (S)   NP 
(b)   VP    -»(S)   V 
(C)    S      -»(COI^P)    S 
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All  lexical  items  are  specified  for  subcategorization  features  in  the  base.  Such  features  and 
the  clause  structures  of  (21-22)  are  retained  at  each  level  of  gramnnar  as  stated  in  the  Projection 
Principle  (Chomsky  1982:29): 

(23)  Prciecticn  Principle 

Representations  at  each  syntactic  level  (i.e.  LF,  and  D-  and  S-structure)  are 
projected  from  the  lexicon,  inthat  they  observe  the  subcategorization 
properties  of  lexical  items. 

Only  one  transformation  mle  is  necessary  to  account  for  the  nrevement  of  constituents. 
Move-g  leaves  NP-trace  in  the  position  from  which  the  affected  constituent  is  nvjved.  Move-a 
may  apply  to  move  elements  within  all  three  components  of  the  grammar  —  syntax,  phonological 
form  (PF),  and  logical  form  (LF). 

The  theory  of  government  defines  the  relationship  between  the  lexical  head  of  a  con- 
struction and  its  complements.  N,  V,  A,  P,  and  INFL  are  governors,  and  the  maximal  phrases  NP, 
VP,  AP,  PP,  and  S  are  absolute  barriers  to  government.  Thus  V  governs  the  object  NPs  it 
subcategorizes  within  VP;  V-na  governs  [NPobj]  in  (20).  And  +INFL  governs  the  subject  NP 
within  S;  +INFL  governs  [NPnom]  in  (20).  Note  that  -INFL  (-TNS,  -AGR)  is  not  a  governor,  and  so 
the  subject  position  of  an  infinitive  complement,  NP*  in  (20),  is  ungoverned. 

Case  theory  deals  with  the  assignment  of  abstract  Case  and  its  morphological  represen- 
tations. All  lexical  NPs  must  be  assigned  Case  at  S-structure,  or  the  Case  filter  will  be  violated 
(Chomsky  1982:49);  cf.  (24).  Case  is  assigned  to  an  NP  by  the  head  which  governs  it  within  its 
phrase.  Thus  the  Case  rules  in  (25)  hold;  cf.  (26).  Note  that  -INFL  is  not  a  Case-assigner;  there- 
fore, in  (20),  NP*  cannot  receive  Case  within  the  complement  clause. 

(24)  Case  Filter 

*NP,  if  NP  has  phonetic  content  and  no  Case. 

(25)  (a)   An  NP  govemed  by  +INFL  is  assigned  subject  Case. 

(b)  An  NP  governed  by  V  is  assigned  object  Case. 

(c)  An  NP  governed  by  P  is  assigned  oblique  Case. 

(d)  An  NP  govemed  by  V  is  assigned  dative  Case.''  ^ 

(26)  [NPsub]  +INFLVP[PP[  [NPoblJP]  [NPdat  y[  [NPobj]  V]] 
hami        gau-ma  ram-lai      dThi         paThauchau 
we-nom  viilage-in  Ram-dat   letter-obj  send-prs-3MP 

"we  will  send  a  letter  to  Ram  in  the  village* 

e-theory  concerns  the  assignment  in  Logical  Form  (LF)  of  semantic  roles  (8-roles),  such  as 
'agent',  'patient',  to  NP  arguments  that  have  referential  value.  All  positions  subcategorized  by  a 
lexical  head  are  assigned  S-roles.  Complements  of  V,  for  example,  are  assigned  e-roles  by  the 
verb.  INFL,  the  nonlexical  head  of  S,  does  not  assign  S-roIes.  Subject  NPs  may  or  may  not  be 
assigned  S-roles  by  VP.  Generally,  where  a  verb  assigns  Case  to  an  object  NP,  then  the  subject 
NP  receives  a  9-role  through  VP.  In  LF,  and  in  all  other  components  of  the  grammar  through  the 
Projection  Principle,  the  8-Criterion  must  be  met  (Chomsky  1982:36);  cf.  (27).  As  NPs  may 
receive  8-roles  through  trace,  the  residue  of  Move-a.  and  as  the  8-Criterion  must  be  met  at  D- 
and  S-structures,  NPs  cannot  be  moved  from  or  into  a  non-8-marked  position.  Otherwise,  the 
8-Criterion  will  be  violated. 
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(27)  9-CriteriQn 

Each  argument  bears  one  and  only  one  6-role,  and  each  6-role  is  assigned 
to  one  and  only  one  argument. 

Binding  theory  accounts  for  the  relationships  between  anaphors,  pronominals,  and 
referential  expressions  and  their  antecedents.  An  anaphor  must  be  bound  with  its  antecedent 
within  the  anaphors  governing  category,  that  is,  within  a  clause  containing  minimally  the  anaphor 
and  +AGR  or  a  subject  NP.  NP-trace  is  deemed  an  anaphor.  Pronominals  are  free  within  their 
goveming  categories.  And  referential  expressions,  such  as  names,  are  free. 

PRO,  a  nonlexical  pronoun,  is  a  pronominal  anaphor.  According  to  the  binding  theory,  it 
should  be  free  and  bound  within  its  goveming  category.  As  it  cannot  be  both.  PRO  is  not 
subject  to  binding  conditions.  The  theory  of  control,  however,  dictates  that  the  controller  of 
PRO  must  be  contained  within  the  goveming  category  which  contains  PRO. 

Note  that  PRO  and  NP-trace  are  in  complementary  distribution.  Trace  must  appear  in  a 
governed  position  which  is  assigned  a  8-role;  thus,  its  antecedent  must  be  in  a  position  that 
receives  Case  but  no  6-role.  PRO,  on  the  other  hand,  may  appear  in  ungovemed  positions.  Its 
controller  must  occur  in  a  position  that  receives  an  independent  S-role. 

As  we  can  see  from  these  principles,  the  concept  of  government  is  central  to  GB  theory. 
The  government  relationships  are  expressed  in  D-structure  by  the  subcategorization  rules  and 
the  Projection  Principle,  at  S-structure  by  Case  theory  and  the  Case  Filter,  and  at  LF  by  9-theory 
and  the  8-Criterion.  The  concept  of  government  is  also  crucial  to  the  analysis  of  EQUI  and 
subject-raising  constructions,  as  I  shall  discuss  in  the  next  section. 

III.  The  GB  analysis  of  Nepali  EQUi  and  subject-raising  constructions^^ 

Let  us  now  compare  the  D-Structures  ((a)-sentences)  and  S-Structures  ((b) -sentences) 
for  EQUI  (28)  and  subject-raising  (29)  constructions. 

(28)  (a)    NP  +INFL  S I S  [PRO  -INFL  VP  [  NP  V-na]  ]  ]  V  (EQUI) 

(b)  [NPsub]  +INFL  S  [  S  [  PRO  -INFL  VP  [  [NPobj]  V-na]  ]  ]  V  (EQUI) 

(29)  (a)   [NPe]  +INFLS[S(  NP*  -INFLVP  [NPV-na])]  laO" 
(b)  INP'sub]  +INFL  S[  t*  -INFLVP  [  [NPobj] V-na] )  Iflg- 

Movea  does  not  apply  in  the  EQUI  constmction  (28).  V-na  in  (28)  govems  and  assigns 
object  Case  to  the  NP  it  subcategorizes.  It  also  assigns  a  6-role  to  [NPobj]  at  LF. 

-INFL  in  (28)  is  neither  a  governor  nor  a  Case-assigner.  S  is  an  absolute  barrier  to  govern- 
ment of  constituents  in  the  complement  clause  by  constituents  in  the  matrix  clause.  Therefore, 
the  complement  clause  subject  position  is  ungovemed.  No  lexical  NP  can  hold  this  position; 
otherwise  the  Case  Filter  would  be  violated  at  S-stmcture.  Thus,  nonlexical  PRO  may  appear  in 
this  ungovemed  position.  PRO  is  assigned  a  e-role  by  VP  in  S.  PRO  is  controlled  by  the  matrix 
subject  NP  at  LF.  Thus,  in  EQUI  constructions,  the  subject  of  the  matrix  clause  is  the  antecedent 
for  nonlexical  PRO  in  the  complement  clause. 

The  matrix  subject  NP  in  (28)  is  govemed  by  +INFL.  +INFL  assigns  this  NP  subject  Case. 
And,  this  NP  receives  a  6-role  from  the  matrix  VP.  Thus,  the  6-Criterion  is  satisfied. 

Subject-raising  verbs  do  not  assign  6-roles  to  their  subjects.  Therefore,  no  lexical  NP  may 
appear  as  the  subject  of  the  matrix  clause  in  the  D-structure  in  (29a),  or  the  6-Criterion  would  be 
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violated.  Although  the  e-Criterion  must  be  satisfied  at  LF,  the  Projection  Principle  requires  it  to 
hold  at  D-structure  as  well. 

As  in  (28),  V-na  in  (29)  governs  and  assigns  object  Case  to  the  NP  complement  in  VP.  The 
verb  also  assigns  a  8-role  to  this  NP. 

-INFL  is  a  nongovemor  and  non-Case-assigner.  Therefore,  NP*  in  (29)  may  not  appear  in 
the  complement  subject  position  at  S-structure,  or  the  Case  Filter  would  be  violated.  If  NP*  were 
moved  into  the  matrix -clause  subject  position  by  Move  a ,  then  it  would  be  governed  by  +INFL 
and  receive  subject  Case  through  +INFL.  This  would  satisfy  the  Case  Fitter;  and  the  9-Criterion 
would  not  be  violated  because  subject-raising  verbs  assign  no  S-role  to  the  matrix  subject. 

Move  g.  however,  leaves  NP-trace,  and  NP-trace  must  be  governed.  As  NP-trace,  like  NP*  in 
(29),  cannot  be  governed  by  -INFL,  the  Government  &  Binding  theory  requires  that  subject- 
raising  verbs  induce  S-deletion.  S-deletion  applies  at  S-stmcture  (29b)  and  allows  the  matrix 
verb,  lag-,  to  govern  NP-trace  inthe  complement  subject  position.  Thus,  the  trace  of  NP*  in  (29b) 
is  governed  by  lajt;  it  also  receives  a  9-role  from  the  complement  VP  which  it  extends  to  NP*. 
The  Case  Filter  and  9-Criterion  are  thus  satisfied  at  S-stmcture. 

The  Government  &  Binding  framewor1<  thus  distinguishes  the  EQUI  arid  subject-raising 
constructions  on  several  grounds:  Primarily,  subject-raising  verbs  induce  S-deletion,  and  EQUI 
verbs  do  not.  The  subjects  of  EQUI  verbs  receive  8-roles  from  the  matrix  VP,  while  those  of 
subject-raising  verbs  do  not.  The  complement  clause  subject  position  is  filled  by  PRO  in  EQUI 
clauses,  but  by  NP-trace  in  subject-raising  clauses.  In  the  next  section,  I  shall  consider  whether 
these  theoretical  distinctions  account  for  and  explain  the  differences  discussed  in  Section  I 
between  Nepali  EQUI  and  subject-raising  constructions. 

IV.  Predictions  of  the  GB  analysis 

The  first  distinction  between  Nepali  EQUI  and  subject-raising  constructions  that  we  noted 
earlier  was  that  subject-raising  verbs  permit  finite  and  nonfinite  complement  clauses,  while  EQUI 
verbs  permit  only  nonfinite  complements.  The  explanation  for  this  lies  in  the  assignment  of 
8-roles  to  subject  NPs  and  the  application  of  the  8-Criterion. 

As  the  GB  analysis  of  subject-raising  clauses  indicates,  lagnu  requires  no  lexical  subject. 
The  subject  position  is  not  assigned  a  8-role  by  the  VP  in  which  lagnu  appears  as  verb.  Thus, 
this  position  need  not  be  filled  by  a  lexical  NP,  or  it  may  be  filled  by  an  NP  which  receives  a  8-role 
through  trace.  In  either  case,  the  8-Criterion  is  not  violated.  EQUI  verbs,  on  the  other  hand, 
assign  8-roles  to  subjects  through  VP.  Clauses  with  finite  EQUI  verbs  generally  require  lexical 
subjects,''^  and  this  NP  must  control  another  8-marked  subject  NP,  PRO  in  (28). 

This  theoretical  distinction  is  consistent  with  the  difference  between  Nepali  EQUI  and 
subject-raising  constructions,  lagnu  permits  both  nonfinite  (30a)  and  finite  complements  (30b). 
The  8-role  of  lam  in  (30a)  is  assigned  to  the  trace  of  ilm  by  the  complement  VP.  The  8-role  of 
kohl  in  (30b)  is  likewise  assigned  by  the  complement  VP.  In  neither  example  is  the  8-Criterion 
violated,  although  no  8-role  is  assigned  to  the  matrix  subject  position  by  the  matrix  VP. 

(30)  (a)   ram     kamgamalagyo 

Ram     workdo-inf  begin-pst-3MS 
'Ram  was  beginning  to  do  the  work' 
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(b)  kohi  ae  jastous-lai     lagyo 

somecome-pp  as       him-dat  feel-pst-3MS 

'It  seemed  to  him  as  though  someone  had  come' 

In  EQUI  constructions,  8-roles  are  assigned  to  matrix  subjects  by  the  matrix  VP  and  to  the 
complement  subject  by  the  confiplement  VP.  Thus,  in  (31a),  8-roles  are  assigned  to  hami.  the 
matrix  clause  subject,  and  to  PRO,  the  nonlexical  pronominal  anaphor  in  the  complement  clause 
subject  position  (cf.  (28)).  The  8-Criterion  is  not  violated  in  (31a);  however,  it  is  violated  in  (31b) 
where  the  complement  clause  is  finite  and  no  lexical  NP  appears  in  the  matrix  clause  subject 
position  (cf.  (30b)).  As  the  matrix  VP  In  (31b)  assigns  a  8-role  to  the  subject  position  and  no 
lexical  NP  appears  in  that  position,  the  8-Criterion  is  violated  and  the  sentence  is  ungrammatical. 
The  GB  analysis  of  EQUi  constructions  predicts  that  (31  b)  will  be  ungrammatical,  and  that  subject- 
raising  verbs  will  allow  both  finite  and  nonf  inite  complements.  Thus  the  theory  accounts  for  the 
first  distinction  between  Nepali  EQUI  and  subject-raising  constructions. 

(31)  (a)   hami  aba  kam  gama  thalciiau 

we       now  work  do-inf  start-prs-IP 
"We  Shall  start  to  do  the  vrork  now' 
(b)  *kohi  ae  jasto  thalyo 

some   come-pp  as         start-pst-3MS 

'There  started  as  though  someone  had  come' 

The  second  distinction  between  lagnu  and  the  Nepali  EQUI  verbs  Involved  the  contrast  in 
(32).  EQUI  constructions  (32a)  are  treated  like  transitive  clauses  for  the  purposes  of  morpho- 
kjgical  subject  Case  representations,  while  the  subject-raising  constructions  are  treated  as  in- 
transitive by  the  same  rule  despite  the  presence  of  a  transitive  complement  clause  (32b). 

(32)  (a)   ranvle     kam  gama  thalyo 

Ram-erg  work  do-inf   start-pst-3MS 
'Ram  started  to  do  the  work' 
(b)  ram-0      kam  gama  lagyo 

Ram-nom  work   do-inf  begin-pst-3MS 
'Ram  was  beginning  to  do  the  work' 

Ifi-marking  in  Nepali  is  sensitive  to  the  frame  [NPsub]  [NPobj]  [V]  (Wallace  1985:Chap.  II, 
§2.4.3).  Morphotogtoal  Case  representations  are  assigned  in  Phonototfcal  Form  (PF).  There- 
fore, at  PF,  EQUI  constructkjns  should  have  the  structure  [NPsub]  [NPobj]  [V],  while  sub- 
ject-raising constmctions  should  have  the  'intransitive'  structure  [NPsub]  [VP]. 

The  theoretical  distincttons  between  EQUI  and  subject-raising  in  the  GB  analysis  account 
also  for  this  difference.  At  S-structure,  the  input  to  PF,  (32a)  and  (32b)  woukj  appear  as  (33a) 
and  (33b),  respectively.  Phonotogteal  rules  cannot  'see'  the  features  of  nonlexical  PRO  in  (33a) 
(Chomsky  1982:21),  but  they  are  sensitive  to  trace  in  (33b). 

(33)  (a)  i£am+subl  +infl  s  [  s  [  PRO  -infl  VP  [  Ram+obD  [gama  ]  ]  ]  ]  [mil-] 
(b)  [£flm+sub]+iNFLS[[t) -INFL vp[ii2m-K)bi] [gama]  ]l[llg-] 

At  PF,  the  phionotogical  rule  of  S-pruning  (Hemon  1981)  deletes  nonbranching  nodes.  As 
this  process  ignores  PRO  but  sees  trace,  (33a)  and  (33b)  at  PF  appear  as  (34a)  and  (34b), 
repectlvely,  after  S-pruning  applies.  The  subject-raising  construction  (33b-34b)  is  unaffected 
by  S-paining.  However,  S-pruning  deletes  the  S-boundaries  in  the  EQUI  construction  (33a- 
34a).  The  complement  object  NP  becomes  a  constituent  of  the  matrix  clause  at  PF. 
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(34)  (a)  iLam+sub] +INFL  vpikam+objitoamal  ]]  Btial-] 

(b)  fram-t-subl  4-INFL  S  f  ft]  -INFL  VP  [  kam-Kibn  roama  1  ]]  [Hg-] 

The  product  of  S-pruning  for  EQUI  c»nstructions  is  a  "transitive"  clause  structure  to  wfiich 
the  jfi-marking  rule  is  sensitive.  As  a  result,  the  morphological  representation  of  subject  Case  in 
EQUI  matrix  clauses  is  Ifi.  In  subject-raising  clauses,  the  S-boundary  cannot  be  deleted  be- 
cause of  trace;  therefore  no  clause  union  process  occurs,  as  in  EQUI  clauses.  The  subject- 
raising  clause  matrix  subject  is  thus  marked  with  nominative  morphology. 

Again,  the  GB  principles  correctly  predict  the  distinction  found  in  Nepali  grammar.  And,  the 
analysis  raises  an  interesting  question:  If  EQUI  verbs  are  transitive  in  their  basic  form,  why  do  they 
not  always  require  ergative  marking?  The  answer,  in  the  GB  framework,  is  that  ergative  marking  is 
dependent  upon  syntactic  frames  and  not  verb  class.  As  transitive  verbs  generally  subcategor- 
ize  object  NPs,  they  will  usually  be  found  in  the  appropriate  syntactic  frame  for  ergative  marking  in 
the  perfective  tenses. 

However,  as  the  analysis  of  EQUI  clauses  demonstrates,  not  all  transitjye  structures  are  the 
result  of  subcategorization  features.  EQUI  verbs  subcategorize  an  NP  or  S,  the  sentence  com- 
plement. For  the  latter  structure,  S-pruning  is  necessary  for  an  object  NP  to  appear  as  a  con- 
stituent of  the  matrix  clause  containing  the  EQUI  verb  and  so  for  the  ergative-marking  rule  to 
apply. 

Finally,  let  us  consider  the  contrast  in  (35): 

(35)  (a)   rarrvtai     ti  kKapharu  marvpama      thaleka  chan 

Ram-dat    these  book-pl        mind-befall-inf  start  be-prs-3MP 

"Ram  has  started  to  like  these  books' 
(b)  *rarT>-lai    ti         kitapharu  man-pama      lage 

Ram-dat    these  book-pl        mind-befall-inf  begln-pst-3MP 
'Ram  was  beginning  to  like  these  books" 

As  (35)  indicates,  "transitive"  dative-subject  clauses  may  be  complements  of  EQUI  verbs 
(35a)  but  not  of  subject-raising  verbs  (35b)  in  Nepali. 

In  order  to  account  for  the  contrast  in  (35),  we  must  first  conskJer  the  GB  analysis  of  verb 
agreement  and  the  stmcture  of  Nepali  dative-subect  clauses.  Subject  NPs,  such  as  those  in 
(36),  are  govemed  and  assigned  Case  by  +  AGR,  an  element  of  -i-INFL.  Subject  NPs  govemed 
by  +INFL  are  co-indexed  with  +AGR.  Where  such  NPs  appear,  +AGR  is  moved  onto  the  verb  in 
PF  by  Rule-R,  producing  morphological  agreement  features  on  the  verb. 

(36)  (a)   hami    ghar   janchau 

we-nonti  house  go-prs-1  P 
"We  shall  go  home' 
(b)  uniharu-le  kamgare 

they-pl-erg  work  do-pst-3MP 
"They  did  the  work" 

As  illustrated  in  (37),  the  togical  subject  of  dative-subject  clauses  appears  in  the  dative 
Case.  While  these  experiencer  NPs  control  other  subject  properties  in  dative-subject  clauses, 
the  unmarked  sensation  NP  (Miok,  man.  and  Dar  in  (37))  controls  none  (Wallace  1985: 
Chap.lV).l4 
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(37)  (a)   ram-lai      bhok-lagya 

Ram-dat    hunger-feel-pst-3MS 

•Ram  got  hungry"  (lor  Ram  there  was  hunger*) 

(b)  ram-lai     ti         kitapharu  manisarchan 
Ram-dat   these  book-pl       mJr>d-be'all-prs-3MP 

'Ram  likes  these  txx>ks'  (Ihese  txx)ks  please  Ram') 

(c)  ram-lai     badar-dekhi  Dar  lagcha 
Ram-dat    monkey-from  fear  feel-prs-3MS 

•Ram  fears  monkeys'  ('monkeys  frighten  Ram') 

In  a  GB  analysis  of  dative-subject  clauses,  each  NP  of  the  dative-subject  clause  is  subcate- 
gorized  by  the  verb  (Hermon  1981 ;  Wallace  1985).  The  dative-subject  VP  then  assigns  no 
6-role  to  the  subject  position.  This  suggests  the  D-stmcture  (38a)  for  dative-subject  clauses. 
The  verb  in  (38a)  govems  the  object  NPs  and  assigns  S-roles  to  them.  However,  as  dative- 
subject  clause  verbs  are  intransitive,  they  do  not  assign  Case  to  these  NPs.  [NPexp]  in  (38a) 
receives  dative  Case  from  V.''^  if  a  patient  NP  appears,  as  [NP*]  in  (38a),  then  it  cannot  receive 
object  Case  from  the  verb.  The  fact  that  patient  NPs,  such  as  ti  kitapham  in  (37b),  control  verb 
agreement  suggests  that  NP*  in  (38a)  is  governed  and  assigned  Case  by  +AGR. 

(38)  (a)   [NPe]  +INFL  VP  [  [NPexp]  V  [  ([NP*])  [NPsen]  V  ]  ] 
(b)  [NPe]  VP  [  [NPdat]  V  [  [NP*]  [NPsen]   V+AGR  J  ] 

Chomsky  suggests  (1982:256ff)  that  languages  such  as  Nepali  which  do  not  require  lexical 
subject  NPs  altow  +AGR  to  be  moved  onto  the  verb  at  S-stnjcture  rather  than  at  PF  by  Rule-R, 
as  indicated  in  (38b).  This  allows  the  subject  position  to  be  ungovemed;  no  lexical  NP  appears 
in  this  position.  And  +AGR  govems  any  NP  govemed  by  the  verb  to  which  it  attaches  through 
Rule-R.  Thus,  +AGR  in  (38b)  woukJ  govern  NP*.  (38b)  would  be  the  appropriate  S-structure  for 
(37b).  Thus,  at  S-structure,  ti  kitapham  in  (37b)  (compare  with  NP*  in  (38b))  is  govemed  by 
+AGR.   +AGR  is  a  Case-assigner,  and  so  the  patient  NP  of  a  dative-subject  clause  may  receive 
subject  Case.  The  Case  Filter  and  9-Criterion  are  then  satisfied  at  S-structure. 

Let  us  now  apply  these  principles  to  the  analysis  of  EQUI  and  subject-raising  clauses  like 
those  in  (35).  In  both  (35a)  and  (35b)  the  complement  clause  is  a  Iransitive'  dative-subject 
clause.  If  this  complement  clause  were  a  finite  clause,  the  patient  NP,  tikitapharu.  would  control 
verb  agreement  as  in  (37b). 

The  D-structure  of  (35b)  is  represented  in  (39a).  Move  a  applies  vacuously  in  this  subject- 
raising  clause,  because  both  the  complement  and  matrix  subjects  are  empty  categories  not 
assigned  S-roles;  the  Case  Filter  will  not  be  violated  if  neither  of  the  [NPe]s  receives  no  Case,  as 
both  are  nonlexical. 

(39)  (a)   [NPe]  +INFL  S [  S  [ [NPe]  -INFL  VP [ [NPexp]  V [  NP*  [NPsen]-V-na ] ] ]  jlSa- 
(b)  [NPe]  +INFL  S  [  [NPe]  -INFL  VP  [  [NPdat]  V  [  NP*  [NPsen]-V-na  ]  ]  ]  ]  lag- 

The  S-structure  of  (35b)  then  would  be  (39b).  While  the  experiencer  NP  in  (39b)  may  be 
assigned  dative  Case  by  V,  the  patient  NP  is  not  govemed  by  a  Case  assigner.  Rule  R  must 
apply  vacuously  in  the  complement  clause,  because  -INFL  contains  -AGR,  which  is  not  a  Case- 
assigner.  Therefore,  no  element  of  the  complement  clause  may  assign  Case  to  the  patient  NP; 
and  no  constituent  of  the  matrix  clause  may  govern  and  assign  Case  to  the  patient  NP  because 
VP  is  an  absolute  barrier  to  government.  Thus  the  patient  NP  receives  no  Case  at  S-structure, 
and  the  Case  Fitter  is  violated. 
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In  contrast,  where  the  position  of  the  patient  NP  is  filled  by  an  NP  in  a  prepositional  phrase, 
as  in  (37c),  the  NP  would  be  assigned  oblique  Case  by  the  preposition.  When  such  a  clause 
appears  as  a  complement  to  lagnu  in  the  subject-raising  construction  (40),  then  the  sentence  is 
grammatical.  No  NP  in  (37c)  or  (40)  depends  on  +AGR  for  Case  assignment:  therefore  the  Case 
Filter  is  not  violated.  Thus  the  ungrammaticality  of  (35b)  and  the  contrast  between  (35b)  and  (40) 
follow  directly  from  GB  principles  of  Case  assignment  and  government. 

(40)  ma-lai  kitap-ko  yad-huna        lagyo 
me-dat  book-gen  memory-be-inf  begin-pst-3MS 
■|  was  beginning  to  remember  that  book" 

In  contrast  to  subject-raising  constructions,  the  Nepali  EQUI  construction  alk)ws  dative- 
subject  clauses  with  patient  NPs,  as  in  (37b),  to  appear  as  its  complement  (41a).  Dative-subject 
clauses  with  prepositional  phrase  'patients'  may  also  appear  as  complements  of  EQUI  verbs  (41b). 

(41)  (a)   rarrvlai     ti         kitapharu  man-pama      thaleka       chan 

Ram-dat   these  book-pl       mind-befall-inf  start-pp-MP  be-prs-3MP 
'Ram  has  started  to  like  these  books' 
(b)  ma-lai    rames-ko  ghrina-lagna  khojyo 

me-dat   Ramesh-gen         hate-feel-inf      try-pst-3MS 
'I  tried  to  hate  Ramesh' 

Although  (41a)  is  grammatical,  the  same  theoretical  problems  arise  in  its  analysis  which  we 
encountered  in  the  discussion  of  subject-raising  clauses.  The  D-structure  of  (41  a)  is  illustrated 
in  (42a).  At  S-structure  [NPexp]  may  be  assigned  dative  Case  by  V  in  (42b);  however.  NP*  is 
not  governed  by  a  Case-assigner.  Where  Rule-R  applies  in  finite  forms  of  such  clauses  (37b) 
and  allows  +AGR  to  govem  and  assign  Case  to  the  patient  NP,  in  (42b)  Rule-R  applies  vacuously 
because  -INFL  is  not  a  Case  assignor. 

(42)  (a)   [NPe]  +INFL  S  [  S  [  [NPe]  -INFL  VP  [  [NPexp]  V  [  NP*  [NPsen]-V-na  ]  ]  ]  ]  mil- 
(b)  [NPe]  +INFL  S  [  S  [  [NPe]  -INFL  VP  [  [NPdat]  V  [  NP*  [NPsen]-V-na  ]  ]  ]  ]  Itmi- 

As  discussed  earlier,  at  PF,  S-pruning  applies  to  EQUI  clauses,  deleting  non-branching 
nodes.  Also  at  PF,  Rule-R  woukJ  move  the  matrix  +AGR  onto  the  matrix  EQUi  verb.  This  combin- 
ation could  allow  the  matrix  +AGR  to  govem  and  assign  nominative  Case  to  NP*  at  PF.  However, 
this  would  only  account  for  the  grammaticaiity  of  (41a)  at  PF;  at  S-structure.  the  Case  Filter  would 
be  violated  as  with  subject-raising  clauses. 

Even  if  rules  like  S-pruning  and  Rule-R  applied  at  S-stmcture,  VP  in  the  complement 
clause  is  still  technically  a  bar  to  government  by  constituents  outside  the  VP.  To  have  the  EQUI 
matrix  verb  in  this  example  induce  S-deletion  and  VP-deletion  would  be  a  very  inelegant 
solution.  And  an  alternative  analysis,  that  the  patient  NP  is  the  subject  of  the  dative-subject 
clause  and  the  matrix  EQUI  clause,  as  illustrated  in  (43),  would  lose  much  of  the  generality  in  the 
analysis  of  Nepali  dative-subject  clauses  (Wallace  1985:  Chap.  XI.  §1 1 .1). 

(43)  ti  kitapharu  +INFL  S  [  S  [  PRO  -INFL  VP  [  ram-lai  V  [  man-pama  ]  ]  ]  ]  Um|- 

Thus,  the  GB  analysis  does  not  appear  to  account  for  the  grammaticaiity  of  (41a).  Rather 
than  extending  this  discussion.  I  simply  point  out  that  this  is  a  problem  for  the  basic  principles  we 
have  examined  in  this  paper,  and  that  a  more  detailed  analysis  of  such  examples  will  be 
necessary. 
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V.    Some  further  thoughts 

Although  the  GB  analysis  leaves  some  questions  concerning  the  Nepali  EQUI  construction, 
it  also  suggests  an  insight  into  the  assignment  of  8-roles  to  experiencer  NPs.  As  observed 
earlier,  the  VP  which  has  the  constituent  lignu  assigns  no  8-role  to  its  subject  in  the  examples  of 
(44).  Yet  apparently,  the  subject  NP  of  (45)  is  assigned  a  8-role. 

(44)  (a)   ma-lai    bhok-lagyo 

ma-dat    hunger-feel-pst-3MS 
'I  got  hungry' 

(b)  ram  kam  gama  lagyo 

Ram    work  do-lnf.  begin-pst-3MS 
'Ram  was  beginning  to  do  the  work' 

(c)  kohl    ae  jasto  us-lai     lag/o 
some   come-pp  as        him-dat  feel 

'It  seemed  to  him  as  though  someone  had  come' 

(45)  harm  ghar     lagyau 

we        house   set  out-pst-1  P 
'We  set  out  for  home" 

In  contrast,  VPs  which  have  as  a  constituent  a  member  of  the  class  of  EQUI  verbs  do  assign 
6-rotes  to  their  subjects  in  basic  clauses  (46a)  and  EQUI  clauses  (46b).  However,  in  clauses  such 
as  (47),  where  the  EQUi  complement  is  a  dative-subject  clause,  no  8-role  can  t>e  assigned  to  the 
matrix  subject  position,  or  the  8-Crtterion  wouW  be  violated.  Thus,  both  subject-raising  Hgou 
and  the  EQUi  veitss  do  not  appear  to  assign  8-roles  according  to  the  predicttons  of  the  GB 
analysis. 

(46)  (a)   ram-te     kam  thalyo 

Ram-erg  work   start-pst-3MS 
'Ram  started  the  work' 
(b)  ranvie     kam  gama  khojyo 
Ram-erg  work    do-lnf   try-pst-3MS 
'Ram  tried  to  do  the  work' 

(47)  ma-lai    rames-ko      ghrina-lagna  khojyo 
me-dat   Ramesh-gen    hate-feel-inf      try-pst-3MS 
'I  tried  to  hate  Ramesh' 

Two  alternative  solutions  for  lagnu  present  themselves.  First,  examples  (44-45)  suggest 
that  there  are  two  forms  of  lagnu  in  the  lexteon  of  Nepali.  Hflou-A  is  a  motion  verb  like  iinu  "to 
go',  which  assigns  a  8-role  to  Ks  subject.  lagnu-B.  on  the  other  hand,  occurs  in  the  examples  of 
(44)  and  assigns  no  S-role  to  its  subject,  as  predicted  by  the  GB  analyses  of  HODU  complement 
constructk)ns  and  dative-subject  clauses. 

A  second  possibility  is  that  (45)  is  a  subject-raising  constmction_with  D-  and  S-structures  as 
in  (48).  This  subject-raising  analysis  requires  a  non-phonetic  verb  JANU  which  does  not  appear 
in  PF.  However,  the  VP  including  JANU  woukJ  assign  a  8-role  to  liami  in  (48),  as  in  other  subject- 
raising  clauses. 

(48)  (a)  [NPe]  +INFL  S[S(tiani  -iNFL  VP  [  Qbar  J  ana  )  i  ]  iao- 

(b)   Han    +INFL  S(     [t]  -INFL  VP  [  gbaf  JANA  ]      ]  la^ 

•We  are  beginning  to  go  toward  home'  (d.  (45)) 
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Clearly,  more  data  on  such  sentences  would  be  necessary  before  evaluating  either  of  these 
alternatives  as  a  solution  to  the  distinction  between  the  examples  in  (44)  and  (45). 

It  is  unlikely  that  there  are  two  forms  of  each  Nepali  EQUI  verb  to  account  for  the  distinction  in 
(46)  and  (47).  Therefore,  an  analysis  of  these  examples  nrxjst  provide  a  unitary  account  of 
8-marking. 

One  solution  would  be  to  treat  Vand  VP  as  equivalent;  that  is,  if  an  NP.  such  as  an  experi- 
encer  NP,  receives  a  8-role  through  V,  then  the  VP  of  that  clause  assigns  no  e-role  to  the  sub- 
ject position.  Thus,  in  (46),  the  VP  assigns  a  S-role  to  the  subject  position;  but  in  (47),  no  e-role 
is  assigned  to  the  subject  position.  Rather,  in  (47),  the  V  assigns  a  6-role  to  the  experiencer 
ma-lai. 

We  could  extend  this  analysis  of  8-marking  beyond  the  data  in  (46)  and  (47)  to  account  for 
the  contrasts  in  (49-51).  In  each  of  these  pairs,  either  the  nominative  subject  would  receive  a 
8-role  through  VP,  or  the  experiencer  NP  would  receive  a  B-roie  through  V.  Experiencer  NPs 
like  those  in  the  (a)-sentences  of  (49-51)  behave  very  similarly  to  nominative  NPs  like  those  in  the 
corresponding  (b)-sentences  (Wallace  l985:Chap.lV^  An  assumption  that  in  each  example  only 
one  "subject-like"  8-role  would  be  assigned  by  VP  or  V,  either  to  the  nominative  NP  or  to  the 
experiencer  NP,  may  account  for  these  similarities,  especially  for  LF  properties. 

(49)  (a)   ma-lai  bhok     lagcha 

me-dat  hunger  feel-prs-SMS 
"I  am  getting  hungry' 
(b)  ma      bhokaunchu 
l-nom    hunger-pres-IS 
'I  am  getting  hungry* 

(50)  a)    ma-lai  jaro    cha 

me-dat  fever   be-prs-3MS 
"I  have  a  fever" 
(b)  ram         birarri  parcha 

Ram-nom  sick       befali-prs-SMS 
'Ram  is  getting  ill' 

(51)  (a)   keTHai  khusi        cha 

girld-dat  happiness  be-prs-3MS 
The  girl  is  happy" 
(b)  keTi        ramri   che 

girid-nom  pretty   be-prs-3FS 
The  girl  is  prett/ 

This  analysis  of  8-roles  also  suggests  a  distinction  between  the  dative-marked  NPs  in  (52a) 
and  (52b).  In  the  analysis  of  dative-subject  clauses,  we  have  assumed  that  experiencer  NPs 
(ram-lai  in  (52a))  were  assigned  dative  Case  by  V,  just  as  are  indirect  objects  (ram-lai  in  (52b)). 
Yet,  the  8-roles  of  these  NPs  are  distinct,  and  their  control  properties  at  LF  are  distinct  as  well. 
Experiencer  NPs  behave  like  nominative  NPs,  while  indirect  objects  behave  like  direct  objects 
(Wallace  1985:Chaps.  Ill  and  IV). 

(52)  (a)   ram-lai    bhok-lagyo 

Ram-dat    hunger-feel-pst-3MS 
•Ram  got  hungry' 
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(b)  mai-le  ram-)ai    oThi  paThae 
l-erg     Ram-dat  loner   send-pst-IS 
•|  sent  a  letter  to  Ram' 

This  analysis  suggests  that  these  two  classes  of  NPs  are  stmcturaily  more  dissimilar  than 
similar.  For  example,  indirect  objects  may  be  subcategorized  by  appropriate  verbs,  and  assigned 
inherent  Case  and  a  8-role  by  the  verb,  as  in  (53a)  (Chomsky  1982:1 70-72^.  On  the  other  hand, 
experiencer  NPs  may  appear  in  a  different  structure  where  the  VP  includes  V  as  in  (53b).  In  such 
a  stnjdure,  the  experiencer  NP  receives  dative  Case  from  its  governor  V,  and  it  receives  a  8-role 
from  V  as  well.  Unlike  in  (53a),  in  (53b)  VP  does  not  assign  a  8-role  to  the  subject  posttion._Thus, 
assignment  of  8-roles  to  subjects  by  VP  and  assignment  of  8-roles  to  experiencer  NPs  by  V 
woukj  be  in  complementary  distribution. 

(53)  (a)   [NPnom]  -kINFL  VP[  [NPdat]  [NPobj]  dJDU  (logive')] 

(b)  [NPe]        -HiNFL   VP[  [NPdat]  V  [bdOfcclaODU  (1o  get  hungr/)  ]  ] 

This  would  account  for  the  contrast  in  the  EQUI  verb  clauses  (46-47).  In  (46),  the  VP  of 
which  the  EQUI  verb  is  a  constituent  assigns  a  S-role  to  the  subject  position.  In  (47),  the  experi- 
encer NP  receives  a  8-role,  and  so  the  subject  position  does  not.  Recall  that  S-pruning  makes 
the  complement  verb  phrase  a  constituent  of  the  matrix  clause,  and  so  the  matrix  clause  would 
be  bound  by  that  S-role  assignment. 


NOTES 

*The  following  abbreviations  are  used  in  the  data: 


M 

masculine 

F 

feminine 

nom 

nominative 

erg 

ergative 

obj 

objective 

dat 

dative 

obi 

oblique 

gen 

genitive 

S 

singular 

P 

plural 

C 

Clark,  T.  W. 

SK 

Aryala.  Dha 

pre 

present 

pst 

past 

ps 

passive 

inf 

infinitive 

PP 

perfective  participle 

ger 

gerundive 

hon 

honorific 

Q 

question  particle 

neg 

negative  marker 

Pl 

plural  marker 

Clark,  T.W.  1963.  Introduction  to  Nepali.  Cambridge:  Heffer&Sons. 
Aryala,  Dhairav  (ed.)  1968.  Sajha  katha.  Kathmandu:  Sajha  Prakashan 

^The  dictionary  form  of  the  Nepali  infinitive  has  the  suffix  -nu.  The  Infinitive  in  -qu  may  be 
used  as  an  NP;  -nl  is  the  oblique  form  of  this  nominal  use.  Further  information  on  Nepali 
complement  stmctures  may  be  found  in  Wallace  1985:Chap.ll  and  in  Wallace  1984. 

^Nepali  EQUI  constructions  are  discussed  in  detail  in  Wallace  1985.  The  discussion  of 
Nepali  EQUI  and  subject-raising  constmctions  in  section  I  of  this  paper  is  based  on  the  inform- 
ation in  Wallace  1985. 

-^It  may  be  that  verbs  of  the  EOUl  class  have  various  syntactic  distinctions  an[x>ng  them- 
selves. I  shall  not  explore  such  distinctions  in  this  paper. 
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"^lagnu  has  various  uses  in  Nepali,  several  of  which  are  illustrated  in  this  paper.  In  addition  to 
being  used  in  the  subject-raising  constmction,  lagnu  appears  as  the  verb  in  dative-subject 
clauses  and  in  clauses  with  finite  complements. 

^Neither  construction  allows  an  obligational  clause  as  a  complement;  cf.  (a)  and  (b).  (See 
Wallace  1985:Chap.V.) 

(a)  *uniharu-lai  din-din  kam  gamu  pama      thalyo 
they-pl.dat      day-day  work   do-ger  befall-inf  start-pst-3MS 
They  started  to  have  to  work  every  day' 

(b)  *keTa-lai  skul      janu     pama     lagyo 
boy-dat        school  go-ger   befall-inf  begin-pst-3MS 
The  boy  was  beginning  to  have  to  go  to  school' 

^Throughout  this  paper  I  shall  assume  that  the  sensation  NP  (bhok  in  (9b))  is  not  the 
grammatical  subject  of  dative-subject  clauses  such  as  (9b).  The  sensation  NP  plus  verb  phrase 
appears  to  be  an  idiom  in  Nepali.  See  the  analysis  of  dative-subject  clauses  in  Wallace  1985: 
Chap.  IV. 

^This  is  clearly  not  always  true,  cf .  (6).  Further  discussion  of  this  can  be  found  in  part  V  of 
this  paper. 

Swailace  1984  discusses  this  distinction  in  a  Relational-Grammar  analysis. 

^An  'intransitive'  dative-subject  clause  is  one  which  includes  as  constituents  only  an  experi- 
encer  NP  and  a  sensation  NP.  A  'transitive'  dative-subject  clause  includes  another  NP,  which 
may  appear  as  a  nominative  NP  or  within  a  prepositional  phrase.  Nepali  makes  several  important 
distinctions  between  these  two  types  of  clauses.  See  Wallace  1985:Chap.  IV. 

■"'The  discussion  in  part  II  is  based  on  Chomsky  1982.  See  also  Wallace  1985:Chap.X. 

■•  ■'Chomsky  (1982:170-72)  discusses  an  alternative  rule  for  dative  Case  marking:  An  NP  is 
inherently  Case-marked  as  determined  by  properties  of  its  [-N)  governor.  The  usefulness  of  this 
mie  is  discussed  in  part  V  of  this  paper. 

■•^The  analysis  in  this  section  is  leased  on  Wallace  1985:Chap.  IX.  For  further  information 
see  Wallace  1984. 

■'3see  part  V  for  discussion  of  examples  where  this  statement  is  not  true.  Some  EQUI  vertjs 
do  not  assign  B-roles  to  the  subject  position. 

■•^Sensation  NPs  plus  dative-subject  verbs  appear  to  form  an  idiom  in  Nepali  and  are  there- 
fore treated  as  units.  Chomsky  (1982:37)  refers  to  NPs  in  such  idioms  as  'quasi-arguments'  and 
assumes  they  have  special  rules  for  S-marking.  They  are  also  probably  exempt  from  the  Case 
Filter,  or  receive  some  special  Case.  The  rrrotivation  for  this  analysis  is  discussed  in  Wallace 
1985. 

^^See  part  V  for  discussion  of  6-role  and  Case  assignment  to  experiencer  NPs. 
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CONSONANT  MERGER  IN  NAVAJO: 

AN  UNDERSPECDFIED  ANALYSIS 

J.  Eraser  Bennett* 


The  Navajo  d-effect  is  a  phenomenon  of  consonant  merger  in 
which  d  is  fused  with  the  initial  consonant  of  an  adjacent  verb  stem.  The 
peculiar  forms  of  these  fused  consonants  may  be  expained  within  the 
framework  of  underspecification  theory:  when  consonants  are  viewed  as 
being  maximally  underspecified  in  underlying  representation,  the  facts  of 
the  d-effect  and  related  consonant  mergers  follow  naturally  from  the 
underlying  representations,  a  small  number  of  constraints  on 
representations,  and  facts  about  Navajo  syllable  structure.  While 
accounting  for  the  d-effect,  this  analysis  has  implications  for  larger 
questions  of  Navajo  verbal  morphology,  syllabification,  and  constraints  on 
representations. 


1.  Introduction:  the  d-effect. 

Vowel  merger-for  example,  the  creation  of  [e]  from  the  fusion  of  [i]  and  [a]~is  a 
common  process  among  languages  which  is  currendy  the  subject  of  a  significant  amount 
of  research  within  the  framework  of  autosegmental  phonology  (Goldsmith  1985,  Kaye  et 
al.  1985,  Sohn  1987).  This  paper  presents  evidence  that  Navajo  exhibits  a  similar 
process  of  merger,  known  as  the  d-effect  in  previous  descriptions  of  the  language.  This 
merger  process  affects  not  vowels,  though,  but  consonants,  fusing  the  underlying  verbal 
classifier  prefix  d  with  the  immediately  following  consonant. 

In  this  paper  I  analyze  the  d-effect  from  the  standpoint  of  underspecification 
theory,  and  argue  that  the  results  of  the  d-effect  mergers  follow  naturally  from  the 
underspecified  nature  of  Navajo  consonants  in  underlying  representation  (UR).  And 
while  the  d-effect  itself  has  a  limited  distribution  (involving  only  one  prefix  in  the 
intricate  Navajo  verbal  morphology),  the  analysis  developed  for  the  d-effect  merger 
insightfully  accounts  for  similar  effects  triggered  by  the  1st  dual  subject  prefix,  as  well  as 
for  a  third  class  of  consonant  mergers  termed  "devoicing  effects"  in  previous  work. 

The  results  of  this  merger  appear  to  be  extremely  unpredictable.  That  is,  in  the 
examples  of  the  d-effect  shown  in  (1),  the  merging  of  d  (phonetically  an  unaspirated 
voiceless  alveolar  stop)  with  a  glottal  stop  in  (la)  results  in  a  glottalized  alveolar  stop,  in 
some  sense  a  combination  of  the  two  stops.  Yet  when  d  combines  with  the  fricarives  gh 
and  z  in  (Ib-c),  the  result  is  in  one  case  a  stop,  and  in  the  other  case  an  affricate.  In  (Id) 
the  combination  of  d  with  a  stop  produces  no  visible  effect. 


(1)  Examples  of  the  d-effect^ 

a.  d  +  ?  -->  t' 

b.  d  +  gh  -->  g 

c.  d  +  z  -->  dz 

d.  d  +  b  -->  b 

Sohn's  (1987)  analysis  of  vowel  merger  in  Korean  notes  that,  in  the  case  of  vowel 
merger,  it  is  impossible  for  any  system  which  uses  fully  specified  feature  matrices  to 
extract  the  features  necessary  to  predict  the  result  of  the  merger,  say,  of  [i]  and  [a]  as  [e], 
without  stipulation.  The  same  is  true  of  consonant  merger  in  Navajo,  as  can  be  seen  from 
(1):  if  fully  specified  feature  matrices  arc  to  be  merged,  it  must  be  stipulated  which 
features  are  to  be  retained  in  the  fused  form.  Furthermore,  such  stipulation  must  be  made 
for  each  instance  of  merger.  So  in  (Ib-c),  the  outcome  of  the  merger  is  in  one  case  a 
stop,  which  "keeps"  [-continuant]  from  d  and  "discards"  [-continuant]  from  gh;  in  the 
other  case  an  affricate  results,  in  which  both  [-continuant]  and  [+continuant]  are  retained. 
Moreover,  in  (la),  the  features  for  place  of  articulation  of  the  first  element,  d,  are 
retained;  in  (lb),  though,  it  is  the  place  of  articulation  of  the  second  element  g  which 
appears  in  the  fused  form. 

I  would  claim,  though,  that  Navajo  consonants  are  not  underlyingly  fully 
specified  feature  matrices.  Rather,  they  are  highly  underspecified  in  UR,  and  the 
consonant  which  results  from  the  fusion  of  two  consonants  is  simply  the  result  of 
combining  their  underlying  forms;  that  is,  the  underlying  form  of  a  fused  consonant  is  the 
combination  of  the  underlying  forms  of  the  two  consonants  which  were  merged.  To 
show  how  this  is  so,  it  will  first  be  necessary  to  sketch  the  main  points  of 
underspecification  theory,  as  outlined  in  Archangeli  and  Pulleyblank  (1986);  I  will  then 
describe  the  relevant  portions  of  Navajo  phonology  and  Navajo  verbal  morphology. 
After  a  fuller  exposition  of  the  relevant  data  and  a  consideration  of  earlier  accounts,  I  will 
present  a  system  of  underspecified  feature  matrices  for  Navajo  consonants,  and  show 
how  the  majority  of  the  d-effect  mergers  may  be  accounted  for  in  this  scheme.  At  the 
same  time  I  will  show  how  this  account  of  the  d-effect  also  describes  the  mergers 
associated  with  the  1st  dual  subject  prefix.  Lastly,  I  will  consider  how  this 
underspecified  system,  together  with  facts  about  Navajo  syllable  structure  and 
restrictions  on  possible  representations,  allows  an  account  of  four  sets  of  mergers  which 
might  appear  problematic  for  this  theory  of  consonant  mergers,  including  the  interactions 
of  the  1st  dual  prefix  with  the  verbal  classifier  prefixes,  and  the  so-called  "devoicing 
effects." 

2.  Underspeciflcation  theory. 

Underspecification  theory  aims  to  drastically  reduce  the  amount  of  information 
carried  by  segments  in  UR  by  eliminating  redundant  feature  specifications.  In  this  way  it 
attempts  to  give  an  insightful,  principled  account  of  a  variety  of  phonological  processes, 
for  the  impoverished  underlying  representations  ought  to  severely  constrain  the  range  of 
possible  rules.  As  noted  above,  I  will  be  using  the  version  of  under- specification  theory 
developed  in  Archangeli  and  Pulleyblank  (1986). 

I  also  adopt  the  Nondistinct  Marking  Convention  of  Sohn  (1987),  which  states 
that  "binary  use  of  a  feature  is  prohibited  in  the  underlying  representation."  That  is,  only 
one  value  for  any  feature  is  used  in  Underlying  Representation:  [+  F]  or  [-  F]  may 
appear,  never  both.  This  increases  the  poverty  of  the  representations  and  renders 
segments  only  "potentially  distinct,"  since  there  are  no  conflicting  feature  values  in  UR. 


Segments  become  actually  distinct  only  in  the  course  of  the  derivation,  when 
redundancy  rules  fill  in  the  values  of  features  for  which  the  underlying  representations 
are  unspecified.  Archangel!  and  Pulleyblank  (1986)  propose  that  most  redundancy  rules 
are  not  language  specific,  but  given  by  Universal  Grammar.  However,  a  language  may 
select  the  opposite  value  as  the  underlying  value  for  a  feature,  in  which  case  the 
appropriate  redundancy  rule  is  automatically  formed  by  a  principle  of  Complement  Rule 
Formation. 

Thus,  if  a  particular  feature  [F]  has  a  value  [+  F]  in  underlying  representation, 
there  is  no  contrasting  value  [-  F]  in  UR.  Furthermore,  there  will  be  a  context-free 
redundancy  rule-either  a  default  rule  given  by  Universal  Grammar,  or  a  Complement 
Rule  of  the  form  /  ]-->[-  F7--which  will  provide  the  default  value  at  some  later  point  in 
the  derivation.  Due  to  the  interaction  of  the  Default  Ordering  Principle  and  the 
Redundancy  Rule  Ordering  Constraint  proposed  in  Archangeli  and  Pulleyblank  (1986), 
this  redundancy  rule  will  activate  [-  F]  at  the  stratum  at  which  phonological  rules  first 
make  reference  to  the  value  [-  F].  These,  then,  are  some  of  the  basic  insights  of 
underspecification  theory:  radical  underspecification  of  underlying  forms,  the  potential 
distinctiveness  of  segments,  redundancy  rules.  Other  relevant  portions  of  the  theory  will 
be  introduced  as  needed. 

3.  Navajo  Phonology 
3.1.  Navajo  Consonants. 

In  this  section  I  will  briefly  describe  the  inventory  of  consonants  in  Navajo,  the 
allowable  syllable  types,  and  the  basic  structure  of  the  Navajo  verb,  the  context  in  which 
the  d-effect  appears.  In  most  respects  I  will  follow  Wright  (1984). 

The  Navajo  consonant  inventory  is  given  in  (2): 

(2)  Navajo  Consonant  Inventory  (from  Wright  1984) 


Voiceless 

Voiceless 

Voiceless 

Unaspirated 

Aspirated 

Glottalized 

VL 

VD 

Stop/Affric. 

Stop/Affric. 

Stop/Affiic. 

Cont 

Cont 

Labial 

b 

Alveolar 

d 

t 

t* 

dz 

ts 

ts* 

s 

z 

Lateral 

dl 

tit 

tf' 

+ 

Palatal 

j 

ch 

ch' 

sh 

zh 

Velar 

g 

k 

k' 

X2 

gh 

Glottal 

7 

h 

Nasals 

Glides:  y,  w,  'y,  'w 

0. 

Glott. 

Labial 

m 

'm 

(<'>  indicates  glottal- 

Alveolar 

n 

'n 

ization  throughout.) 

It  is  imperative  that  the  series  of  unaspirated  non-continuants-orthographically  b,  d,  g, 
dz,  dl,  y-not  be  mistaken  for  voiced  stops  and  affricates.  They  are  phonetically  voiceless. 
Since  the  vowels  are  not  relevant  to  the  process  of  consonant  merger,  I  will  not  treat 
them.  (But  note  that  high  tone,  which  is  contrastive  in  Navajo,  is  marked  as  an  acute 
accent,  e.g.  d,  and  that  long  vowels  are  marked  as  a  sequence  of  two  vowels.  Also, 
nasalization  of  vowels,  also  distinctive,  is  marked  by  a  cedilla,  e.g.  ^.) 


3.2.  Navajo  Syllables. 

(3)  shows  the  three  basic  syllable  types  found  in  Navajo: 

(3)  Basic  Navajo  Syllable  Types 

cvc 
cw 
cwc 

This  admittedly  odd  inventory  of  syllable  types  does  not  describe  the  situation  entirely. 
CV  syllables  are  found  in  certain  highly  restricted  environments,  as  are  V  and  W 
syllables,  and  certain  syllabic  consonants.  These  exceptional  types  are  restricted  to 
word-final  position,  though,  and  since  the  d-effect  occurs  word-internally  I  will  not  deal 
with  them.  (There  will  be  one  occasion  where  a  V  syllable  will  bear  on  the  d-effect.) 

It  is  important  to  note  that  while  every  consonant  in  (2)  may  be  syllable-initial, 
only  the  alveolar  continuants  and  [h]  may  generally  end  a  syllable. 

(4)  Possible  Codas 

s,  z,-!-,  1,  sh,  zh,  h 

d,  n,  g,  and  .'  may  also  be  codas,  but  only  in  certain  word-final  contexts.  Again,  since  the 
d-effect  occurs  word-intemally,  these  exceptional  codas  are  not  relevant  here. 

33.  The  Navajo  Verb. 

The  Navajo  verb  has  developed  something  of  a  reputation  for  complexity,  and 

deservedly  so.  Each  verb  consists  of  a  stem:  a  verb  classifier  prefix,  which  may  modify 
the  lexical  meaning  of  the  stem  or  change  the  voice  of  the  verb;  and  a  variety  of  prefixes 
which  express  subject  marking,  aspect,  mode,  and  tense,  and  other  functions.  The 
prefixes  have  different  properties  with  respect  to  phonological  rules;  Stanley's  (1969) 
analysis  posits  a  large  number  of  verb-internal  boundaries  and  12  pre-verbal  "slots"  for 
prefixes  to  account  for  this,  while  Kari  (1976)  reduces  the  number  of  pre-verbal  slots  to 
10  and  divides  the  prefixes  into  "disjunct"  (slots  1-3)  and  "conjunct"  (slots  4-10).3 
Wright's  (1984)  modification  of  Kari's  system  is  given  in  (5)*. 

(5)  Navajo  Verbal  Prefixes 

Disjunct  Conjunct 

ADV-ITER-PLUR      #  OBJ-DEIC-ASP-MODE-PERF-SUBJ-CLASS 

12  3  4        5        6  7 

di        0 

hi         ni 

si 

o 

I  have  reproduced  only  those  prefixes  in  Wright  (1984)  which  are  relevant  to  forms  to  be 
examined,  which  show  the  alternations  involving  the  subject  and  classifier  prefixes 
(positions  9  and  10). 


BJ-CLASS 

[VERB 

9             10 

Stem 

sh          0 

ni          i 

iid          1 

6h          d 

4.  Thed-effect 

4.1.  The  d-classifler 

Navajo  does  not  retain  the  Athabaskan  verbal  classifier  prefix  d  in  surface  forms, 
although  related  languages  such  as  Carrier  do  (see  Young  and  Morgan  (1980),  p.  355). 
However,  it  is  relatively  clear  that  that  verbs  which  on  the  surface  appear  to  have  no 
verbal  classifier  do  not  have  a  "^-classifier"  (the  Athabaskan  verbal  classifier  with  no 
phonological  content)  in  UR.  That  is,  it  may  be  seen  that  the  ^/-classifier  remains  in 
Navajo  underlyingly  because  of  the  "d-effecL"  In  Young  and  Morgan  (1980),  the  "d- 
effect"  refers  to  the  alternations  in  the  stem-initial  consonant  of  verbs  when  the  verbs 
take  certain  prefix  paradigms  (paradigms  distinct  fi-om  those  of  the  j0'-classifier,  in  which 
no  alternation  is  seen).  As  Coleman  (1976)  points  out  in  her  analysis  of  Dogrib  (a 
Northern  Athabaskan  language),  to  treat  these  alternations  as  suppletions  would  be  to 
miss  a  large  generalization.  Moreover,  these  alternations  are  identical  to  the  alternations 
conditioned  by  die  1st  dual  subject  prefix  when  it  is  adjacent  to  die  verb  stem.  While  it 
would  be  possible  to  synchronically  narrow  the  field  of  possible  underlying  prefixes,  as 
Coleman  does  in  Dogrib,  I  will  simply  note  that  in  Coastal  Athabaskan  languages  such  as 
Carrier,  which  do  not  show  this  alternation,  the  verbal  classifier  is  visible  on  the  surface 
asd. 

(6)  Navajo  and  Carrier  d-classifier 

Carrier:  nahadna  'it  moves' 

Navajo:  naha'na  'it  moves' 

4.2.  d-efTects:  classifier  prefix,  1st  dual  prefix,  "non-efTects." 

Actually,  since  the  d-effect  leaves  no  trace  on  some  consonants,  as  illustrated  in 
(Id),  the  characteristic  paradigms  associated  with  the  rf-classifier  are  the  only  indication 
of  a  d-classifier  in  some  verbs.  These  initial  consonants  on  which  an  underlying  d- 
classifier  has  no  surface  effect- "non-effects,"  one  might  call  them-are  given  in  (7),  as 
listed  in  Young  and  Morgan  (1980).  (Young  and  Morgan  (1980),  being  a  descriptive 
structuralist  grammar,  simply  lists  the  effects  of  the  d-classifier  on  the  following  initial 
consonant  of  a  verb  stem.) 

(7)       Stem-initial  Vowels  Showing  No  d-effect 


Merger 

[Classifier            Phonetic 
rSteml                 Repr. 

Gloss 

d  +  b     - 

->     b 

[d[b^>->?adaah  yoo^' 

'he  stripped  them  off 
himself 

dz 

dz 

[d[dzii']->haasdrif? 

'I  spoke' 

ch' 

ch' 

[d[ch'id]->?adfc/j'id 

'he's  scratching 
himself 

t' 

t' 

[d[t'eesh]">?idfr'eesh 

'he's  blackening  himself  with 
charcoal' 

d  +  j 

g 


[d(jih]-->?^deeshyih 

[d[gish]-->?^eeshgish 

[d[kaad]-->nd?iidoo/:aad 


'he  grabbed  himself 
'he  cut  himself 
'he  slapped  himself 


It  is  immediately  apparent  that  all  the  consonants  in  (7)  are  stops  or  affricates.  That  is, 
all  the  consonants  which  are  not  affected  by  a  preceding  d  are  non-continuants.  In 
contrast,  consonants  which  are  affected  by  a  preceding  rf-classifier  are  all  continuants, 
with  the  exception  of  ?. 

(8)-(12)  present  the  effects  of  the  rf-classifier  and  the  1st  dual  subject  prefix  on  the 
initial  consonant  of  a  verb  stem  in  parallel  fashion.  (8a)-(12a)  illustrate  the  d-effects  as 
listed  by  Young  and  Morgan  (1980),  and  (8b)-(12b)  illustrate  the  effects  associated  with 
the  1st  dual  subject. 


(8) 
d  +    1 

-->    dl 

[d[166sl      --> 

yidl66s 

'it  is  being  led  along' 

d  +    1 

dl 

[dmw] 

siirf/g? 

'we  became' 

(9) 
d  +gh 

-->     g 

[d[gh^dd] 

yigiM 

'it's  being  shaken"* 

gh 

g 

[dcgiighddd] 

yiigi^ 

'we  shook  it' 

(10) 
d  +   z 

-->    dz 

[d[zfid] 

?6di  yayiidzifd          'he's  pouring  it  for 

dz  d[0[zin] 


nii^zin 


'we  think' 


a. 

d  +zh 

-->      J 

[d[zhi?] 

y66jil 

'he  was  called' 

b. 

zh 

J 

d[0[zhee?] 

adiiyee? 

'we  spit' 

a. 

(12) 
d  +   ? 

->     t' 

[d[?|] 

yir'( 

'he's  visible' 

b. 

7 

t' 

dm'm 

yur'dil 

'we're  carrying  it 
along' 

A  few  observations  about  the  above  forms:  although  the  result  of  most  of  the 
mergers  (8,10,11)  are  phonetically  affricates,  (9)  and  (12)  (repeated  from  (lb)  and  (la), 
respectively)  are  not-they  are  stops;  yet  one  is  unaspirated,  the  other  voiceless.  (9)  and 
(12)  are  also  exceptional  as  regards  place  of  articulation:  while  most  of  the  merged 
consonants  are  alveolars,  these  two  in  some  sense  retain  the  place  of  articulation  of  the 
second  element-completely  with  g,  in  some  sense  along  with  alveolar  d  in  the  case  of  t'. 
Let  us  now  consider  how  these  forms  have  been  accounted  for  in  the  past,  and  how  they 
may  be  accounted  for  in  the  framework  of  underspecification  oudined  in  Section  2. 


4.3.  Past  Accounts  of  the  d-effect. 

Previous  accounts  of  the  d-effect  within  the  generative  framework  have  attempted 
to  deal  with  it  in  various  ways.  Kari  (1976),  working  within  the  "linear"  framework  of 
Chomsky  and  Halle  (1968),  proposes  a  rule  which  is  hierarchical  in  nature;  that  is,  it 
selectively  applies  to  change  certain  features  first,  then  others  if  the  values  of  the 
preferred  features  do  not  meet  the  structural  description  of  the  rule.  Kari's  rule  also  has  a 
transformational  component  which  deletes  the  d  if  no  change  has  been  made  (to  account 
for  the  forms  in  (7)  above).  While  Kari's  rule  may  seem  extremely  arbitrary  and 
stipulative,  the  SPE  framework  allows  no  other  way  to  describe  the  d-effect. 

Working  within  an  autosegmental  framework,  Wright  (1984)  gives  an  insightful 
and  interesting  account  of  the  d-effecL  Wright  proposes  a  rule  which  would  add  an 
association  line  from  the  d  to  the  following  consonant,  as  in  (13).  In  this  way  she 
captures  the  idea  that  the  d-effect  is  a  combination  of  c/  with  the  following  consonant. 

(13)        d+l->dl(as in  Wright  1984) 

d  1 

c  c 

Wright's  analysis  suffers,  though,  because  she  is  working  with  an  assumption  of  fully 
specified  feature  matrices.  She  must  therefore  make  stipulations  of  preferred  features  in 
order  to  predict  the  phonetic  results  of  the  doubly  linked  C-s1ol5  However,  Wright  does 
recognize  the  basic  similarity  of  the  d-effect  and  the  effects  associated  with  the  1st  dual 
prefix;  she  also  links  to  the  analysis  of  the  d-effect  the  so-called  "devoicing  effects"  (to 
be  introduced  in  Section  5.3),  a  set  of  forms  which  show  not  only  devoicing,  but  also 
merger  in  some  cases.  In  this  way  her  work  serves  to  define  the  body  of  data  which  an 
adequate  account  of  the  d-effect  must  cover. 

5.  Underspecified  Navajo. 

Wright's  (1984)  insight  that  the  d-effect  is  merging  by  autosegmental  spreading  is 
a  valuable  one.  However,  I  propose  that  instead  of  positing  a  hierarchical  stipulation  of 
which  features  will  dominate  in  an  affricate-structure,  the  consonant  inventory  of  Navajo 
be  defined  in  terms  of  the  radical  underspecification  demanded  by  underspecification 
theory.  Now,  underspecification  theory  predicts  that  when  two  phonemes  merge,  the 
product  ofthe^erger  will  be  the  segment  which  is  represented  in  UR  by  the  combination 
oUtheirjEeaSrc-specifications  (see  Sohn  1987).  I  therefore  turn  to  the  problem  of 
defining  a  maximally  underspecified  characterization  of  the  Navajo  consonants. 

5.1.  Feature  Defaults  and  the  Consonant  Inventory. 

Archangeli  and  Pulleyblank  (1986)  propose  a  set  of  universal  default  values  for  features. 
In  themselves  these  defaults  negatively  define  underlying  values  for  features,  and  I 
propose  that  Navajo  consonants  arc  defined  in  terms  of  these  underlying  values  with  few 
exceptions.  A  thorough  argument  for  the  particular  feature  values  which  Navajo  uses  in 
UR  would  be  a  task  far  greater  than  present  space  allows,  and  would  require  an 
investigation  of  a  wide  range  of  phonological  processes.  Instead,  I  will  present  a  set  of 
underlying  feature  values  which  will  allow  us  to  account  for  the  d-effects  described 
above,  and  then  show  how  we  may  account  for  the  interactions  between  -iid-  and  the 
verbal  classifiers,  the  "devoicing  effects,"  and  a  number  of  more  problematic  d-effects 
within  the  frame  of  these  underspecified  representations.  I  will  take  a  simple  solution  to 
the  problem  of  the  d-effect  to  be,  if  not  proof,  substantiation  of  the  choices  of  UR  feature 


values.  (14)  gives  default  values  for  the  features  distinctive  in  the  consonants,  and  (15) 
presents  a  chart  giving  underspecified  feature  matrices  for  the  Navajo  consonants. 

Several  features  of  (14-15)  require  comment  First,  1  have  not  included  the 
features  [sonorant]  and  [distributed],  as  it  is  not  clear  that  are  distinctive  in  Navajo 
consonants.  Secondary  place  features  such  as  [high]  and  [back]  are  not  included  for  the 
same  reason. 

Second,  affricates  (such  as  ts,  dl,  and  ch')  are  not  represented  as  doubly  linked 
structures,  but  as  single  feature  matrices.  In  fact,  underspecification  theory  demands  this, 
for  in  a  doubly  linked  structure  there  must  be  a  sequence  of  features  with  distinct  values— 
for  instance,  [-continuant]  followed  by  [+continuant]6.  However,  since  the^Nondistinct 
Marking  Condition  of  Sohn  (1987)  prohibits  binary  use  of  a  feature  in  UR,  such 
sequences  are  ill-formed,  at  least  at  deeper  levels  of  the  phonology.  Affricates  must 
therefore  be  represented,  at  least  in  UR,  as  single  matrices.  Third,  the  laryngeals,  ?  and 
h,  have  specifications  only  for  laryngeal  features. 

Fourth,  Navajo  takes  [+anterior]  and  [-i-coronal]  as  default  values.  The  Navajo 
default  values  for  [anterior]  and  [coronal]  thus  differ  from  the  universal  default  features 
which  Archangeli  and  Pulleyblank  (1986)  propose  (namely,  [-anterior]  and  [-coronal]). 
However,  it  should  be  noted  that  [+anterior]  and  [+coronal]  are  the  redundant  featiu^ 
values  in  other  theories  of  redundancy,  such  as  Chomsky  and  Halle  (1968).  In  any  case, 
the  particulars  of  which  feature  values  are  defaults  in  Navajo  does  not  affect  the  validity 
of  the  major  proposals  of  underspecification  theory.  What  is  important  is  that  the  system 
of  defaults  in  Navajo  insightfully  account  for  empirical  data. 

Lastly,  there  is  a  special  redundancy  rule  which  operates  to  increase  the  degree  of 
underspecification  in  (15).  In  Navajo,  no  segment  which  is  [-continuant]  is  also  voiced. 
We  can  therefore  leave  every  [-continuant]  segment  unspecified  for  [voice]  in  UR,  and 
fill  in  the  value  [-voice]  later  by  means  of  a  special  redundancy  rule.7 

( 1 6)  Special  Redundancy  Rule 

[   ]  ">  [-voice]  /  [ ,  -cont] 

5.2.  Underspecification  and  d-effects. 

Let  us  now  see  how  the  system  of  underspecified  consonants  outlined  above  will 
account  for  the  d-effects  of  (8)-(12).  First,  we  must  define  the  rule  responsible  for  the  d- 
effect.  Adopting  Wright's  (1984)  insight  that  the  d-effect  is  caused  by  spreading  of  d 
features  onto  the  following  consonant,  I  propose  the  following  rule: 

(17)  Consonant  Merger* 
X         [X 


t-' 


[-cont] 

(17)  merges  d  with  the  following  consonant  by  spreading  the  features  of  t/  onto  the 
following  timing  slot  and  dissociating  d  from  its  own  timing  slot.  Since  [-continuant]  is 


(14)      Underlying  feature  values  for  Navajo  consonants 
Default  features 


Present  in  UR 
Laryngeal  features 
[+voice] 
[+constricted] 
[+spread] 

Primary  Place  features' 
[-anterior] 
[-coronal] 

Manner  features 
[+nasal] 
[+strident] 
[-continuant] 
[+lateral] 


[-voice] 

[-constricted] 

[-spread] 


[-(-anterior] 
[+coronal] 


[-nasal] 
[-strident] 
[+continuant] 
[-lateral] 


(15)  Maximally  Underspecified  Navajo  Consonants  in  UR 
d  t    t 


Laryngeal 

[voice] 

[constricted] 

[spread] 

Manner 
[continuant] 
[nasal] 
[strident] 

[lateral] 

P-Place 

[anterior] 

[coronal] 


Laryngeal 

[voice] 

[constricted] 

[spread] 

Manner 
[continuant] 
[strident] 
[lateral] 

[nasal] 

P-place 

[anterior] 

[coronal] 


Stops 

b      g   k  k' 


Continuants 
s__z__t_JL__sh__zh_ 


X    gh 


16 


_1     -     - 

+ 


m 


i 


g 


VA 


resii   - 


i 


dz ts_ 


Affricates 


j    ch    ch' 


( 16_ 


Nasals 
m  'm      n    'n 


P)l       1(20)1 
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the  only  feature  of  d,  all  the  features  of  the  other  consonant,  including  place  and  manner 
features,  will  be  retained.  Furthermore,  the  result  of  (17)  is  not  a  doubly  linked  structure 
but  a  single  feature  matrix,  for  by  the  Nondistinct  Marking  Convention  of  Sohn  (1987)  no 
feature  in  the  result  of  (17)  can  have  distinct  values. 

We  are  now  in  a  position  to  test  the  proposal  I  have  laid  out.  (18)  shows  that  the 
proposal  can  handle  the  d-effects  given  in  (8)-(12).  In  each  case,  the  spreading  of  [-cont- 
inuant] (the  underlying  representation  of  d)  onto  the  adjacent  timing  slot  yields  a  feature 
matrix  which  combines  the  underlying  specifications  of  d  and  the  second  consonant 
involved  in  the  merger,  and  this  combined  feature  matrix  is  exacdy  that  of  the  consonant 
which  results  firom  the  merger. 

(18)  d-effects  with  Under  specified  Consonants. 


d 
[-cont] 


b.         d 
[-cont] 


1 
[+lat] 


gh 


dl 
+lat  1 
-cont  I 


-cont 

-ant 

-cor 


d 
[-cont] 


z 
[+str] 


dz 
-cont 
+str 


d.         d 
[-cont] 


zh 
[-ant] 


J 
-cont 
-ant 


e.         d 
[-cont] 


?     ->  t' 

[-Hconstr]         -cont 

L  -fconstrj 


So  combining  the  underspecified  feature  matrices  of  d  and  the  consonants  which  show  d- 
effects  gives  us  exactly  the  right  results-the  feature  matrices  of  the  fiised  consonants. 


5.3 


Configurational  Constraints:  Prefix  Interactions,  d-effects,  and  Devoicing 
Effects. 


While  the  system  described  so  far  is  able  to  account  for,  it  will  not  cover  a 
number  of  cases  related  to  the  more  "central"  d-effects  of  (8)-(12).  These  include  the 
interactions  between  the  1st  dual  prefix  -iid-  and  the  verbal  classifiers  /  and  /,  the  effects 
of  the  d-classifier  on  a  stem-initial  nasal,  the  "devoicing  effects"  of  Wright  (1984),  and 
some  miscellaneous  effects.  I  will  deal  with  each  of  these  in  turn,  showing  how  the 
configurational  constraints  proposed  in  Archangeli  and  Pulleyblank  (1986)  allow  us  to 
account  for  each  while  maintaining  the  framework  of  underspecification  developed  in 
Section  5.1 -5.2. 


As  a  means  of  capturing  further  redundancies  in  the  phonological  system  of  a 
language,  Archangeli  and  Pulleyblank  (1986)  propose  that  a  there  may  be  language- 
specific  constraints  on  the  content  of  feature  representations—on  which  features  must 
cooccur  within  a  segment,  and  which  cannot.  These  constraints  on  what  features  may  be 


associated  together  may  be  positive  ("c-constraints,"  which  state  that  "feature  F  can  be 
connected  to  a  segment  X  only  if  X  is  connected  to  feature  G"),  or  negative  ("c- 
constraints,"  of  the  form  "feature  F  cannot  be  connected  to  a  segment  X  if  X  is  connected 
to  feature  G").  Although  Archangeli  and  PuUeyblank  allow  that  a  configurational 
constraint  might  be  removed  at  some  (higher)  level  in  the  lexical  phonology,  they 
conceive  of  these  configurational  constraints  as  constraints  on  representations  which  hold 
at  all  earlier  levels.  I  propose  that  the  restraints  on  the  system  proposed  above  which  are 
needed  to  account  for  the  more  "problematic"  d-effects  may  be  formalized  in  terms  of 
several  configurational  constraints. 

5.3.1.   d-efTects  on  Nasals. 

Now,  let  us  consider  the  effect  of  a  d-classifier  or  the  prefix  -iid-  on  an  adjacent 
stem-initial  nasal.  In  (19)  we  see  that  the  result  of  d  merging  onto  a  stem-initial  nasal  is  a 
glottalized  nasal. 

(19)      Effects  of  d-classifier  and  1st  dual  subject  prefix  on  Stem-Initial  Nasal 


I.     d  +  m 

->   'm 

[d[maa}'] — >?ii '  maai 

'they're  being 
bolted  down' 

d  +  m 

->    'm 

[d[0[mas]~>yii'mas 

'we're  rolling  it' 

).     d+      n 

->    'n 

[d[niH->n^'nil' 

'they  were  brought 
back' 

d  +  n 

->    'n 

[d[0tml->dii'm 

'we  say  it' 

However,  this  outcome  is  not  predicted  by  the  system  of  underspecified  segments  and 
defaults  outlined  in  (14)-(15).  The  fusion  of  d  with  m  "ought"  to  be  a  consonant  which  in 
UR  combines  the  underlying  features  of  d  (i.e.,  [-continuant])  and  m  ([+nasal,-coronal]). 
In  fact,  though,  the  intersection  of  these  two  sets  conspicuously  lacks  the  feature 
[+constricted],  which  encodes  glottalization. 

I  propose  that  this  difficulty  may  be  explained  if  nasals  in  Navajo  are  regarded  as 
continuants.  (In  fact,  the  chart  of  UR  consonant  specifications  in  (15)  assumes  this.)  In 
addition,  I  propose  that  in  Navajo  there  is  a  positive  configurational  constraint  which 
interprets  a  nasal  which  becomes  non-continuant  in  the  course  of  a  derivation  as 
glottalized.  This  may  be  formalized  as  a  c-constraint  which  requires  that  any  segment 
which  is  [+nasal]  and  [-continuant]  also  be  [+constricted],  as  in  (20). 

(20)      C-constraint: 

[+constr]  must  be  attached  to  X  if  X  is  attached  to  [+nasal,  -continuant] 

So  in  the  derivation  of  ?ii'maal'(\9a),  d  fuses  with  m  by  rule  (17)  and  the  feature  [-cont] 
is  added  to  the  feature  matrix  of  m.  The  combined  underspecified  feature  matrix  is  thus 
[-cont,  +nasal,  -coronal].  This  structure  is  not  permitted  by  c-constraint  (20),  though;  by 
(20),  [+constr]  is  added  to  the  feature  matrix  of  the  fused  consonant,  and  the  segment  is 
realized  as  'm.  So  the  system  we  have  developed  so  far  can  be  constrained  just  enough  to 
account  for  the  d-effects  on  nasals  by  a  configurational  constraint.  I  will  next  describe 
how  configurational  constraints  can  account  for  the  interactions  between  the  1st  dual 
prefix  and  verbal  classifiers  and  Wright's  "devoicing  effects." 
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5.3.2.    Interactions  between  -iid-  and  Verbal  Classifiers. 

While  the  merger  rule  in  (17)  refers  to  a  morphological  bracket  be  between  d  and 
the  consonant  with  which  d  fuses,  (17)  does  not  require  this  bracket  to  be  the  verb  stem 
boundary.  This  is  because  the  1st  dual  subject  prefix  -iid-  interacts  with  the  verbal 
classifiers-^ and  /.  The  interactions  are  shown  in  (21).  However,  the  outcome  of  these 
mergers  is  not  predicted  by  the  framework  of  (14)-(15). 

(2 1 )     Interactions  between  1st  Dual  Subject  Prefix  and 
Verbal  Classifiers  I  and  \  (as  in  Wright  1984) 


.    d  +    t    --> 

1 

[dt}(ghozh]-->yii/ghozh 

'we  tickle  him' 

d  +   *    "> 

1 

[d[*[baal]-->dii/baal 

'we  hang  it' 

).    d  +    1     --> 

1 

[d[l[ghal]-->yii/ghal 

'we  eat  it  (meat)' 

d  +    1     --> 

1 

[d[l[deel]-->yii/deel 

'we  start  to  eat  it' 

The  forms  in  (20)  arc  particularly  odd,  since  the  "normal"  d-effect  between  d  and  a  lateral 
(as  in  (8))  produces  an  affricate,  dl.  Furthermore,  fusing  d  with  voiceless  f  produces  a 
voiced  /.  This  is  also  odd,  as  the  merger  between  d  and  /  in  (8)  produced  dl. 

One  of  the  central  insights  of  Wright's  (1984)  autosegmental  analysis  is  that  the 
segments  in  Verbal  Slots  8  and  9~the  subject  prefixes  and  the  verbal  classifiers-are 
assigned  to  the  same  timing  slot.  Wright  argues  that  Navajo  requires  the  verbal  stem  to 
be  preceded  by  at  least  one  syllable  to  "carry"  the  prefixes,  and  proposes  that  the 
"unmarked"  form  of  this  pre-stem  syllable  (the  form  which  appears  when,  say  only  a 
consonantal  subject  prefix  such  as  sh-  is  present  and  a  syllable  must  be  constructed)  is 
CVC.  The  prefixes  which  occupy  Slots  8  and  9  in  the  verb  are  both  assigned  to  the  final 
C-slot  of  this  pre-stem  syllable. 

Thus,  in  the  derivation  of  yiilghal  (21b),  the  segments  of  the  underlying 
representotion  are  mapped  onto  syllable  templates,  and  in  the  process,  the  underlying  d  of 
the  1st  dual  prefix  and  the  /-classifier  are  both  mapped  to  the  coda  position  of  the  pre- 
stem  syllable.  We  would  expect  this  structure  to  surface  as  dl,  just  as  the  mapping  of  the 
classifier  d  to  the  skeletal  slot  occupied  by  stem-initial  /  yielded  dl  in  (8).  (22)  shows  this 
structure.  (I  assume  that  the  onset  and  nucleus  of  the  pre-stem  syllable  are  later  filled  in 
by  rule,  as  Wright  proposes.) 

(22)        d[l[ghal]  'vj/^ff? 

^     xxxxxx 

V 


Y 


It  was  stated  in  Section  3.2,  though,  that  non-continuants  cannot  be  the  coda  of  a 
syllable.  This  condition  on  Navajo  syllable  structure  may  be  expressed  by  a  negative 
configurational  constraint,  (23): "> 

(23)     T-constraint: 

[-continuant]  cannot  be  connected  to  a  segment  X  if  X  is  in  coda  position. 


13 


In  the  syllabified  structure  in  (22),  though.J-continuant]  is  linked  to  a  coda  position, 
forming  a  structure  which  is  disallowed  by  c-constraint  (23).  [-continuant]  is  therefore 
delinked  from  the  coda  slot,  leaving  only  /  in  that  position;  and  after  construction  of  the 
pre-stem  syllable,  the  surface  form  of  the  verb  is  the  correct  yiilghal. 

While  the  above  account  of  d  fusing  with  the  /-classifier  is  quite  elegant,  the 
merger  of  d  with  the  voiceless  classifier  f  is  not  so  neat.  In  (21a),  -iid-  combines  with  t- 
to  give  /.  By  the  account  of  d  +  I  above,  it  is  not  mysterious  that  d  +  I  does  not  yield  an 
affricate:  the  account  developed  so  far  would  predict  that  when  d  and  fare  combined  in  a 
structure  like  (22),  the  combined  feature  matrix  linked  to  a  coda  position— [-continuant, 
+lateral,  -voice]~will  be  disallowed  by  c-constraint  (23),  and  [-continuant]  will  therefore 
be  delinked.  However,  this  leaves  the  feature  matrix  [+lateral,  -voice],  which  ought  to 
surface  as  voiceless-/;  not  as  voiced  /. 

In  fact,  nothing  in  the  account  developed  so  far  will  explain  this  voiced  /,  and  to 
account  for  it  we  must  posit  a  feature-changing  rule.  While  this  does  detract  from  the 
elegance  of  the  system,  it  does  not  discount  our  basic  approach.  Underspecification 
theory  should  reduce  the  number  of  feature-changing  rules  needed  in  a  system,  but  it 
does  not  disallow  such  rules.  (It  may  be  some  comfon  that  Kari  also  posited  a  rule  of  i- 
voicing.)  (24)  gives  a  formulation  of  the  rule  required  to  voice -fin  these  contexts. 

(24)      l-voicing. 

[         r-continuant]  [ 

[+lateral      J 

^      '  [-voice] 


The  effect  of  (24)  is  to  delink  [-voice]  from  the  structure  which  results  when  d  and  i  are 
both  mapped  to  the  pre-stem  coda  in  the  derivation  of  yiilghozh.  Notice  that  (24)  is  not 
actually  a  feature  changing  rule,  but  a  feature  deletion  rule.  I  propose  that  after  the 
application  of  (24),  [-continuant]  will  also  be  delinked  because  of  c-constraint  (23), 
leaving  only  [+lateral]  linked  to  the  coda  position.  This  will  give  the  correct  surface 
form." 


SJ.3.    "Devoicing  Effects." 

Wright  (1984)  refers  to  a  set  of  interactions  between  the  verbal  classifiers  /  and  / 
and  the  1st  singular  and  2nd  dual  subject  prefixes,  which  she  calls  "devoicing  effects." 
These  "devoicing  effects"  are  given  in  (25). 

(25)  Devoicing  Effects  (1st  sg.l2nd  Dual  Subject  Prefixes  and  Lateral  Verbal 
Classifiers) 


sh+    + 

->    sh 

[sh[4[ghozh]->yis/ixosh 

'I  tickle' 

h   +   + 

->      t 

[ohEHghozh]->wo/Xosh 

'you  (dual)  tickle' 
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(25)  Devoicing  Effects  (cont.) 

c.  sh+     1     -->    sh  [sh[l[zheeh]-->haaj/izheeh        'I  hunt  it' 

d.  h   +    1     -->      i         [oh[l[sheeh]-->haofeheeh  'you  (dual)  hunt  it' 


Like  the  above  interactions  between  -iid-  and  the  verbal  classifiers,  the  analysis  of  these 
devoicing  effects  hinges  on  the  insight  that  both  are  assigned  to  a  single  coda  position. 
And  as  in  the  last  section,  configurational  constraints  are  required  to  restrict  the  possible 
outcomes  of  consonant  mergers.  The  two  c-constraints  which  are  needed  arc  given  in 
(26)-(27). 

(26)      C-constraint. 

[+lateral]  cannot  be  connected  to  X  if  X  is  connected  to  any  PLACE 
feature. 


(27) 


C-constraint. 

[+spread]  cannot  be  connected  to  X  if  X  is  connected  to  [+lateral]. 


C-constraint  (26)  prohibits  [+lateral]  fi-om  being  connected  to  an  X-slot  which  has  place 
features  in  UR,  i.e,  to  any  segment  other  than  an  alveolar.  The  constraint  is  not 
unreasonable  a  priori:  cross-linguistically,  laterals  are  generally  alveolar,  and  Sapir  and 
Hoijer  (1967)  make  no  mention  of  velarized  laterals  in  their  discussion  of  Navajo 
phoneticsiz.  (26)  is  responsible  for  the  fusions  in  (25a,c),  sh  +t->  sh  and  sh  +  /  ->  sh, 
by  prohibiting  any  non-alveolar  segment  to  be  a  lateral  in  UR.  (28)  shows  the  features 
involved  in  the  merger  sh  +  I  -->  sh: 

(28).     sh         +  1  ~>       sh 

t  voice"]  [-Hlatl  [-voice 

■ant   J  '  -ant 

L+lateral. 

Merging  the  features  of  sh  and  /  creates  a  lateral  palatal  matrix.  However,  [-(-lateral]  is 
barred  from  being  connected  to  a  [-anterior]  segment  by  c-constraint  (26).  The  delinking 
of  [+lateral]  results  in  a  matrix  [-anterior,  -voice]  which  surfaces  as  sh.  The  derivation  of 
sh  from  sh+f  is  similar. 

(27)  bars  [+spread]  from  being  part  of  a  segment  with  [+lateral].  In  analyzing  the 
other  two  devoicing  effects  (25b,d),  we  wish  to  to  capture  the  generalization  that  when  h 
fuses  with  a  lateral,  the  lateral  acquires  the  [-voice]  of  h,  as  illustrated  in  (29). 


(29)a.  h 

f -spread} 
voice  J 

b.         h 
r+spreadl 
[-voice  J 


1 
[+lat] 


f-Hlat   ] 
[-voicej 


->      + 
+lat 


-voice 
+spread 

+lat      1 

-voice 

+spreadl 


While  the  above  mergers  express  the  devoicing  of  the  laterals,  they  yield  feature  matrices 
which  are  lateral  and  [+spread]--a  segment  which  is  not  defined  in  UR.  (27)  ensures  that 
in  such  a  case  as  (29),  [+spread]  will  be  delinked,  leaving  only  the  voiceless  /. 

5J.4,  Problematic  d-effects. 

There  remain  three  problematic  d-effects,  which  are  illustrated  in  (30). 

(30)  Problematic  d-effects. 

a.  d  +   y     -->    'y  [d[£fty6fl-->t*6<3'ahoni')^f        'we  are  many' 

b.  d  +   y     -->     d  [d[0t4]->yii^4  'we  eat' 

c.  d  +   z     -->     d  [d[0tzeez]~>yiideez  'we  singed  it' 

(cf.  yizeez,  "he  singed  it") 

(30a,b)  present  an  obvious  problem,  if  one  looks  at  only  the  consonants  involved: 
in  one  case  d  +  y  yields  'y,  in  another  case  d.  However,  I  follow  Wright  (1984)  in 
analyzing  the  two  stems  in  (30a,b)  as  having  different  initial  segments  in  UR.  That  is,  I 
suppose  that  the  underlying  verb  stem  in  (30a)  is  yof,  with  initial  y,  while  in  (30b)  the 
stem  is  vowel  initial,  ^.  I  propose  that  the  combination  of  d  and  y  gives  'y,  much  as  the 
combination  of  d  and  m  gave  'm  in  Section  5.3.1.  At  this  point  I  will  not  propose  a  c- 
constraint  which  would  predict  this,  though,  for  the  following  reason:  there  is  some 
evidence  in  Wright  (1984)  that  the  occurrence  of  the  vowel  /—at  least  in  the  conjunct 
prefix  system-is  very  predictable,  and  need  not  be  represented  in  UR.  Thus,  Wright 
(1984)  analyzes  most  occurrences  of  y  as  an  underlying  i  which  has  been  linked  to  an 
onset  or  coda  syllable  position.  This  suggests  that  the  stem-initial  y  of  (30a)  is  actually  / 
in  UR.  A  proper  formulation  of  the  c-constraint  ensuring  that  d  and  y  combine  to  form  'y 
would  therefore  require  an  underspecified  analysis  of  5ie  vowel  system  of  the  Navajo 
verb,  which  I  will  not  undertake  at  this  point. 

In  contrast,  the  stem  in  yiid^  (30b)  is  vowel  initial  ^  in  Wright's  (1984)  analysis.  Wright 
does  not  explicitly  discuss  this  case,  but  I  believe  her  general  analysis  can  be  made 
explicit  in  the  in  the  system  outlined  above.  I  propose  the  following  derivation  for  yiidi^: 
Since  the  usual  syllable  type  in  Navajo  is  CVC,  when  the  UR  fonns  are  mapped  onto  the 
syllabified  skeleton,  the  stem  d  is  mapped  onto  the  nucleus  position  of  a  CVC  template. 
When  the  rule  of  Merger  (17)  applies,  the  1st  dual  prefix  d  is  linked  to  an  otherwise 
empty  timing  slot  in  syllable-initial  position,  and  so  is  realized  as  d  on  the  surface.  (This 
exceptional  case  of  prefix  d  appearing  as  d  is  the  motivation  for  the  Merger  rule's 
formulation  in  (17)  as  not  showing  the  stem-initial  timing  slot  linked  to  any  feature 
content.)  To  illustrate,  I  present  derivations  for  yiilghozh  (21a)  and  yiid4  in  (31).  In  (31), 
the  system  of  underspecification  and  configurational  constraints  (plus  the  rule  of  /- 
Voicing)  is  able  to  derive  the  correct  surface  form.  (Syllable  Construction  fleshes  out  the 
pre-stem  syllable  at  the  end  of  the  derivation;  its  exact  formulation  is  beyond  the  scope  of 
this  paper,  but  note  that  the  syllable  is  fleshed  out  with  an  i  vowel  and  a  y  onset.)!' 

At  present  I  have  no  real  explanation  for  the  third  problematic  merger,  d  -^  z->  d 
(30c).  However,  I  would  point  out  that  this  is  an  extremely  rare  merger,  and  not 
productive.  Since  this  is  the  case,  it  seems  possible  that  d  +  z  ->  rf  is  a  case  of 
suppletion  or  allomorphic  variation;  i.e.,  the  d-classifier  forms  of  the  root  zeez,  "to  think, 
believe,"  have  the  form  *d-deez  in  UR,  presumably  as  a  result  of  some  historical  process. 
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(31) 


UR 

d[l[ghozh] 

d[0[^] 

Syllabif. 

VTif 

XX      X    XXX 

V              0- 

xxxxxx 

Y  Y 

Merger 

— 

d[0[i] 
X  xxxxx 

/•Voicing 

vfrr 

XX    X  XXX 

l-cont] 
delinked 
(c-constr.  23) 

a-         W 

Syllable 
Construction 

yiilghozh 

yiid^ 

6.  Conclusions. 

Accounts  of  vowel  merger  from  an  underspecified  approach  (such  as  Sohn  1987) 
have  succeeded  in  describing  vowel  fusion  as  the  combining  of  the  underlying  features  of 
radically  underspecified  segments.  I  have  shown  above  that  the  d-effect  in  Navajo,  a 
process  of  consonant  merger,  may  also  be  described  in  terms  of  the  theory  of 
underspecification.  That  is,  when  consonant  fusion  is  taken  to  be  a  similar  sort  of 
combination  of  the  feature  specifications  of  consonants  in  UR,  most  of  the  consonants 
which  result  from  the  d-effect  can  be  predicted  from  the  underlying  features  of  their 
component  parts.  Those  instances  of  fusion  which  are  not  simple  combinations  of 
features  in  UR  can  be  described  if  we  adopt  one  feature-changing  rule  (^ Voicing)  and 
four  configurational  constraints.  In  this  way,  underspecification  theory  offers  an  account 
of  the  d-effect  which  is  both  elegant  and  explicit.  In  return,  the  extension  of  the  theory's 
application  to  consonant  mergers  lends  support  to  the  overall  theory. 

At  various  points  in  the  above  analysis  issues  of  Navajo  syllable  structure  and 
morphology  have  been  raised.  It  is  clear  that  the  analysis  of  the  d-effect  is  only  one  part 
of  a  larger  generative  treatment  of  Navajo,  but  a  proper  account  of  the  d-effect  is 
essential  if  these  issues  within  Navajo  are  to  be  addressed.  It  appears  that  an 
underspecified  analysis  of  the  Navajo  vowel  system,  and  an  analysis  of  the  Navajo  verbal 
system  from  the  standpoint  of  lexical  phonology,  would  be  especially  fruitful.  In  this 
analysis,  though,  larger  questions  of  phonological  theory,  such  as  syllabification 
processes,  the  formulation  of  phonological  rules,  and  the  nature  of  configuradonal 


constraints  have  been  only  hinted  at,  and  I  have  mostly  avoided  these  larger  issues. 
Navajo  is  a  language  of  stunning  morphological  complexity,  though,  and  a  closer 
analysis  of  Navajo  from  the  standpoint  of  current  phonological  theory  will  certainly  raise 
many  interesting  and  challenging  problems  for  that  theory. 

NOTES 

•I  would  like  to  thank  Dr.  Charles  Kisseberth  for  valuable  comments  on  an  earlier 
draft  of  this  paper,  also,  thanks  to  Hyang-Sook  Sohn,  for  introducing  me  to 
underspecification  theory  and  reassuring  me  that  Navajo  isn't  really  bizarre.  Remaining 
faults  are  of  course  my  own.  SDG. 

2The  Young  and  Morgan  orthography  uses  <h>  for  both  [h]  and  [x].  The 
allophonic  variations  of  these  consonants  arc  somewhat  controversial,  and  I  will 
distinguish  them. 

3It  appears  that  the  Navajo  verb  might  be  well  described  in  terms  of  lexical 
phonology:  Kari's  groupings  of  "conjunct"  and  "disjunct"  prefixes  bear  some 
resemblance  to  two  different  strata.  I  am  not  aware  of  any  work  on  this  topic,  though, 
and  will  only  allude  to  possible  results  of  such  an  analysis. 

■♦Wright  (1984)  actually  proposes  to  eliminate  /  vowels  from  the  underlying 
representation  of  prefixes.  This  is  a  stimulating  proposal  from  the  standpoint  of 
underspecification  theory,  but  it  is  not  directiy  relevant  to  the  d-effect. 

5The  solution  which  Wright  (1984)  proposes  utilizes  a  version  of  structure 
preservation,  or  rather,  "structure  rectification."  As  she  sees  it,  two  consonants  linked  to 
a  single  C-slot  "compete"  for  the  position.  A  Coda  Hierarchy  determines  which  features 
will  be  realized,  and  this  hierarchy  acts  always  to  mold  the  consonants  into  an  allowable 
affricate.  Wright  proposes  to  account  for  the  effects  of  the  rf-classifier  and  the  1st  dual 
prefix  in  this  way,  as  well  as  for  the  interaction  of  the  1st  dual  with  the  verbal  classifiers. 
Wright's  proposal  incorporates  valid  insights,  such  as  the  fact  that  syllable  structure  plays 
a  role  in  determining  allowable  surface  forms.  However,  she  does  not  elaborate  the 
strong  theoretical  claims  of  her  proposal;  nor  does  the  Coda  Hierarchy  give  correct 
results  for  the  merger  d+'t-->  I. 

^Assuming,  that  is,  the  Obligatory  Contour  Principle  of  McCarthy  (1981),  which 
prohibits  two  adjacent  occurrences  of  a  feature  on  a  tier. 

'In  (15)  the  glottal  consonants  ?  and  h  are  specified  [-voice].  It  would  be  possible 
(and  probably  desirable)  to  attribute  this  to  another  special  redundancy  rule  of  the  form. 
The  exact  formulation  of  this  rule  is  uncertain,  though,  since  it  must  refer  to  those 
segments  features  which  are  distinguished  by  not  having  specifications  for  super- 
laryngeal features.  Furthermore,  this  would  create  a  distinction  in  UR  between  "zero"  (a 
feature  which  has  not  yet  been  filled  in  by  a  default)  and  "nothing"  (a  feature  for  which  a 
segment  receives  no  specification  at  all),  for  ?  would  be  distinguished  from  t'  only  in 
being  specified  as  [+constricted]  (with  no  supralaryngeal  features  at  all),  whereas  t'  is 
specified  as  [+constricted,  -continuant],  with  supralaryngeal  features  to  be  filled  in  later. 
While  this  has  implications  for  the  formulation  of  rules,  they  which  are  beyond  the  reach 
of  the  present  study.  (For  "supralaryngeal  features,"  see  Note  9.) 
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*The  rule  of  Consonant  Merger  must  be  formulated  so  that  it  will  apply  only  to 
segments  whose  only  feature  is  [-continuant],  and  not  [-continuant]  and  some  other 
feature.  While  this  has  implications  for  the  theory  of  rule  formalism  proposed  in 
Clements  (1985),  I  will  not  pursue  the  matter. 

'"Primary  place"  (P-place)  and  "Secondary  place"  (S-place)  are  terms  drawn  from 
the  theory  of  feature  geometry  described  in  Clements  (1985),  in  which  the  various 
autosegmental  tiers  which  bear  the  features  are  organized  into  a  tree-like  structure,  (i) 
shows  the  modified  tree  structure  presented  in  Archangeli  and  PuUeyblank  (1986). 

(i) 


Root  node 

[voice] 

Laryngeal  node 

Ispreadl 

Icon.lrl 

(com] 

Supralaryngeal  node 

[strid] 

(manner  features) 

[nasdl 

Place  node 

[am] 

(P-place  features) 

[cor] 

S-place  node 

[back] 

[high] 

In  this  paper  I  loosely  adopt  this  conception,  since  it  allows  a  simple  formulation 
of  configurationid  constraint  (26).  Otherwise,  feature  geometry  does  not  crucially  affect 
rule  formulation,  and  I  will  not  pay  strict  attention  to  it. 

lOMichael  Kenstowicz  has  observed  to  me  that  this  constraint  may  be  viewed  as  a 
requirement  that  the  default  feature  [+continuant]  appear  in  coda  consonants  on  the 
surface.  In  this  way  Navajo  parallels  and  contrasts  with  Korean,  for  in  Korean  coda 
consonants  must  be  [-continuant],  and  the  defauh  value  is  [-continuant]  (see  Sohn  1987). 

i2My  analysis  crucially  assumes  that  the  rule  of  i- Voicing  (24)  can  refer  to  the 
banned  matrix  [-cont,  +lat,  -voice]~i.e.,  that  c-constraint  (23)  does  not  destroy  the  crucial 
environment  for  rule  (24)  before  (24)  applies.  While  this  bears  on  questions  of  how 
configurational  constraints  apply  in  the  derivation,  I  will  not  pursue  them.  I  should  also 
note  that  my  use  of  configurational  constraints  hinges  on  the  assumption  that  a  violated 
configurational  constraint  does  not  simply  block  the  derivation.  In  my  analysis,  then, 
configurational  constraints  capture  expliciUy  Wright's  somewhat  vague  processes  of 
"structure  rectification"  (see  note  5). 

'31  assume  that  c'-constraint  (26)  would  be  removed  at  some  point  in  the 
derivation  to  allow  laterals  to  receive  default  place  features. 

•-•These  pre-stem  syllables  and  their  form  have  implications  for  the  theory  of 
syllabification  as  proposed  by  Levin  (1985)  and  expanded  by  Archangeli  and  others. 
This  would  be  an  interesting  area  for  further  investigation. 
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This  paper  argues  for  the  inclusion  of  code-mixed 
and  other  multidialectal  and  multilingual  facts  and  for 
a  reconceptualization  of  linguistic  competence  vis-a- 
vis multilinguals  in  the  formulation  of  syntactic  the- 
ories. Drawing  on  a  few  code-mixed  sentences  involving 
two  Bantu  and  two  Indo-European  languages  (viz.,  Lingala, 
KiSwahili,  French,  and  English)  as  an  example  of  multi- 
dialectal  and  multilingual  data,  the  paper  argues  that 
the  speech  of  code-mixers  represents  the  externalization 
of  the  (internalized)  bilingual 's  grammar  and  that  this 
knowledge  must  be  accounted  for  by  any  theory  of  language 
in  general,  and  syntax  in  particular.  It  is  suggested 
that  the  inclusion  of  language  varieties  in  syntactic 
investigations  will  not  only  expand  the  data-base  upon 
which  insightful  conclusions  can  be  drawn  about  what 
constitutes  language  knowledge,  but  will  also  benefit 
linguistic  theory.  The  paper  contends  that  the  current 
syntactic  theories  are  formulated  on  a  very  narrow  do- 
main to  allow  for  an  adequate  account  of  knowledge  of 
human  language.  A  reconceptualization  of  the  internal 
organization  of  multilingual  grammars  is  suggested  and 
its  implications  for  syntactic  theory  are  examined. 


1.0  INTRODUCTION 

Since  its  advent  in  1957,  generative  transformational  theory 
has  purported  to  be  the  study  of  "what  one  knows  when  one  knows  a 
language:  that  is,  what  has  been  learned,  as  supplemented  by  in- 
nate principles"  (Chomsky  1986:  24).  This  study  of  the  language 
faculty,  whose  ultimate  objective  is  to  formulate  a  theory  of 
linguistic  knowledge,  has  focused  attention  on  four  basic  questi- 
tions  (cf.  Chomsky  1965,  1980,  1982,  1986,  Labov  1972,  Bresnan 
1878,  1982,  Kaplan  &  Bresnan  1982,  Riemsdijk  &  Williams  1986): 
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(1)  a.  What  are  the  universal  properties  of  human  language? 

b.  What  constitutes  knowledge  of  language? 

c.  How  is  knowledge  of  language  acquired? 

d.  How  is  knowledge  of  language  put  to  use? 

In  spite  of  the  proliferation  of  syntactic  and  phonological 
theories  in  the  past  six  years  or  so,  this  research  goal  has 
remained  constant.  The  approaches  and  therefore  the  answers  to 
the  questions  in  (1) ,  however,  have  varied  considerably.  For 
instance,  while  sociolinguistics  has  attempted  to  address  these 
questions  by  examining  data  from  both  the  performance  and  compe- 
tence aspects  of  language  competence  (Labov  197  0,  1972,  Ervin- 
Tripp  1970,  Hudson  1980,  Trudgill  1983)  or  what  Chomsky  (1986) 
terms  "Internalized"  (I-)  and  "Externalized  (E-)  language,  re- 
search in  phonological  and  syntactic  theories  has  focused  pri- 
marily on  competence/ I-language.  As  a  result,  synchronically  in- 
variant language  has  generally  been  taken  as  the  sole  basis  of 
theoretical  development;  multilingual  and  variant  monolingual 
data  have  been  excluded  from  consideration  in  the  formulation  of 
such  theories. 

1.1  Code-mixed  data  &  linguistic  description.  The  exclusion 
of  dialectal  and  multilingual  data  as  possible  objects  of  lingu- 
istic analysis  raises  serious  questions  about  the  adequacy  of 
current  theories  in  accounting  for  linguistic  knowledge,  viz.,  in 
answering  questions  (Ib-d) .  Consider,  for  example,  the  sentences 
in  (2)  and  (3)  from  Bantu  languages: 

(2)  KiSwahili-Enqlish  (KE) : 

a.  Ni-na-m-provoke  kila  wakati  tu-ki-kutana  naye.  (Scotton 
I-Pres-OM-provoke  each  time  we-TA-meet  with  him  1983) 
(I  provoke  her  every  time  if  we  meet  with  her) 

b.  Ni  nani  a-li-i-spoil  kamba  yetu?  Iko  short  sana.. 
Is  who  3rdSg-pst-0M-spoil  rope  ours  it's  short  very 
(Who  spoiled  our  rope?  It  is  very  short...) 

(Scotton,  id.) 

(3)  Linqala-French  (LF) 

a.  Soko  a-compren-aki     yo  te   a-kok-aki  ko-ndima  te. 

If   he-understand-Impf  you  not  he-can-Impf  to-agree  not 
If  he  did  not  understand  you,  he  would  not  have  agreed.) 

b.  Ba-garcons  wana  ba-zo-mi-defendre  kaka  mpamba. 

Pl-boys    there  they-be-OM-defend   just  for  nothing 
(Those  boys  are  arguing  (in  self-defense)  just  to  save 
face.) 
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These  sentences,  which  are  representative  of  code-mixed  speech 
commonly  heard  in  the  Eastern  and  Central  African  countries 
where  the  language  pairs  are  spoken,  cannot  be  explained  adequ- 
ately within  the  current  theories  of  generative  grammar.  To  ac- 
count for  the  derivation  of  any  of  these  utterances  in  one  of 
these  theories,  for  example  Government  and  Binding  (GB) ,  one  will 
have  to  modify  the  theory  in  several  important  respects. 

1.2  Purpose  and  scope  of  the  study.  It  will  be  argued  in  this 
paper  that  code-mixed  facts  such  as  (2)  and  (3)  above  represent 
the  externalization  of  the  linguistic  competence  of  the  bilingual 
speaker  (hereafter  the  bilingual 's  grammar)  and  that  this  know- 
ledge must  be  explained  by  any  theory  of  language.  Code-mixed 
data  drawn  from  Bantu  languages  are  used  to  motivate  an  analysis 
of  the  bilingual 's  grammar,  and  its  implications  for  linguistic 
theory  are  discussed.  It  is  suggested  that  the  use  of  multi- 
dialectal  and  multilingual  data  is  a  necessary  extension  of  the 
domain  of  linguistic  analysis  if  generative  theory  is  to  become  a 
true  model  of  the  speaker's  knowledge  of  language. 

The  discussion  begins  with  a  brief  characterization  of 
code-switching,  and  moves  on  to  a  critical  review  of  general 
linguistic  and  code-switching  theories  in  light  of  data  from 
Up-country  Swahili  and  Kinshasa  Lingala  (KL) .  It  concludes  with 
an  analysis  of  the  bilingual 's  grammar  and  its  significance  to 
syntactic  theory.  We  will  focus  our  attention  on  GB  and  its  im- 
mediate predecessors  (Standard  &  Extended  Standard  Theory)  under 
which  most  code-mixing  studies  have  been  conducted. 

2.0  CODE -MIXING,  LINGUISTIC  DESCRIPTION  &  LINGUISTIC  THEORY 

2.1  Overview  of  code-switching.  Code-switching  (CS) , 
generically  defined  as  the  alternating  use  of  linguistic  units 
from  two  or  more  distinct  languages  (cf.  Lipski  1978,  Kachru 
1978,  Sridhar  &  Sridhar  1980),  has  been  one  of  the  dominant 
topics  of  sociolinguistic  research  for  the  last  sixteen  years 
or  so.  Recently  the  phenomenon  has  attracted  the  attention  of 
syntacticians  of  various  persuasions  (cf.  Sridhar  &  Sridhar  1980, 
Sankoff  &  Poplack  1981,  Di  Sciullo  et  al.  1982,  Woolford  1983, 
Joshi  1983,  Bokamba  1985,  Ewing  1985),  but  its  fundamental 
characteristics  remain  poorly  understood. 

Strictly  speaking,  CS  consists  of  two  distinct  sub-phenomena 
referred  to  as  code-switching  proper  or  intersentential  switching 
and  code-mixing  (CM)  or  intra-sentential  switching.  The  former  is 
defined  in  (4)  and  the  latter  in  (5) : 

(4)  Code-switching  is  the  embedding  of  words,  phrases,  and 
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sentences  from  two  codes  across  sentence  boundaries  wi- 
thin the  same  speech  event  (Bokamba  1985:  3).^ 

(5)  Code-mixing  is  the  embedding  of  various  linguistic  units, 
i.e.,  morphemes,  words,  phrases,  and  clauses  from  two 
distinct  grammatical  systems  or  sub-systems  within  the 
same  sentence  and  speech  event  (Bokamba  1985:  4).^ 

The  difference  between  the  two  phenomena  is  illustrated  in  (6) 
and  (7),  where  the  former  is  a  code-switched  discourse,  and  the 
latter,  a  code-mixed  utterance: 

(6)  Linaala-French: 

Boni,  Beya,  osili  kozonga!  Tu  rentres  comme-pa  sans  me 
prevenir?  (Bokamba  1985) 

Well,  Beya,  you  are  already  back!  You  return  (just  like 
that)  without  warning  me? 

(7)  Kiswahili-Enqlish: 

lie  accident  ilitokea  a-li-po-lose  control  na 
a-ka-overturn  and  landed  in  a  ditch  (Abdulaziz  1972) 
(The  accident  occurred  when  he  lost  control  and  over- 
turn. .  . ) 

In  this  paper  we  are  primarily  concerned  with  CM,  but  our  conclu- 
sions can  equally  be  applied  to  CS  proper. 

CM  continues  to  be  a  fascinating  topic  in  linguistic  and 
psycholinguist ic  research  for  three  main  reasons:  (1)  initial  and 
continuing  misunderstanding  of  the  general  phenomenon  of  CS;  (2) 
paucity  of  cross-linguistic  studies  on  the  phenomenon;  and  (3) 
the  implications  that  the  mixed  data  have  for  the  characteriza- 
tion of  the  internalized  grammar  of  the  bilingual  speaker. 

Initially,  CS  was  seen  as  a  verbal  behavior  that  is  symptom- 
atic of  language  deficiency  on  the  part  of  the  speaker.  The  phe- 
nomenon was,  therefore,  characterized  as  random  and  non-rule  go- 
verned (cf.  Gumperz  1970,  Gumperz  &  Hernandez  1972,  Abdulaziz 
1972,  Timm  1975).  Since  the  late  1970s,  however,  it  has  become 
evident  that  CS  and  CM  are  rule-governed  phenomena  that  reflect 
the  bilingual 's  linguistic  creativity  (Kachru,  1978,  Lipski  1978, 
Pfaff  1979,  Poplack  1980).  As  a  general  phenomenon,  CS  is  govern- 
ed socio-psychologically  and  structurally.  It  is  socio-psychologic- 
ally  governed  in  the  sense  that  switching  is  typically  triggered 
by  the  context  of  the  situation  for  identifiable  social  and  psycho- 
logical functions  (Scotton  &  Ury  1977,  Wentz  1977,  Jacobson  1978, 
Kachru  1978) .  The  phenomenon  is  structurally  governed  in  that  the 
resulting  utterances  are  constructed  according  to  the  rules  of  the 
language  pair  concerned  (Lipski  1978,  Pfaff  1979,  Poplack  1980, 


Sridhar  &  Sridhar  1980,  Sankoff  &  Poplack  1981,  Kachru  1982,  Di 
Sciullo,  Muysken,  &  Singh  1986) . 

In  spite  of  this  gain  in  our  understanding  of  CS,  the  basic 
linguistic  and  psycholinguistic  characteristics  underlying  it  re- 
main poorly  understood.  This  inability  is  ascribable  to  the  pau- 
city of  cross-linguistic  studies  on  CS,  on  the  one  hand,  and  to 
the  narrowness  of  the  domain  of  linguistic  investigation  in  gener- 
ative grammar,  on  the  other.  The  vast  majority  of  the  studies  pu- 
blished on  CS  have  been  based  on  Spanish-English  bilingualism, 
with  a  few  others  on  similar  Indo-European  language-pairs  (e.g., 
French-English,   French-Italian,  Greek-English,  German-English, 
etc.).  Studies  based   on  non-Western  languages  are  few  (cf.  Ure 
1972,  Abdulaziz   1972,  Bautista  1975,  Annamalai  1978,  N'Sesep 
1978,  Sridhar  and  Sridhar  1980,  Kachru  1982,  Bokamba  1985,  1987, 
Kamwangamalu  1987a,  and  forthcoming).  As  a  result,  the  conclu- 
sions drawn  from  code-mixed  data  in  Western  languages  are  much 
less  generalizable  than  the  literature  has  led  us  to  believe,  and 
are  therefore  flawed  in  their  characterization  of  the  phenomen- 
on. ^ 

Even  if  cross-linguistic  research  on  CS  were  to  be  undertaken 
extensively,  the  insight  that  such  work  might  shed  would  continue 
to  be  limited  as  long  as  researchers  confine  themselves  to  the 
current  basic  assumptions  that  underlie  generative  theory.  Code- 
switching  in  general,  and  code-mixing  in  particular,  are  vastly 
interesting  precisely  because  they  constitute  the  strongest  type 
of  evidence  for  I-language  and  its  relationship  to  E-language 
(i.e.,  competence  vs  performance).  In  other  words,  code-mixed 
speech  supports,  on  the  one  hand,  the  long  held  view  that  langu- 
age is  creative/innovative;  and  on  the  other,  it  contradicts  the 
claim  that  competence  and  performance  are  separate  and  dis- 
tinct aspects  of  language  knowledge  (Chomsky  1965,  1972,  1975, 
1986) .  Assuming  that  code-mixed  speech  represents  the  externaliza- 
tion  of  the  internalized  bilingual 's  knowledge  of  the  languages 
concerned,  can  linguistic,  especially  syntactic  theory  account 
for  this  intuition?  If  so,  how  can  this  be  done  and  what  implica- 
tions would  the  extension  of  the  syntactic  theory  in  this  respect 
have  for  general  theory?  We  take  up  these  questions  in  the  next 
and  last  part  of  this  paper. 

2.2  Overview  of  linguistic  theory.  To  understand  the  signi- 
ficance of  code-mixing  to  linguistic  theory,  it  is  nr>cessary  to 
review  the  objectives  of  Transformational  Grammar  (TG)  and  its 
basic  assumptions  and  principles. 

We  stated  at  the  beginning  of  this  paper  that  the  main  object- 
ive of  generative  theory  has  been  to  discover  the  universal  pro- 
perties that  underlie  human  language  and  to  provide  a  character- 
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ization  of  what  it  is  that  a  person  knows  when  (s)he  knows  a  langu- 
age, and  how  such  knowledge  is  acquired  (Chomsky  1965,  1972,  1975, 
1981).  Initially  the  theory  focused  on  questions  (la-c),  but  in  the 
last  two  three  years  or  so  the  scope  of  inquiry  has  come  to  include 
the  issue  of  language  use,  viz.,  question  (Id). 

In  the  Aspects  of  the  Theory  of  Syntax  (1965)  model  (Stand- 
ard Theory,  ST)  and  its  extended  versions  (Extended  Standard  Theo- 
ry, EST)  of  the  1970s,  the  theory  attempted  to  answer  the  questions 
in  (1)  by  assuming  the  construction  of  sentences  to  be  based  on  the 
interaction  of  three  major  components:  syntactic,  phonological,  and 
semantic,  with  the  last  two  being  largely  interpretational  and  the 
first  basic.  Two  levels  of  linguistic  representation,  viz.,  deep 
and  surface  structure,  were  postulated;  and  a  series  of  transform- 
ational rules  (both  in  syntax  and  phonology)  was  posited  to  operate 
on  specific  base-rules  (in  the  deep  structure)  to  construct  the 
phonetic/ surf ace  structure  (Chomsky  1965,  1975) .  Thus  in  the  cons- 
truction of  a  language-specific  monolingual  sentence  such  as  the 
KiSwahili  (8) 

8)  Tafadhali  ni-elez-e  ma-nene  haya 

(Please  explain  these  things/matters  to  me.) 

the  relevant  syntactic,  lexical,  and  phonological  representations 
upon  which  the  transformational  rules  and  rules  of  semantic 
interpretation  would  operate  would  be  extracted  from  the  grammar 
of  KiSwahili.  Such  a  grammar,  it  was  assumed  in  the  ST,  was  free 
of  errors  and  diversity:  it  was  the  grammar  of  the  idealized 
speaker  who  lived  in  a  homogeneous  speech  community.  This  model 
underwent  a  number  of  changes  in  the  1970s  to  become  the  Extend- 
ed Standard  Theory  (EST)  (Radford  1981) . 

With  the  advent  of  Government  and  Binding  theory  (GB)  in  1980/ 
1981,  the  over-all  conception  of  linguistic  theory  has  changed 
radically:  we  have  moved  from  a  theory  of  rules  to  a  theory  of 
principles  (Riemsdijk  &  Williams  1986) .  Specifically,  the  trans- 
formations assumed  under  the  previous  model,  TG,  are  now  handled 
by  a  small  set  of  principles  based  on  the  principles  of  the  "core 
grammar"  (i.e.,  universal  grammar).  Only  one  transformation,  viz. 
"Move-Alpha",  has  been  retained  under  the  new  theory.  The  role  of 
this  transformation  is  not  to  relate  D-structure  to  S-structure, 
but  rather  to  move  any  element  anywhere  in  a  sentence  in  accord- 
ance with  the  sub-theories  of  GB:  binding,  0-criterion,  control, 
etc.  (Chomsky  1982,  1986). 

Unlike  the  TG  of  the  1960s  and  1970s  which  paid  lip-service 
to  UG  and  child  language  acquisition,  GB  theory  actively  attempts 
to  address  itself  to  these  questions  in  the  hope  of  providing  a 
general  characterization  of  the  language  faculty.  The  range  of 
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languages  drawn  upon  to  develop  the  various  principles  in  the 
theory  has  expanded  from  English,  to  other  languages  of  the 
world,  including  Bantu  languages  (Bergvall  1982,  1983,  Clements 
1984,  Zaenan  1984).  Child  language  acquisition,  which  Chomsky 
continues  to  regard  as  the  ultimate  test  of  the  explanatory 
adequacy  of  any  linguistic  theory  (Chomsky  1982,  1986),  receives 
considerable  attention  in  GB  theory. 

In  spite  of  the  claim  by  Chomsky  (1986:  3ff)  and  some  of 
his  followers,  e.g.,  Riemsdijk  and  Williams  (1986),  that 
performance/language  use  is  one  of  the  key  issues  to  which 
current  linguistic  theory  seeks  answers,  very  little  attention 
has  been  paid  to  it  in  the  syntactic  literature.  In  fact,  Chomsky 
continues  to  treat  performance,  referred  to  now  as  "Externalized 
language",  as  derivative.  He  states  (Chomsky  1986:  31): 

(9)  The  account  presented  by  Quine  [1972],  Lewis  [1975]  and 
others  has  the  story  backwards:  E-languages  are  not  given, 
but  are  derivative,  more  remote  from  data  and  from  mechan- 
isms than  I [internalized] -languages  and  the  grammars  that 
are  theories  of  I-languages;  the  choice  of  E-language 
therefore  raises  a  host  of  new  and  additional  problems 
beyond  those  connected  with  grammar  and  I-language. 

By  maintaining  the  primacy  of  I-language  over  E-language, 
current  syntactic  theory  rejects  the  legitimacy  of  E-languages  as 
objects  of  linguistic  inquiry.  We  concur  with  Bresnan  and  Kaplan 
(1982:xix)  that  "if  it  is  uncontroversial  that  stored  knowledge 
structures  underlie  all  forms  of  verbal  behavior,  the  question 
arises  here  of  how  these  different  components  of  linguistic 
knowledge  are  related."  Is  the  appeal  to  "idealization"  of 
language  simply  an  attempt  "to  restrict  the  kind  of  evidence  that 
may  be  brought  to  bear  on  representational  issues,"  as  Bresnan 
and  Kaplan  (1982:xxiii)  observe,  or  is  it  a  way  to  gain  some 
initial  insights  about  the  language  faculty?  What  role,  for 
example,  should  an  I-language/grammar  play  in  models  of 
performance? 

In  attempting  to  answer  these  questions,  among  others,  a 
number  of  linguists  have  suggested  recently  that  linguistic 
theory  should  reflect  socio-psychological  reality  (cf.  Labov 
1972,  Ervin-Tripp  1970,  Hudson  1980,  Bresnan  &  Kaplan  1982). 
Bresnan  and  Kaplan  (1982:  xxii)  observe  in  this  regard  that 

(10)  [The  concept  of  competence]  requires  that  we  take 
responsibility  not  only  for  characterizing  the  abstract 
structure  of  the  linguistic  knowledge  domain,  but  also  for 
explaining  how  the  formal  properties  of  our  proposed 
linguistic  representaions  are  related  to  the  nature 
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of  the  cognitive  processes  that  derive  and  interpret  them 
in  actual  language  use  and  acquisition.  Chomsky's  current 
conception  of  psychological  reality  represents  a  retreat 
from  this  more  ambitious,  and  scientifically  far  more 
interesting,  goal. 

Bresnan  and  Kaplan  (1982:  xxiii)  note  further  that, 

(11)  Since  we  cannot  directly  observe  this  'internal  grammar,'  we 
must  infer  its  properties  indirectly  from  the  evidence 
available  to  us  (such  as  linguistic  judgments,  performance 
of  verbal  tasks  in  controlled  experimental  conditions, 
observations  of  the  linguistic  development  of  children, 
and  the  like) .  The  data  of  linguistics  are  no  more  or  less 
privileged  for  this  inquiry  than  any  other  data. 

In  pursuit  of  these  research  objectives,  Kaplan  and  Bresnan  (1982) 
developed  the  theory  of  Lexical-Functional  Grammar  (LFG)  in  which 
it  is  assumed  that  there  are  only  two  major  structural  components 
(i.e.,  constituent  structure  and  functional  structure),  and  no 
transformational  rules  of  any  kind.   All  grammatical  relation- 
changing  rules  are  assumed  to  be  lexical  operations,  and  the  dis- 
tinction between  D-structure  and  S-structure  is  dispensed  with 
completely  (Kaplan  &  Bresnan  1982,  Kaplan,  Maxwell  &  Zaenan  1987). 
The  mapping  between  the  c-structure  and  the  f-structure  is  achiev- 
ed in  a  "many-to-many",  rather  than  isomorphic  fashion. 

The  elimination  of  abstract  syntactic  concepts  such  as  deep 
structure,  surface  structure,  and  transformational  rules  in  LFG 
is  seen  by  Bresnan  (1978,  1982),  and  Kaplan  and  Bresnan  (1982), 
among  others  (e.g.,  Mchombo  1978),  as  a  necessary  refinement  that 
will  eventually  lead  both  to  a  simpler  and  more  realistic  theory 
of  language  than  competing  theories  have  hitherto  permitted.  In 
this  paper  we  will  argue  that  the  investigation  of  CM  can  play  a 
significant  role  in  refining  further  our  conceptualization  of  syn- 
tactic theory. 

It  is  important  to  point  out  here  that  while  the  architecture 
of  GB  and  LFG  differ  in  many  significant  respects,  they  do  share 
one  major  common  assumption:  the  lexicon  and  related  lexical  rules 
are  language-specific.  This  view  raises  a  number  of  fundamental 
questions  regarding  the  analysis  of  code-mixed  sentences,  as  we 
will  see  in  the  following  section. 

2.3  Code-mixing  and  linguistic  theory.  There  have  been 
essentially  two  main  approaches  in  the  study  of  the  general 
phenomenon  of  CS:  functional  and  descriptive/ syntactic.  The 
functional  approach  has  attempted  to  identify  the  various 
socio-psychological  functions  that  CS  serves  and  the  contexts  in 
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which  the  phenomenon  occurs,  (cf.  Gumperz  197  0,  Gumperz  & 
Hernadez  1972,  Scotton  &  Ury  1977,  Jacobson  1978a, b,  Kachru 
1978) .  The  descriptive/syntactic  approach  has  attempted  to 
provide  a  general  characterization  of  the  structural  properties 
exhibited  in  CS  in  general,  and  CM  in  particular,  under  the  ST 
and  the  EST  models  (Wentz  1977,  Lipski  1978,  Pfaff  1979,  Poplack 
1980,  Sridhar  &  Sridhar  1980,  Joshi  1985) . 

The  dominant  paradigm  in  the  syntactic  approach  has  been  to 
analyze  CM  speech  in  terms  of  syntactic  constraints  on  what  can 
or  cannot  be  mixed.  The  implicit  goal  of  the  hypothesis 
underlying  this  approach  is  that  in  order  for  a  code-mixed 
utterance  to  be  grammatical,  there  must  be  surface  syntactic 
equivalence/congruence  at  the  juncture  where  the  mixing  occurs 
between  LI  and  L2  (Lipski  1978,  Poplack  1979) .  In  view  of  this 
hypothesis,  several  constraints  have  been  proposed  in  the 
literature  to  purportedly  account  for  the  grammatical ity  of 
various  code-mixed  utterances  in  different  languages.  The  most 
popular  of  these  restrictions  include  the  Adjectival  phrase 
Constraint,  the  Equivalence  Constraint,  the  Free  Morpheme 
Constraint,  and  the  Dual  Structure  Principle  stated  in  (12) 
through  (15): 

(12)  The  Adjectival  Phrase  Constraint  (Pfaff  1979) : 

Adjective/noun  mixes  must  match  the  surface  word  order  of 
both  the  language  of  the  adjective  and  the  language  of  the 
head  noun. 

(13)  The  Equivalence  Constraint  (Poplack  1980:  586) : 

Code-switches  will  tend  to  occur  at  points  in  the 
discourse  where  juxtaposition  of  LI  and  L2  elements  does 
not  violate  a  syntactic  rule  of  either  language,  i.e.,  at 
points  around  which  the  surface  structures  of  the  two 
languages  map  on  to  each  other. 

(14)  The  Free  Morpheme  Constraint  (Poplack  1980:  585-6): 

Codes  may  be  switched  after  any  constituent  in  discourse 
provided  that  [that]  constituent  is  not  a  bound  morpheme. 

(15)  The  Dual  Structure  Principle  (Sridhar  &  Sridhar  1980) : 

The  internal  structure  of  the  guest  constituent  need  not 
conform  to  the  constituent  structure  rules  of  the  host 
language,  so  long  as  its  placement  in  the  host  sentence 
obeys  the  rules  of  the  host  language. 
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The  unstated  objective  of  the  constraint-oriented  approach  to  CM 
is  to  provide  a  definition  of  the  notion  "possible  grammatical 
utterance/sentence"  in  CM.  Such  a  definition,  it  is  assumed, 
would  enable  us  to  make  certain  syntactic  predictions  and  draw 
psycholinguistic  inferences  about  the  grammmar  of  the  bilingual 
code-mixer. 

Unfortunately,  however,  all  the  syntactic  constraints 
proposed  in  the  published  literature  thus  far,  including  those 
stated  in  (12)  through  (15),  have  been  invalidated  by  data  from 
various  language  groups  (cf.  Nartey  1982,  Bokamba  1985,  El-Noory 
1985,  Pandharipande  1986,  Kamwangamalu  1987a) .  As  a  consequence 
of  the  invalidity  of  these  constraints,  the  most  fundamental 
question,  viz.  the  characterization  of  the  bilingual 's  grammar, 
which  research  on  CS  seeks  to  address,  has  remained  unanswered. 

As  we  suggested  previously,  the  lack  of  meaningful  progress 
in  this  respect  is  due  in  part  to  flawed  assumptions  about  CS/CM 
based  on  Spanish-English  dominated  data  and  a  restricted  data-based 
theory.  Mere  description  of  which  elements  of  the  sentence  can  be 
switched  and  at  what  juncture  do  not  address  the  issue  of  the 
internalized  bilingual  grammar.  The  discussion  of  this  question 
requires  us  to  deal  with  the  following  key  sub-questions,  among 
others:  (1)  How  many  grammars  has  the  bilingual  code-mixer 
internalized?  (1)  If  (s)he  has  internalized  one  super-grammar, 
how  does  it  work  in  the  production  of  monolingual  and  code-mixed 
speech?  (3)  If  (s)he  possesses  two  or  more  grammars,  how  do  they 
interact  in  the  production  and  recognition  of  code-mixed  speech? 
For  instance,  are  the  grammars  accessed  simultaneously  or 
sequentially  in  such  speech  processing?  And  (4)  how  can  the  know- 
ledge that  underlies  the  code-mixed  verbal  behavior  be  described 
syntactically  under  any  of  the  current  syntactic  theories? 

These  questions  constitute  a  research  program  and  cannot  be 
answered  in  a  study  of  this  scope.  What  we  would  like  to  do  in 
the  remainder  of  this  section,  however,  is  to  examine  a  few  facts 
from  code-mixed  speech  involving  Bantu  and  European  languages  in 
order  to  ascertain  their  syntactic  characteristics.  With  this 
analysis  we  can  then  draw  certain  inferences  concerning  the 
nature  and  organization  of  the  bilingual 's  grammatical  systems. 

Consider  in  this  regard,  the  Nairobi  Swahili  code-mixed 
sentences  in  (2)  and  (7)  and  the  Kinshasa  Lingala  sentences  in 
(3)  and  (6) ,  repeated  here  for  convenience. 

KiSwahili-Enqlish; 

(2)  a.  Ni-na-m-provoke  kila  wakati  tu-ki-kutana  naye  (Scotton  1983) 
I-Pres-OM-provoke  each  time  we-TA-meet   with-him 
(I  provoke  her  every  time  if  we  meet  with  her.) 


b.  Ni  nani  a-li-i-spoil  kamba  yetu?  Iko  short  sana. . (Scotton,  id.) 
Is  who  3rdSg-pst-0M-spoil  rope  ours  it's  short  very 
(Who  spoiled  our  rope?  It  is  very  short...) 

(7)  lie  accident  ilitokea  a-li-po-lose  control  na  a-ka- 
overturn  and  landed  in  a  ditch  (Abdulaziz  1972.) 
(The  accident  occurred  when  he  lost  control  and  overturned..) 

Linaala-French : 

(3)  a.  Soko  a-compren-aki   yo  te  a-kok-aki  ko-ndima  te. 

If   he-understand-Impf  you  not,  he-can-Impf  to-agree  not 
(If  he  did  not  understand  you,  he  would  not  have  agreed) 

b.  Ba-garcons  wana  ba-zo-mi-d^fendre  pamba. 

Pl-boys    there  they-be-OM-defend  just  for-nothing 
(Those  boys  are  arguing  (in  self-defense)  just  to  save 
face.) 
(6)  Boni,  Beya,  osili  kozonga!  Tu  rentres  comme-^a  sans  me 
prevenir?  (Bokamba  1985) 

(Well,  Beya,  you  are  already  back!  You  return  (just  like 
that)  without  warning  me?) 

Sentence  (2a)  contains  the  English  verb  stem  -provoke  to  which 
are  prefixed  a  number  of  KiSwahili  morphemes:  ni  for  first  person 
subject  marker;  {-na-}  for  present  (continuative)  tense  prefix; 
and  {-m-}  for  third  person  object  marker. 

Assuming  that  the  lexicon  and  lexical  rules  are  language- 
specific,  as  noted  earlier,  what  we  can  infer  from  the 
construction  of  sentences  such  as  (2a)  is  this.  The  speaker 
determines,  first,  the  appropriate  syntactic  structure  in 
Swahili;  (s)he  then  makes  the  relevant  lexical  insertions  by 
drawing  the  verb  provoke  from  English  and  the  rest  of  the  lexicon 
from  KiSwahili,  before  applying  the  requisite  morpho-syntactic, 
phonological  and  semantic  interpretation  rules  from  KiSwahili. 
Since  provoke  is  pronounced  as  in  an  English  monolingual 
sentence,  it  presumably  undergoes  English  phonology.  Unless  one 
assumes  that  the  verb  provoke  is  a  loanword,  which  it  cannot  be 
because  other  words  are  treated  similarly,  sentence  (2a)  cannot 
be  construed  as  being  derived  from  a  monolingual  grammar.  The 
same  analysis  applies  to  (2b) ,  with  the  addition  that  the 
O(bject)  M(arker)  {-i-)  here  refers  to  the  noun  kamba  'rope'. 
Further,  in  the  second  sentence  of  that  discourse,  the  adjective 
short  is  drawn  from  the  English  lexicon.  In  both  sentences  (2a,b) 
the  grammatical  rules  that  apply  are  characteristically 
KiSwahili,  and  therefore  Bantu:  English  here  only  provides  a  few 
lexical  elements. 
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Sentence  (7),  which  consists  of  two  conjuncts  joined  by  the 
conjunction  na,  provides  one  of  the  best  illustrations  of  the 
interaction  of  the  two  grammatical  systems:  KiSwahili  and  English. 
The  sentence  contains  three  KiSwahili  lexical  elements  (disregard- 
ing for  a  moment  the  affixes):  the  demonstrative  ile  'that' 
(functioning  here  as  a  definite  article),  the  verb  ku-tokea  'to 
occur/happen,'  and  the  conjunction  na  ' and/with. 'Except  for  the 
occurrence  of  the  KiSwahili  prefixes  on  the  verb,  viz.  (a-)  for 
third  person  (human  class)  singular  subject  prefix,  (li-)  for 
past  tense  marker,  (-po-)  for  relative  temporal  marker,  and 
{_ka-)  for  consecutive  tense  marker,  the  rest  of  the  sentence 
is  English,  Morphosyntactically,  sentence  (7)  exemplifies  the 
intricate  mixing  of  the  grammatical  rules  of  the  language-pair: 
the  construction  of  the  sentence  appears  to  draw  equally  on  the 
syntactic,  lexical  (i.e.,  lexicon  and  lexical-formation  rules), 
phonological  and  semantic  components  of  English  and  KiSwahili. 

Consider,  next,  the  Kinshasa  Lingala  sentences  in  (3)  and 
(6).  The  sentences  in  (3)  illustrate  a  similar  interaction  to  (7) 
between  the  rules  of  Lingala  and  French.  Sentence  (3a)  includes 
in  its  first  clause  the  French  verb  comprendre  to  which  is 
prefixed  the  third  person  human  subject  marker  (a-),  and  suffix- 
ed the  imperfect  tense  marker  {-aki);  the  rest  of  the  elements 
in  this  sentence  are  Lingala.  Sentence  (3b)  is  thus  far  the  most 
interesting  struncture  for  two  main  reasons:  First,  it  exhibits  a 
doubly  pluralized  subject  noun,  ba-garcon-s  'boys',  where  the 
human  class  2  prefix  (ba-)  cooccurs  with  the  French  plural  suf- 
fix (-S}.  Second,  it  contains  a  concantenation  of  the  Lingala 
auxiliary  verb  ko-zal-a  'to  be'  and  the  French  verb  defendre. 
Sandwiched  between  these  two  verbs  is  the  Lingala  reflexive 
pronoun  (-mi-).  The  construction  of  this  sentence  involves  not 
only  the  rules  referred  to  in  the  description  of  the  sentences  in 
(2)  and  (7)  above,  but  also  pluralization  and  ref lexivization. 
Plural ization  appears  to  have  applied  twice:  once  in  French  and 
then  in  Lingala. 

The  discourse  in  (6)  includes  two  sentences,  one  in  Lingala 
and  the  other  in  French.  The  first  sentence  in  this  discourse, 
which  consists  entirely  of  Lingala  lexical  elements,  would 
presumably  be  processed  in  the  usual  fashion  by  applying  the  rule 
of  subject-verb-agreement  (SVA)  to  the  base  in  a  GB  framework. 
This  application  will  produce  the  third  person  human  subject  pre- 
fix {a-}  on  the  verb  ko-sil-a.  In  an  LFG  framework  SVA,  as  a 
lexical  rule,  would  result  naturally  from  the  association 
expressed  between  the  subject  and  the  verb  in  the  structural  tree 
(Bresnan  &  Mchombo  1987) .  Similarly,  the  French  sentence  in  the 
discourse  would  be  processed  in  the  same  fashion  by  the  applic- 
cation  of  SVA  to  produce  the  second  person  agreement  marker 
{-s)  on  the  verb  rentrer.  It  is  to  be  noted  here  that  unlike 
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in  the  construction  of  (2),  (3)  and  (7),  the  processing  of  each 
of  the  sentences  in  (6)  does  not  involve  the  interaction  of  the 
grammatical  rules  of  the  language-pair  at  any  level:  each  grammar 
operates  independently.  This  type  of  relation  is  characteristic 
of  code-switched,  rather  than  code-mixed,  speech. 

In  summary,  the  few  facts  we  have  examined  here  yield  the 
following  generalizations.  Code-switched  sentences  are  produced 
from  a  monolingual  grammar,  while  code-mixed  sentences  are 
produced  minimally  from  a  bilingual  grammar  of  some  sort.  In 
the  latter  case,  the  grammatical  rules  of  the  language-pair 
concerned  can  interact  at  different  levels  ranging  generally 
from  the  mere  sharing  of  lexicons  to  complex  morphosyntactic 
operations  (e.g.,  SVA,  ref lexivization,  object  incorporation, 
etc.).  In  each  code-mixed  sentence  the  speaker  seems  to  treat 
morphosyntactically  all  elements  as  if  they  belonged  to  the  same 
language;  but  phonetically  and  phonologically  each  lexical  item 
retains  the  properties  of  the  language  of  origin.  It  is  evident 
from  the  facts  presented  here  that  code-mixed  speech  is  much  more 
integrative  of  the  grammatical  systems  of  the  languages  involved 
in  the  discourse  than  it  has  been  hitherto  generally  acknowledged 
in  the  literature. 

These  conclusions  or  similar  ones  have  been  noted  previously 
in  the  literature  on  code-switching  (cf.  Lipski  1978,  Poplack 
1979,  Kachru  1982,  Bokamba  1985,  Kamwangamalu  1987).  Cross- 
linguistic  facts  similar  to  those  cited  in  (2) ,  (3) ,  (6) ,  and  (7) 
exist  in  and  can  be  cited  from  many  other  language-pairs  through- 
out the  world  (Bokamba  1987  &  forthcoming,  Kamwangamalu,  forth- 
coming, but  they  will  not  materially  alter  these  conclusions.  What 
is  particularly  interesting  to  us  are  the  implications  of  these 
facts  for  the  development  of  what  Kaplan  and  Bresnan  (1982)  call 
"a  realistic  linguistic  theory".  We  now  turn  to  a  consideration 
of  this  question. 

3.0  SIGNIFICANCE  OF  CODE-MIXING  TO  LINGUISTIC  THEORY. 
The  facts  presented  in  (2) ,  (3) ,  (6)  and  (7)  raise  very 

interesting  questions  concerning  both  the  syntactic  construction 
of  sentences  in  code-mixed  speech  and  the  mental  representation 
of  the  bilingual 's  grammar.  If  "Externalized  language"  (E- 
language)  is  an  indirect  reflection  of  "Internalized  language" 
(I-language) ,  as  Chomsky  (1986)  suggests,  then  the  occurrence  of 
code-mixed  speech  as  a  world-wide  phenomenon  necessitates  a 
re-examination  of  current  grammatical  theory.  In  this  section  we 
take  up  this  question  and  propose  an  analysis  that  has  important 
consequences  for  linguistic  theory  in  general. 

3.1  Inferences  about  the  bilingual 's  grammar.  Code-mixed 
speech  in  Western  and  non-Western  language  pairs  has  been  assumed 
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to  represent  the  highest  degree  of  bilingual  and  multilingual 
competence  (cf.  Poplack  1982,  Kachru  1982).  This  assumption  is 
based  on  at  least  two  main  facts:  (1)  the  rapidity  and  effort- 
lessness with  which  code-mixed  speech  is  produced  by  bi-  and 
multi-lingual  speakers;  and  (2)  the  morphological  and  syntactic 
integration  exhibited  in  such  speech,  as  seen  in  the  illustra- 
tions given  in  the  preceding  sections.  These  facts  indicate,  as 
is  now  universally  recognized  in  sociolinguistics,  that  code- 
mixed  speech  is  rule-governed.  The  question  that  arises  here  is 
how  to  formally  characterize  such  speech. 

In  one  of  the  current  approaches  to  syntax,  viz.  GB,  it  is 
assumed  that  the  construction  of  sentences  crucially  involves  the 
interaction  of  categorial  and  transformational  rules  (all  lumped 
under  "move  alpha")  mediated  by  phonological  and  semantic  rules 
of  interpretation  (Chomsky  1986,  Riemsdijk  &  Williams  1986).  The 
construction,  in  proceeding  from  the  postulated  D-structure  to 
the  S-structure,  is  assumed  to  be  based  on  a  single  dialect  of  a 
single  language.  Thus  the  various  categorial,  phonological, 
morphological,  syntactic  and  semantic  rules  of  interpretation  are 
drawn  from  one  and  the  same  grammar,  variations  even  from  the 
same  language  are  excluded  from  consideration.  Code-mixed  data 
pose  a  problem  for  this  framework,  because  (a)  they  come  from  at 
least  two  dialects  or  languages,  and  (b)  they  show  intricate 
interaction  between  the  rules  of  the  language-pair  involved  in 
the  discourse.  Given  the  commonly  accepted  hypothesis  that 
speech  or  E-language  is  an  indirect  reflection  of  I-language, 
syntactic  theory  must  therefore  be  able  to  accommodate  code-mixed 
data.  Currently  this  does  not  appear  to  be  possible  within  the  GB 
theory . 

To  characterize  code-mixed  data  it  is  necessary  to  first 
understand  the  internal  relationship  that  obtains  between  the 
grammars  of  the  bi-  and/or  multi-lingual  speaker.  Such  an  under- 
standing can  be  gained  in  the  usual  way  through  inferences  based 
on  an  examination  of  data  produced  by  the  speaker,  for  example 
the  sentences  cited  in  (2),  (3),  and  (7)  above.  From  a  morpho- 
syntactic  perspective  the  salient  characteristics  identified  and 
described  in  section  (2.3)   demonstrate  an  amazing  degree  of 
grammatical  integration  between  the  rules  of  any  language-pair. 
It  is  not  merely  the  insertion  of  lexical  items  from  one  language 
into  another,  but  also  how  these  lexicons  are  treated  by  the 
word- formation  and  syntactic  components  of  the  host  language.  It 
appears  to  us  that  psycholinguistically  in  order  for  the  bi-  and 
/or  multi-lingual  speaker  to  be  able  to  produce  an  understand 
effortlessly  such  utterances,  two  necessary  conditions  must 
obtain:  (1)  (s)he  must  have  internalized  the  grammars  of  the 
languages  concerned,  and  (2)  (s)he  must  be  able  to  access  these 
languages  simultaneously  at  the  moment  of  speaking.  Intuitively, 
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it  seems  that  the  lack  of  one  of  these  capabilities  would 
seriously  hamper  code-mixed  speech  production  and  interpretation. 
The  question  then  is,  how  can  this  bi-Zmulti-lingual  competence 
be  conceptualized? 

Following  Bakamba  (1985),  we  will  assume  here  that  the  bi-/ 
multi-lingual  (code-mixing)  speaker  has  internalized  at  least  two 
grammatical  systems.  These  systems  operate  independently  of  each 
other  in  the  production  and  interpretation  of  monolingual  speech, 
but  interact  in  very  specific  ways  in  the  production  and   inter- 
pretation of  code-mixed  speech.  More  specifically,  in  the  produc- 
tion of  monolingual  speech,  the  speaker  extracts  the  relevant 
syntactic,  lexical  formation,  phonological,  and  semantic  inter- 
pretation rules  from  that  particular  language.  These  rules  may  be 
selected  from  one  of  the  several  dialects  that  the  speaker  has 
internalized,  unless  (s)he  is  producing  mixed  dialect  speech.  In 
the  production  of  code-mixed  speech,  in  contrast,  the  grammars  of 
the  two  languages  involved  in  the  discourse  interact  by  providing 
selected  elements  and  rules  for  the  derivation.  The  selections 
from  each  grammar  are  determined  by  the  host  language  in  the  sen- 
tence under  construction.  For  instance,  if  Lingala  is  the  host 
language  and  French  the  guest  language,  as  in  the  examples  given 
in  (3),  some  lexical  elements  and  possibly  some  categorial  rules 
will  be  extracted  from  the  latter;  but  the  vast  majority  of  these 
will  come  from  the  former. 

3.2  Internal  organization  &  processing  mechanism.  The 
interaction  of  the  bilingual 's  grammars  described  thus  far  raises 
questions  regarding  (1)  the  internal  organization  of  these 
grammars,  (2)  their  inter-relationship,  and  (3)  the  processing 
mechanism (s)  invoked  in  the  construction  of  code-mixed  utterances. 
While  there  have  been  some  attempts  in  the  published  literature 
to  address  the  question  of  processing  mechanism  in  code-mixing  or 
code-switching,  no  published  work  known  to  us  has  dealt  with  the 
issues  of  internal  organization  and  inter-relationship  of  the 
bilingual's  grammars  (Bautista  1975,  Sridhar  &  Sridhar  1980, 
Joshi  1983,  Ewing  1984).  In  this  section  we  take  up  all  three 
questions,  drawing  heavily  on  Bokamba  (1985). 

Bokamba  (1985)  views  the  bilingual's  grammar  as  consisting 
minimally  of  two  independent  grammars  that  may  be  connected  at 
the  appropriate  speech  event  time  through  a  filtering  or  speech 
editing  mechanism,  as  illustrated  in  (16)^. 

Psycholinguistically,  the  filter  or  editor  postulated  in  (16) 
can  be  viewed  both  as  a  monitor  and  trigger  device  for  speech 
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(16)  The  bilingual  grammar  of  a  code-mixer  (Bokamba  1985) 
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processing.  It  serves  as  a  monitor  and  editor  in  the  production 
of  code-mixed  speech  by  ensuring  the  relative  grammatical ity  of 
the  utterances  being  processed  by  the  speaker.  Aspects  of 
sentence  production  such  as  word  order,  lexical  selection,  and 
application  of  phonological  and  morphosyntactic  rules  in  code- 
mixed  speech  is  monitored  closely  by  the  filter.  In  our  example 


37 

sentences,  for  instance,  the  filter  will  make  sure  that  English 
and/or  French  verbs  inserted  in  the  sentence  would  receive  the 
appropriate  Bantu  verb  morphology.  The  filter  functions  as  a 
trigger  mechanism  by  being  sensitive  to  the  context  or  socio- 
linguistic  situation  of  the  discourse.  For  instance,  the  presence 
of  a  bi-  or  multi-lingual  listener  in  the  discourse  may  trigger 
code-mixing,  while  that  of  a  monolingual  would  inhibit  code-mix- 
ing. Similarly,  code-mixing  on  the  part  of  a  listener  will  very 
likely  elicit  a  code-mixed  response  from  another  listener.  When 
the  appropriate  conditions  are  met,  the  filter  will  make  the 
internalized  grammars  accessible  to  the  speaker. 

If  we  assume  the  kind  of  internal  organization  just  described 
and  illustrated  in  (16),  it  becomes  possible  to  conceptualize  a 
syntactic  theory  that  would  allow  the  construction  of  sentences 
from  several  grammatical  sub-systems  or  systems.  Under  this 
conception  of  the  interconnection  of  grammars  it  is  plausible  for 
the  speaker  to  draw  rules  and  lexicons  freely  from  any  one  of 
his/her  languages  or  dialects  in  speech  production.  Whether  the 
model  proposed  in  (16)  is  accurate  or  not,  is  beyond  the  scope  of 
the  present  study  and  will  likely  not  be  ascertain  for  a  long 
time.  What  we  are  hypothesizing  here  is  that  there  is  a  psycho- 
linguistic,  and  probably  a  neurolinguistic,  interconnection 
between  the  (sub-) grammars  internalized  by  bi-  and  multi-lingual 
speakers  so  as  to  permit  them  to  produce  code-mixed  speech  in  the 
manner  they  do. 

If  we  accept  this  characterization  of  the  internal 
organization  of  the  bilingual 's  grammar  and  assume  a  theory  that 
allows  T-rules,  for  example  GB,  the  construction  or  processing  of 
monolingual  sentences  would  proceed  in  the  usual  fashion.  The 
construction  of  code-mixed  sentences,  however,  can  be  viewed  as 
occuring  in  seven  ordered  psycholinguist ic  steps:  (1)  determina- 
tion of  the  D-structure  in  the  host  language;  (2)  selection  and 
insertion  of  the  relevant  lexical  elements  from  LI  and  L2; 
(3)  application  of  the  host  language  lexical  phonology  and 
morphlogy  rules;  (4)  application  of  the  guest  language 
phonological  rules;  (5)  scanning  of  the  structure  to  determine 
syntactic  compatibility  between  the  host  and  guest  languages;  (6) 
application  of  the  relevant  local  adjustment  syntactic  rules  of 
LI  and  L2 ,  and  (7)  application  of  the  relevant  host  or  guest 
language  T-rules  to  the  relevant  clause  (Bokamba  1985:  34) .  Steps 
(1)  through  (4)  and  step  (7)  are  general  in  the  senre  that  they 
are  applicable  also  in  the  construction  of  monolingual  sentences. 
Steps  (5)  and  (6)  are  more  particualr  to  code-mixing  than  they 
are  to  monolingual  speech,  and  constitute  what  we  have  referred 
to  as  the  filtering  or  speech  editing  mechanism. 

It  is  assumed  under  this  conception  of  the  bilingual 's 
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grammar  that  code-mixed  sentences  typically  involve  two  languages 
at  a  time.  In  the  construction  of  the  (code-mixed)  sentence,  the 
speaker  accesses  simultaneously  the  grammar-pair  to  be  drawn 
upon.  The  lexicon  for  the  sentence  is  drawn  freely  but  selective- 
ly from  what  Bokamba  (1985)  regards  as  a  composite  thesaurus 
consisting  of  LI  and  L2  lexical  elements.  The  structure  then 
undergoes  filtering/editing,  i.e.,  steps  5  and  6,  before  being 
subjected  to  the  application  of  the  relevant  morphosyntactic 
rules  in  the  host  language  or  guest  language;  depending  on  the 
clause  concerned.  Step  6  performs  word  order  adjustments, 
whenever  and  wherever  necessary,  to  permit  the  production  of  an 
acceptable  surface  code-mixed  sentence.  The  re-ordering  may 
conform  to  the  host  or  guest  language  constituent  structure 
depending  on  the  constituent  in  question.  As  will  be  noticed  from 
the  Figure  in  (16)  ,  the  T-component  is  not  subject  to  filtering, 
because  there  is  no  evidence  that  transformations  interact  in  the 
construction  of  clauses  (Bokamba,  1987)  .  That  is,  transformation- 
al (a  la  GB)  or  lexical  rules  (a  la  LFG)  appear  to  be  language- 
specific. 

3.3  Conclusions.  The  main  concern  of  this  study  was  to  argue 
for  the  significance  of  code-mixed  data  in  the  formulation  of 
syntactic  theory.  Our  claim  was  that  code-mixed  speech  is  a 
concrete  externalization  of  part  of  the  internalized  knowledge  of 
language  (i.e.,  I-language)  that  a  bi-  and  multi-lingual  speaker 
has.  In  the  course  of  the  discussion  we  examined  a  few  sample 
sentences  involving  code-mixing  and  code-switching,  and  described 
their  salient  morpho-syntactic  characteristics.  The  facts  adduced 
have  shown  clearly  that  code-mixed  sentences  exhibit  extensive 
integration  of  several  of  the  linguistic  components  (i.e., 
lexical,  word- formation,  syntactic)  of  the  language-pair  involved 
in  the  discourse.  We  have  argued  that  code-mixed  data  cannot  be 
accounted  for  under  a  model  such  as  GB,  and  possibly  LFG 
(although  we  have  not  pursued  the  argumentation  in  this  respect 
fully) 5  for  two  main  reasons:  (1)  mono-lingual  and  mono- 
dialectal  data  are  assumed  to  be  only  appropriate  objects  of 
syntactic  investigation  in  these  theories;  and  (2)  there  is  no 
provision,  as  a  result  of  the  first  assumption,  for  a  bilingual 
dictionary  in  the  construction  of  sentences.  To  obviate  these 
difficulties,  we  have  suggested  that  the  linguistic  competence 

of  bi-  and  multi-lingual  speakers  be  viewed  as  involving  a 
network  of  (sub-) grammars  which  interact  under  the  appropriate 
conditions  to  allow  the  production  of  various  sorts  of  speech: 
monolingual,  code-mixed  and  code-switched. 

3.4  Theoretical  implications.  One   of  the  most  commonly 
held  views  about  human  language  is  that  language  is  very 
creative  (Chomsky  1965,  1972,  1981,  1986).  This  creativity  is 
reflected  in  the  production  and  interpretation  of  novel 


39 

sentences,  child  language  acquisition,  and  second  language 
learning.  A  second  popular  view  held  in  generative  grammar  is 
that  language  knowledge  is  mental istic  and  that  there  is  a 
special  innate  language  faculty  which  caters  to  this  knowledge 
(Lenneberg  1967,  Chomsky  1972,  1975,  1986,  Riemsdijk  &  Williams 
1986).  In  light  of  these  views,  the  study  of  language  in  the  last 
3  0  years  has  predominently  focused  attention  on  what  Chomsky 
(1986)  terms  "internalized  language".  As  noted  earlier  in  this 
paper,  however,  the  "I-language"  that  serves  as  the  basis  for  the 
formulation  of  linguistic  theory  is  drawn  from  intuitive 
judgments  of  monolingual  and  mono-dialectal  speakers  (cf.  Chomsky 
1986) .  Except  for  the  work  of  sociolinguists,  especially 
variationists  (cf.  Labov  1972,  Gumperz  1982,  Trudgill  1983, 
Mufwene  1986) ,  multi-dialectal  and  multi-lingual  data  have  rarely 
been  used  as  a  basis  for  theoretical  formulation.  If  the  main 
objective  of  linguistic  theory  is  to  provide  a  characterization 
of  I-language,  "how  it  is  constituted,  how  it  is  acquired,"  and 
"how  it  is  used"  (Riemsdijk  &  Williams  1986:  3),  then  there  is  no 
better  indicator  of  this  knowledge  than  multi-dialectal  and 
multi-lingual  data,  because  they  exemplify  one  of  the  most 
creative  aspects  of  language  knowledge  by  integrating  lexicons 
and  grammatical  rules  from  different  languages  in  the 
construction  of  sentences.  To  process  sentences  from  one  language 
requires  one  type  of  skill  or  level  of  knowledge;  but  to  process 
sentences  from  two  or  more  languages  simultaneously  requires 
special  knowledge  because  of  the  complexity  involved.  This 
special  knowledge,  call  it  ability  2,  can  be  understood  by  ex- 
tending the  domain  of  linguistic  investigation  to  code-mixed  and 
other  multi-lingual  data.  It  seems  to  us  that  the  acceptance  of 
these  facts  as  legimate  objects  of  linguistic  inquiry  will  not 
only  lead  to  better  insights  in  answering  questions  (la-d),  but 
will  also  force  the  theory  to  incorporate  the  types  of  conven- 
tions and  assumptions  suggested  above. 
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*This  is  a  revision  of  a  paper  presented  at  the  17th  Annual 
Conference  on  African  Linguistics  held  at  Indiana  University, 
Bloomington,  April  1987.  We  acknowledge  with  gratitude  the 
comments  of  two  of  SLS  reviewers,  Yamuna  Kachru  and  Michael 
Kenstowicz,  on  the  pre-final  version  of  this  paper.  We  alone  are 
responsible  for  any  error  of  analysis  or  interpretation. 

^See  also  Sridhar  &  Sridhar  1980  for  similar  definitions. 

^For  detailed  criticism  of  this  literature  see  Bokamba  (1985, 
1987) ,  and  for  a  general  review  of  the  published  literature  to 
date  see  Kamwangamalu  (1985,  and  forthcoming) 
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^This  is  a  very  tentative  representation  that  is  undergoing 
revision  to  reflect  currently  shared  tenets  of  GB  and  LFG 
theories  (cf .  Bokamba,  forthcoming) . 

*There  is  considerable  evidence  in  the  code-switching  litera- 
ture which  suggests  that  code-mixing  speakers  treat  lexical  items 
from  their  languages  of  competence  as  if  they  were  synonym  from 
the  same  language. 

^For  a  detailed  discussion  of  the  descriptive  and  theoretical 
problems  posed  by  code-mixing  vis-a-vis  GB  and  LFG,  see  Bokamba 
(forthcoming) . 
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A  COMPUTER  TOOL  IN  THE  STLT>Y  OF  TAIWANESE  TONES  1 
Tsai-Chwun  Du 


This  paper  introduces  a  method  which  incorporates  a  pattern 
matching  algorithm  and  computer  techniques  in  the  investigation  of 
the  inventory  of  Taiwanese  tones.  The  result  turns  out  to  be  fairly 
satisfactory.  From  the  clustering  results  we  postulate  the  underlying 
forms  of  the  short  citation  tone  in  the  dialect  examined. 

0.  INTRODUCTION 

With  the  development  of  digital  computer  techniques,  people  doing  acoustic 
analysis  such  as  Tseng  (1981)  have  been  able  to  use  computers  to  analyze  speech  sounds, 
abstracting  suprasegmental  features  like  stress,  intonation,  and  tone.  Most  researchers,  in 
studying  the  tonal  phenomena,  derive  the  tonal  inventory  of  a  tone  language  through 
perception  or  sonagraph  machine.  Perception  is,  of  course,  more  subjective  than 
objective.  I  do  not  claim  that  it  is  incorrect  to  count  on  perception  in  the  study  of  tones, 
particularly  since  prior  to  the  development  of  computer  technique  or  sonagraph  machine, 
it  was  the  only  method  available.  However,  it  is  not  easy  for  tone-deaf  people  to  perceive 
contrastive  tones.  Sonagraph  machines  have  already  been  available  for  waveform  and 
frequency  display.  However,  in  the  calculation  of  the  fundamental  frequency,  it  is  not 
very  convenient,  e.g.  the  FO  is  usually  obtained  by  dividing  the  10th  harmonic  frequency 
by  10.  With  the  help  of  the  computer  hardware  and  software,  the  FO  can  be  calculated 
automatically  by  computer  program.  In  this  paper,  I  will  introduce  a  tool  that  can  be  used 
to  derive  the  tonal  inventory  of  a  language  without  depending  on  measuring  frequencies 
through  a  sonagraph  machine.  This  tool  is  not  to  replace  the  perception  role  in  the 
investigation  of  tones  as  most  linguists  have  been  very  successful  in  discovering  the  tonal 
inventories  of  different  languages  by  their  perception  only.  It  is  rather  designed  to  show 
that  computer  techniques  can  be  related  to  the  study  of  language  phenomena  in  the  area 
of  computational  linguistics. 

The  tone  language  I  used  for  the  experiment  is  Taiwanese,  a  Chinese  language, 
which  is  spoken  in  Taiwan.2  Basically,  each  syllable  in  Taiwanese  has  two  tones:  the 
citation  form  which  occurs  in  isolation  or  before  a  neutral  tone  and  the  sandhi  form 
which  occurs  in  other  environments^.  Except  for  the  Qingshui  dialect  mentioned  briefly 
by  Ho  (1984)  which  has  six  citation  tones,  all  studies  in  Taiwanese  tones  (R.  Cheng 
1973,  1977,  Tung  et  al  1967,  Zhang  1983)  report  that  there  are  seven  citation  tones.  Of 
the  seven  tones,  five  are  non-entering  Oong)  tones  and  two  are  entering  (short)  tones. 
The  long  tones  are  with  open  syllables  or  syllables  ending  in  nasal  consonants,  e.g.  /si55/ 
'silk',  /si33/  'yes',  /si53/  'die',  /si21/  'four',  /sil3/  'time'  in  R.  Cheng  (1977).  The  short 
tones  are  with  syllables  ending  in  stops  /p,  t,  k,  ?4/,  e.g.  /sit53/  'honest',  /sit21/  'lose'  in 
R.  Cheng  (1977).  In  the  Qingshui  dialect,  there  are  five  long  tones  and  only  one  short 
tone.  I  am  a  native  speaker  of  Taiwanese.  In  this  paper,  I  confirm  Ho's  findings  that  the 
Taiwanese  spoken  in  some  areas  in  Taiwan,  like  Qingshui  dialect,  does  have  five  long 
citation  tones  and  only  one  short  citation  tone.  In  order  to  find  out  whether  there  are 
actually  six  citation  tones,  I  used  a  computer  and  a  pattern-matching  algorithm  to  derive 
the  inventory  of  the  citation  tones  and  the  sandhi  tones.  In  the  examination  of  the  sandhi 
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tones,  we  discover  that  there  are  two  short  tones  underlyingly.  These  two  short  tones  are 
neutralized  in  isolation,  but  the  distinction  remains  before  a  special  tone  category,  the 
neutral  tone,  which  will  be  discussed  in  the  text  later.  In  the  following  sections,  I  will 
first  describe  the  motivation  for  the  experiment.  Then  I  discuss  the  tool  used  in  the 
analysis  of  the  tones:  the  computer  hardware  and  software  used  to  extract  the 
fundamental  frequency  of  each  syllable  and  the  statisucal  algonthm  used  to  classify  the 
tones.  Finally,  I  wili  present  the  statistical  results  from  which  the  tone  inventory  is 
derived. 

1.  MOTIVATION 

The  idea  of  using  computers  and  pattern  matching  algorithm  to  study  tones  came 
from  research  in  Mandarin  tones  done  by  Cheng  and  Sherwood  (1981)  who  extracted  six 
parameters  from  each  syllable  input  through  a  pitch  extraction  device  interfaced  with 
PLATO.  These  six  parameters  are  the  beginning  frequency  (BF),  the  mid  frequency 
(MF),  the  end  frequency  (EF),  the  beginning-end  slope  (BESL),  the  beginning-mid  slope 
(BMSL),  and  the  mid-end  slope  (MESL).  The  six  parameters  are  used  for  tone 
recognition.  The  recognition  result  shows  that  some  parameters  or  combination  of 
parameters  are  better  than  others  in  identifying  tones.  With  the  assumption  that  some 
parameters  are  better  cues  in  tone  recognition  and  can  be  used  to  group  syllables 
produced  with  one  tone  together  within  a  person's  pitch  range,  I  adopted  these  six 
parameters  for  the  use  of  tone  classification.  Furthermore,  I  added  a  seventh  parameter, 
tone  duration  (DUR)  because  tone  length  between  the  long  and  short  tones  is  distinctive 
in  Taiwanese.  The  classification  of  the  citation  tones  is  based  on  a  pattern  matching 
process  called  the  K-means  algorithm  (Macqueen  1965).  This  process  groups  syllables 
that  have  similar  parameter  values  together.  The  average  parameter  values  of  each  group 
represent  one  tone  value.  To  obtain  the  values  of  the  seven  parameters,  I  used  a  speech 
program  to  get  the  fundamental  frequency  of  each  syllable.  The  seven  parameter  values 
were  calculated  and  stored  on  a  file  for  the  subsequent  grouping  process. 

2.  COMPUTER  TOOLS 

2.1  THE  SPEECH  PROGRAM 

In  examining  the  tones,  I  used  a  digital  computer  technique  to  calculate  the 
fundamental  frequency  of  the  speech  input  In  this  section,  I  will  briefly  discuss  the 
structure  of  this  speech  program. 

The  computer  program  used  for  the  speech  processing  of  voice  in  the  pitch 
extraction  experiment  requires  an  electronic  analog-digital  (A/D)  and  digital- analog 
(D/A)  converter.5  The  program  was  written  in  BASIC  and  was  designed  to 

(a)  accept  speech  input,  display  the  waveform,  and  echo  the  sound  back  to  the  user; 

(b)  splice  the  speech  input  in  order  to  extract  monosyllabic  words; 

(c)  calculate  and  plot  the  fundamental  frequency  of  the  speech  input 

With  this  A/D  and  D/A  converter,  the  program  sampled  at  the  rate  of  10  K/sec  for 
male  voices  and  20  K/sec  for  female  voices.o  Using  this  program,  sounds  could  be  input 
through  a  tape  recorder,  the  waveform  displayed  on  the  computer  screen,  and  the  sound 
reproduced.  The  sound  was  1.2  seconds  long  for  male  voices  and  0.62  seconds  long  for 
female  voices. 
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2.2  EXTRACTION  OF  THE  FUNDAMENTAL  FREQUENCIES 

The  fundamental  frequency  estimation  (also  called  pitch  period  estimation)  was 
carried  out  by  using  the  autocorrelation  function  (Rabiner  and  Schafer  1978:141-148), 
the  center  clipping  function  (Sondhi  1968),  and  the  three-level  center  clipping  function 
(Dubnowskj.  Schafer,  and  Rabmer  1976) J  With  these  functions,  the  software  was 
wntten  to  calculate  the  FO  of  a  monosyllabic  input  sound  and  print  the  frequency  on  the 
computer  screen.  Figure  1  shows  the  waveform  and  fundamental  firequency  of  syllable 
/si55/  'silk"  with  high  level  tone  computed  by  the  speech  program. 

The  seven  parameters  were  obtained  in  the  following  way.  The  program  was  also 
written  to  search  for  the  meaningful  beginning  and  end  fundamental  frequency  points  of 
a  syllable.  In  the  process  of  searching  for  the  meaningful  beginning  point,  the  difference 
in  the  fundamental  frequency  between  two  neighboring  points  is  checked  until  a 
difference  in  frequency  not  greater  than  10  Hz  is  found.  TTien  the  leftmost  of  these  two 
points  was  selected  as  the  beginning  point.  Next  the  program  searched  for  the 
meaningful  end  point  from  the  end  of  the  syllable.  The  search  continued  backwards 
(leftwards)  until  a  difference  in  frequency  not  greater  than  10  Hz  was  found.  Then  the 
right  point  was  chosen  as  the  end  point.  Once  the  beginning  and  end  points  were  located, 
the  time  of  each  point  was  calculated.  The  tone  length  was  obtained  by  subtracting  the 
time  of  the  begirming  point  frequency  from  the  time  of  the  end  point  frequency.  The  mid 
point  time  and  frequency  were  determined  as  well.  The  parameter  'duration'  in  our 
experiment  refers  to  tone  duration  rather  than  syllable  duration.  To  get  the  beginning-end 
slope,  the  beginning-mid  slope,  and  the  mid-end  slope,  Cheng  and  Sherwood's  formulae 
(1981)werefoUowed:8 

beginning-end  slope  =  (end  frequency  -  beginning  frequency)  /  (end  time  -  beginning 

time) 
beginning-nud  slope  =  (mid  frequency  -  beginning  frequency)  /  (mid  time  -  beginning 

time) 
mid-end  slope  =  (end  frequency  -  mid  frequency)  /  (end  time  -  mid  time) 

The  seven  parameters  of  each  syllable  were  stored  on  a  file  for  the  purpose  of  similarity 
grouping  discussed  next 

2.3  K-MEANS  ALGORITHM 

The  K- means  algorithm  is  a  process  of  partitioning  large  groups  of  samples  into  K 
sets  on  the  basis  of  specified  values.  MacQueen  (1965:283)  gives  a  good  definition  of 
the  K-means  process.  He  says, 

Let  Zi,  Z2  .  .  .  be  a  random  sequence  of  points  (vectors) .  .  . 
The  K-means  procedure  consists  of  simply  starting  with  K 
groups,  each  of  which  consists  of  a  single  random  point,  and 
thereafter  adding  each  new  point  to  the  group  whose  mean  the 
new  point  is  nearest.  After  a  point  is  addaJ  to  a  group,  the 
mean  of  that  group  is  adjusted  in  order  to  take  account  of  the 
new  point  Thus  at  each  stage  the  K-means  are,  in  fact,  the 
means  of  the  groups  they  represent 

Basically  what  the  K-means  algorithm  does  is  to  cluster  a  body  of  samples  into  K 
number  of  groups,  each  of  which  consists  of  samples  of  similar  X.  X  can  be  anything, 
e.g.  social  values,  distance,  etc.  The  steps  for  clustering  N  samples  are  the  following: 
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Figure  1.  The  waveform  and  fundamental  frequency  of  the  word  /si/  'silk'  transcribed 
with  55  in  citation  form 
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(a)  select  the  number  of  groups,  i.e.  K; 

(b)  select  (randomly)  a  sample  as  the  initial  center  of  each  group;  whatever  value  the 
initially  selected  sample  has  is  considered  initial  mean  value  of  the  group; 

(c)  compute  the  difference  between  the  value  of  any  new  sample  and  that  of  each  group 
center,  then  add  this  new  sample  to  the  group  to  whose  mean  the  new  sample  is 
nearest; 

(d)  recalculate  the  mean  and  go  back  to  itep  c  until  all  the  samples  are  properly 
classified. 

So  "Jvith  a  set  of  raw  data  at  hand,  one  has  to  specify  the  number  of  groups  and  the 
center  values  so  that  the  algorithm  is  able  to  properly  classify  the  data  into  different 
groups.  There  are  several  applications  of  the  K-means  process;  one  of  them  is  "similarity 
grouping'  (MacQueen,  1965:288).  In  my  experiment,  this  K-means  process  is  used  to  do 
similarity  grouping,  i.e.  classify  all  the  speech  data  into  seven  tonal  groups  based  on  the 
distances  among  the  parameters. 

3.  EXPERIMENTS 

As  mentioned,  previous  studies  in  Taiwanese  tones  (R.  Cheng  1977,  Tung  et  al 
1967,  Zhang  1983)  show  that  there  are  seven  citation  tones.  Thus  in  the  very  beginning  I 
will  also  assume  that  there  are  seven  ciution  tones  in  the  data  to  be  examined.  The 
selection  of  groups  is  very  important  that  the  more  groups  are  selected,  the  more  detailed 
the  tone  classification  is.  Without  this  initial  assumption  the  clustering  program  cannot 
perform  any  job.  As  at  most  seven  citation  tones  have  been  reported  in  Taiwanese 
dialect,  so  I  choose  seven.  For  a  language  with  no  literature  reference,  it  is  advisable  to 
divide  the  raw  data  into  groups  ranging  from  2  to  9.  From  the  classification  print  out,  a 
linguist  will  be  able  to  fmd  how  many  possible  tonal  alternations  there  are  in  the 
language  examined  Section  3. 1  deals  with  citation  tones  and  sections  3.2  and  3.3  deal 
with  sandhi  tones. 

3.1  CITATION  TONES 

3.1.1  DATA 

The  speech  data  were  citation  forms,  consisting  of  140  monosyllabic  words 
selected  from  R.  Cheng  (1977)  and  Zhang  (1983),  20  words  for  each  tone.  In  order  to 
differentiate  these  words,  each  syllable  was  transcribed  with  a  numerical  tone  value.  For 
example,  /si55/  'silk'  was  a  syllables  with  high  level  tone.  Here  I  adopt  R.  Cheng's  tone 
system  to  transcribe  diese  data.  In  R.  Cheng  ( 1977),  the  seven  lexical  tones  are  five  long 
tones  /55/,  /53/,  /33/.  /2l/,  /13/  and  the  two  short  tones  /53/  and  /21/.  In  order  not  to  mix 
the  two  long  tones  /21/  and  /53/  with  the  corresponding  short  tones  /21/  and  /53/,  in  the 
clustering  program  later,  I  used  /43/  instead  of  /53/  for  the  long  tone  /53/  and  Hll  instead 
of /21/  for  the  short  tone  /21/.  In  my  experiment,  for  the  long  tones,  /55/  is  used  for  high 
level  tone,  /33/  for  mid  level  tone,  /43/  for  high  falling  tone,  /21/  for  low  falling  tone,  and 
/1 3/  for  rising  tone.9  For  the  short  tones,  /53/  is  used  for  high  falling  tone  and  /22/  for 
low  falling  tone. 

Two  speakers  read  the  data.  They  are  both  bilingual,  native  speakers  of  Taiwanese 
in  their  twenties.  Speaker  A  is  from  the  Gaoxiong  area  and  speaker  B,  the  author,  is  from 
the  Taizhong  area.  The  two  speakers  recorded  the  data  three  times  during  a  three-week 
period.  They  read  the  word  list  at  a  normal  speed.  The  speech  was  recorded  on  a  reel 
tape  using  a  REVOX  PR99  recording  system.  The  recorded  speech  then  was  transferred 
from  reel  to  cassette  using  a  SANYO  RD5030  stereo  cassette  recorder.  Using  the  speech 
program  described  in  section  2.1,  I  obtained  the  fundamental  frequency  of  each 
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monosyllabic  word  in  the  recording.  Seven  parameters  were  extracted  from  each  word 
and  were  stored  on  a  file  for  the  purpose  of  similarity  grouping.  Since  both  speakers  read 
the  data  three  times,  the  final  file  used  for  the  grouping  test  for  each  speaker  was  the 
average  of  the  parameters  obtained  for  each  of  the  speech  words  in  the  three  recordings. 

3.1.2  S[MIL.\R1TY  GROUPING 

All  the  parameters  of  the  280  words  read  by  the  two  speakers  were  normalized  at 
fii^t  widiin  the  range  of  0  to  100.  Thus,  e.g.  die  highest  frequency  of  speaker  A's  data 
(275.67  Hz)  and  the  highest  frequency  of  speaker  B's  data  (248.50  Hz)  were  both  set 
equal  to  100.  The  lowest  frequency  of  speaker  A's  data  (149.00  Hz)  and  the  lowest 
frequency  of  speaker  B's  data  (133.67  Hz)  were  both  set  equal  to  0.  The  remainder  of 
the  frequencies  were  normalized  within  this  range.  10  Since  I  assumed  that  there  were 
seven  citation  tones,  the  program  was  written  to  cluster  the  data  into  seven  groups.  These 
data  were  clustered  on  the  basis  of  the  closeness  of  the  parameters  selected  for  similarity 
grouping.  The  K-means  program  first  selected  one  wonl  from  each  tone  to  be  the  initial 
center  of  each  group.  The  mean  values  of  each  group  depend  on  which  parameters  are 
selected  as  the  clustering  parameters.  For  example,  if  parameters  "beginning  point 
frequency"  (BF)  and  "end  point  frequency"  (EF)  arc  the  clustering  parameters,  then  the 
BF  and  the  EF  of  the  initially  selected  group  center  are  the  mean  values  of  that  group. 
Next  the  BF  and  EF  parameters  of  a  second  word  were  compared  with  those  of  the  seven 
groups.  This  second  word  was  added  to  the  group  whose  mean  values  that  the  BF  and  EF 
of  this  word  were  nearest  to,  i.e.  to  the  group  which  had  the  smallest  distance  between  the 
parameters  of  the  second  word  and  the  mean  values  of  a  group.  The  distance  was 
calculated  based  on  the  pythagorean  theorem;  the  distance  between  two  points  equals  to 
the  square  root  of  the  sum  of  the  length  of  the  sides.  For  instance,  if  I  assume  that  the 
BFs  and  EFs  of  the  center  of  one  of  the  groups  and  the  second  word  are  (xl.yl)  and 
(x2,y2)  respectively,  then  the  distance  between  the  group  and  the  second  word  is  the 
square  root  of  ((xl-x2)2+(yl-y2)2).  Next  a  third  word  was  selected  and  added  to  a  group 
based  on  the  smallest  distance  difference  between  the  parameter  values  and  the  group 
mean  values.  After  aU  the  280  speech  words  were  classified,  new  means  were  obtained 
for  each  group  by  first  adding  the  clustering  parameter  value  of  each  word  collected  in  a 
group  and  then  dividing  the  sum  over  the  number  of  words  in  this  group.  For  example,  if 
BF  and  EF  are  the  clustering  parameters,  then  the  BF  values  of  all  the  words  in  the  group 
are  added  together,  then  divided  by  the  number  of  the  total  words  in  the  group.  Similarly 
die  EF  values  of  all  the  words  are  added,  and  the  result  is  divided  by  the  total  number  of 
the  words  in  the  group.  These  two  newly  averaged  BF  and  EF  values  become  the  new 
center  mean  values.  With  these  new  means,  all  the  data  were  classified  again.  This 
recalculation  of  group  mean  values  and  reclassification  was  repeated  until  the  last 
recalculated  mean  values  were  die  same  as  die  previous  ones.  This  means  that  the  words 
clustered  together  in  each  group  are  stable.  Words  clustered  in  the  same  group  can  be 
said  to  have  similar  parameter  values.  The  program  then  printed  out  the  number  of  words 
and  the  words  in  each  group.  Since  die  grouping  can  be  based  on  just  one  parameter,  two 
parameters,  or  up  to  seven  parameters,  there  are  in  total  127  combinations  of  the  seven 
parameters.  From  this  print-out,  it  is  generally  easy  to  tell  which  parameter  or 
combinations  of  parameters  can  best  distinguish  among  these  seven  tones.  In  the  next 
section,  I  will  discuss  what  the  statistics  can  tell  us  about  the  citation  tones  in  Taiwanese. 

3.1.3  RESULT  AND  DISCUSSION 

In  total,  there  are  127  classification  sets  for  the  280  experimental  syllables  based  on 
the  combination  of  the  seven  parameters.  Among  these  127  sets,  I  picked  one 
classification  shown  in  Table  1  based  on  four  parameters:  the  beginning  frequency,  the 
mid  frequency,  the  end  frequency,  and  the  tone  duration  for  discussion.  The  combination 
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of  these  four  parameters  identified  the  citation  tones  besL^^    In  Table  1,  I  observe  the 
following: 

The  citation  forms  of  39  syllables  transcribed  with  /55/  are  collected  in  group  1. 
The  citation  forms  of  40  syllables  transcribed  with  43/  arc  collected  in  group  2.  The 
citation  forms  of  40  syllables  transcribed  with  '33/  are  collected  in  group  3.  The  citation 
forms  of  35  syllables  transcnbed  with  tone  22/  and  35  syllables  transcribed  with  tone 
/53/  arc  collected  in  group  4.  The  citation  forms  of  18  syllables  transcribed  with  /21/  are 
collected  in  group  5  and  the  remaining  22  syllables  are  collected  in  group  6.  The  citation 
forms  of  40  syllables  transcribed  with  tone  /1 3/  are  collected  in  group  7. 

The  first  three  center  mean  values  in  each  group  in  Table  1  are  the  normalized 
average  frequency  values  for  the  beginning,  the  mid,  and  the  end  points.  I  convert  each 
of  these  three  mean  values  into  the  five-point  scale  tone  system  (Chao  1930).  The 
normalized  highest  mean  value  86.054  is  interpreted  as  5  and  the  lowest  normalized 
mean  value  11.351  is  interpreted  as  1.  The  pitch  range  from  5  to  1  is  derived  by  first 
dividing  the  difference  between  86.054  and  11.351  by  5,  making  an  interval  of  14.941. 
Next  I  subtract  14.941  from  86.054,  which  is  the  range  for  5.  Again  I  subtract  14.941 
from  71.112,  which  is  the  range  for  4,  etc.  The  mean  value  range  for  each  point  in  the 
scale  is  summarized  in  Table  2. 


Table  1.  Citation  tone  grouping  using  parameters  BF,  MF,  EF,  DUR 


group # 

12            3            4            5            6            7 

group  mean 

BF 

84.971    84.632    65.682    61.350    53.457    50.476    37.414 

values 

MF 

78.991    60.900    56.900    50.626    36.214    26.290    22.707 

EF 

86.054    26.712    63.771    33.835    18.419    11.351    48.996 

DUR 

70.333    37.480    73.799    12.928    17.281    48.411    78.042 

transcrip- 

number of  syllables  in  one  of  the  seven  tones  collected  in 

tion  tones 

each  group 

55 

39            0             1             0            0            0            0 

33 

0            0          40            0            0            0            0 

43 

0          40            0            0            0            0            0 

21 

0            0            0            0           18           22            0 

13 

0             0             0             0             0             0           40 

22 

0            0            0          35             5            0            0 

53 

0             0             4           35              1             0             0 

total  number 

39           40           45           70           24           22           40 

ofsyUables 

in  each  group 

BF:  the  beginning  frequency;  MF:  the  mid  frequency;  EF:  the  end  frequency; 

DUR:  tone  duration 

The  decimal  numbers  beneath  each  group  are  the  group  center  mean  values. 
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Table  2.  Tone  letter  assigned  to  each  group  in  Table  1 


numerical 

system 

center  mean 
values 

values 

from 

86.054 

group  1     55 

to 

71.113 

group  2     52 

from 

71.112 

group  3     44 

to 

56.172 

group  4     42 

from 

56.171 

group  5     31 

to 

41.231 

group  6     31 

from 

41.230 

group  7  213 

to 

26.291 

from 

26.290 

to 

11.351 

Using  Table  2, 1  can  see  that  the  experimental  clustering  groups  represent  six  tone 
values:  /55/,  /52/,  /44/,  /42/,  /31/,  and  /213/.  Syllables  transcribed  with  /21/  distributed  in 
two  groups  5  and  6,  but  they  were  all  produced  with  tone  /31/.  The  reason  that  the  18 
syllables  transcribed  with  /21/  in  group  5  were  differentiated  from  the  remaining  22 
syllables  transcribed  with  /21/  in  group  6  is  that  group  5  syllables  were  produced  with 
shorter  tone  durations.  However,  this  discrepancy  will  not  influence  the  statistical  result 
Group  4  contains  most  of  syllables  in  the  short  tones,  which  suggests  that  phonetically, 
there  is  only  one  short  tone.  Later,  in  the  examination  of  sandhi  tones,  this  short  tone  was 
found  to  have  two  sandhi  forms.  Another  noteworthy  item  is  that  the  duration  mean 
values  tell  us  that  syllables  produced  in  /42/  tone  are  comparatively  shorter  than  syllables 
produced  in  other  tones.  The  correspondence  between  the  tone  letters  in  the  transcription 
and  the  computed  citation  tones  from  Table  2  is  listed  in  Table  3.  It  is  clear  that  the 
Taiwanese  dialect  R.  Cheng  (1977)  studies  has  different  tone  values  from  my 
experimental  data. 

Table  3.  Transcription  tones  and  computed  citation  tones 


transcnpuon 

computed  citation  | 

tone 

55 

< — > 

55 

< — > 

44 

< — > 

52 

< — > 

31 

< — > 

213 

< — > 

42 

22 

< — > 

42 

< — >  means  'corresponds  to' 

After  we  obtain  the  different  citation  tones,  we  will  examine  the  sandhi  tones  that 
each  citation  form  has  in  the  sandhi  environment. 
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3.2  SANDHI  TONfES 

3.2.1  DATA 

We  made  up  l-iO  two-syllable  phrases,  each  of  which  consisted  of  a  monosyllabic 
word  used  m  the  previous  analysis  followed  by  a  second  monosyllabic  word.  There  were 
20  phrases  for  each  tone.  For  example,  /tang55  t'i~55/  winter",  /to33  1143/  reason', 
to-ng43  p'ai2l/l2  "poliucal  party',  /si21  tsap53/  'forty',  /tol3  tsau43/  'escape',  /kut53 
to21  \  "slippery... fall',  and  101122  t'aul3/  'bone'.  As  before,  all  the  monosyllabic  words 
in  the  phrases  were  transcribed  with  the  tone  letters  used  in  the  previous  experiment.  The 
tlrst  syllable  of  each  phrase  was  analyzed  by  the  speech  program.  The  seven  parameters 
were  extracted  and  stored.  All  the  parameters  were  normalized  between  100  and  0. 
Then,  based  on  the  combination  of  these  seven  parameters,  these  280  tokens  were 
classified  by  the  K-means  program  in  the  same  way  as  described  in  section  3. 1.2. 

3.2.2  RESULT  AND  DISCUSSION 

One  hundred  and  twenty-seven  classification  sets  were  obtained.  It  was  found  that 
parameters  related  to  mid  point  frequency  or  slope  do  not  play  an  important  role  in  the 
two-word  phrase  classification.  The  best  classification,  then,  is  shown  in  Table  4,  based 
on  BF,  EF,  and  DUR.  Again,  the  normalized  mean  beginning  and  end  frequency  values 
for  BF  and  EF  of  all  seven  groups  were  converted  into  the  numerical  equivalents.  These 
converted  tone  values  from  Table  4  are  listed  in  Table  5. 


Table  4.  Sandhi  tone  grouping  using  parameters  BF,  EF,  and  DUR 


group # 

1 

2            3            4            5             6 

7 

group  mean 
values 

BF 
EF 
DUR 

88.616 
35.576 
42.987 

76.080    71.406    50.374    48.487    33.973 

59.557    67.386    46.753    41.157     18.947 

7.040    64.305    73.917    42.361     12.320 

32.479 
15.878 
56.782 

transcrip- 
tion tones 

number  of  syllables  in  one  of  the  seven 
tones  collected  in  each  group 

55 
33 
43 
21 
13 
22 
53 

0 
0 
0 
37 
0 
8 
0 

0            3           13           22             1 
0            0            0            0           15 
0           31             9            0            0 
10            0            0             1 

0  0           14           18             1 
29            0            0            2             1 

1  0            0            0           34 

25 
0 
I 

7 
0 

5 

total  number 
ofsyUablcs 
in  each  group 

45 

31           34           36          42           53 

39 

BF:  beginning  frequency;  EF:  end  frequency;  DUR:  tone  duration 
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Tabic  5.  Converted  group  tones  in  Tabic  4 


group  1 

52     i 

group: 

54     i 

group  3 

44 

group  4 

33 

groups 

32 

group  6 

21 

group? 

21     1 

1 

In  Table  4,  some  syllables  transcribed  with  22  and  53  had  sandhi  forms  with  long 
tone  durations.  I  will  discuss  this  later.  At  this  moment,  I  will  ignore  syllables 
transcribed  with  short  tones  which  are  long.  According  to  Tables  4  and  5,  I  have  the 
following  sandhi  rules: 

(1)  Syllables  transcribed  with  55  and  13  were  produced  in  33  or  32  tone,  e.g.  /si55/ 
'poem'  in  [si33  lin213]  'a  poet'  and/sil3/  'time'  in  [si33  kan55]  'time'.  The 
difference  between  33  and  32  is  not  phonologically  important 

(2)  Syllables  transcribed  with  43  were  produced  in  44  tone,  e.g.  /si43/  'to  die'  in  [si44 
lang213]  'a  dead  person'. 

(3)  Syllables  transcribed  with  21  were  produced  in  52  tone,  e.g.  /si21/  'four'  in  [si52 
tsap42]  'forty'. 

(4)  Syllables  transcribed  with  33  were  produced  in  21  tone,  e.g.  /si33/  'yes'  in  [si21 
hui55]  'yes- no'. 

(5)  Syllables  transcribed  with  53  were  produced  in  21  tone,  e.g./sit53/  'solid'  in  [sit21 
ue33]  'truth'. 

(6)  Syllables  transcribed  with  22  were  produced  in  tone  54,  e.g.  /sit22/  'to  lose'  in  [sit54 
k'i3l]  'lose'. 

Using  the  derived  citation  tones  from  the  first  experiment,  I  rephrase  the  observed 
sandhi  in  (1)  through  (6)  in  Table  6. 


Table  6.  Computed  citation  and  sandhi  tones  in  Taiwanese 


citauon  tones 

sandhi  tones 

55,213 

=> 

33/32 

52 

=> 

44 

44 

=> 

21 

31 

=> 

52 

42(short) 

=> 

54(short) 

42(short) 

=> 

21  (short) 

=>  means  the  tone  at  the  left 
becomes  the  tone  at  the  right 


In  total,  there  are  six  sandhi  tones.  Clearly,  citation  tone  /42/  has  two  sandhi  forms 
/54/  and  I2\l.  I  would  like  to  ask  'is  there  only  one  underlying  short  tone  which  takes 
two  sandhi  forms?'  If  the  historical  development  of  tone  system  in  Chinese  is  taken  into 
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consideration,  underlyingly  there  arc  two  short  tones.  Historically,  the  voicing  split  of 
the  syllable- initial  consonants  in  Chinese  increased  the  tone  inventory  from  four  tones  to 
eight  tones.  Each  of  the  middle  Chinese  four  tones.  Ping,  Shang.  Qu.  and  Ru  split  into 
Yin  and  Yang,  with  Yin  having  high  tone  and  Yang,  low  tone.  From  a  diachromc  point 
of  view,  there  should  be  two  shon  tones.  This  distinction  is  found  in  the  sandhi  forms  but 
not  m  the  citation  forms  m  our  data.  I  can  postulate  that  underlyingly  there  were  two 
short  tones,  and  later  m  the  language  development  process,  the  two  short  tones  were 
neutralized  in  isolation  in  our  data.  Thus  the  sandhi  forms  serve  as  a  criterion  for  the 
determination  of  the  underlying  short  tones.  Furthermore,  if  there  is  only  one  underlying 
short  tone,  it  would  be  very  difficult  to  predict  when  a  syllable  in  this  tone  has  a  /21/  or 
'54/  sandhi  firom  as  the  sandhi  environment  is  the  same.  Because  the  distinction  is 
maintained  in  the  sandhi  forms.  Ho  (1984)  who  studies  Qingshui,  a  Taiwanese  dialect 
that  has  one  citation  short  tone  and  two  sandhi  short  forms,  proposes  that  the  sandhi 
forms  are  actually  the  underlying  forms  and  the  citation  form  is  a  derived  form.  This 
postulation  simplifies  the  prediction  of  sandhi  forms.  If  the  citation  form  is  the 
underlying  form,  it  would  be  difficult  to  predict  when  it  is  changed  into  one  tone  and 
when,  the  other.  On  the  other  hand,  if  the  sandhi  forms  are  the  underlying  forms,  then 
the  two  tones  are  neutralized  in  isolation.  Nevertheless,  I  cannot  adopt  Ho's  view  to 
assume  that  in  my  data  the  sandhi  forms  are  the  underlying  forms.  The  evidence  comes 
from  another  experiment  of  the  tonal  patterns  before  the  neutral  tone.  Before  I  discuss 
the  neutral  tone  experiment,  I  will  devote  a  small  section  to  the  'tone  length'  issue  as 
firom  the  direction  that  a  short  tone  becomes  a  long  tone,  I  obtain  crucial  evidence 
towards  the  establishment  of  the  tone  inventory  in  Taiwanese. 

3.3.  TONE  DURATION 

As  noticed  in  Tables  1  and  4,  when  tone  duration  was  one  of  the  clustering 
parameters,  some  syllables  transcribed  with  short  tones  both  in  the  citation  and  sandhi 
enviroimients  were  collected  with  syllables  transcribed  with  long  tones.  Checking  the 
average  values  of  tone  duration,  I  see  that  these  syllables  were  collected  in  groups  with 
large  duration  values.  This  indicates  that  these  syllables  were  not  produced  with  short 
tones  but  rather  with  long  tones.  In  Table  7  I  list  some  of  these  syllables  and  their  tone 
durations  in  real  time. 


Table  7.  Syllables  transcribed  with  short  tones  produced  long 


citation  context 

sandhi  context 

o?53        'study'      224  ms 
tsio53     'stone'       179  ms 
guat53    'moon'      159  ms 

tsio53        'stone'       149.00  ms 
ti^2         'drop'        137.65  ms 
gik53        'jade'        142.33  ms 

R.  Cheng  (1977)  mentions  that  some  syllables  ending  in  the  glottal  stop  drop  the 
final  consonant  and  are  produced  with  a  long  tone  in  the  sandhi  environment,  though  they 
still  keep  the  same  short  sandhi  tone  values  as  those  syllables  that  do  not  drop  the  final 
consonants.  In  our  data,  not  only  /?/  but  other  final  consonants  are  dropped  in  both 
isolation  and  sandhi  environments  for  some  syllables.  These  syllables,  after  losing  the 
final  stop  consonants,  have  different  tone  values.  For  instance,  as  shown  in  Tables  1,  3, 
4,  and  5,  five  syllables  transcribed  with  tone  /22/  in  isolation  context  had  a  /31/  tone 
rather  than  a  /42y  tone,  e.g.  /a?22/  — >  [a31]  'duck',  and  in  the  sandhi  context  eight 
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syllables  had  a  ISAI  sandhi  tone,  e.g.  [a54  bu52]  'female  duck';  four  syllables  transcribed 
with  tone  /53/  in  the  isolation  context  had  a  .44/  tone.  e.g.  /pe?53/  — >  [pe44]  "white"  and 
in  the  sandhi  context  five  syllables  had  a  21/  sandhi  tone,  e.g.  [pe2l  sik421  'white  color'. 
Here  is  a  siniation  where  the  short  tones  are  beginning  to  disappear  and  redistribute 
themselves  among  the  long  tones.  These  syllables  are  just  the  first  few  to  undergo  this 
change.  Furthermore,  after  a  syllable  loses  its  final  stop  consonant,  it  has  either  a  long 
31  or  a  long  /44/  citation  tone,  which  has  either  a  /52/  or  a  /21/  sandhi  tone.  This 
observation  helps  me  in  the  determination  of  the  underlying  forms  of  the  short  citation 
/42/  tone  as  discussed  in  the  next  two  sections. 

3.4  TONAL  PATTERNS  BEFORE  THE  NEUTR.\L  TONT 

In  Taiwanese,  in  addition  to  full  tones,  there  is  another  tone  category  called  the 
neutral  tone  (R.  Cheng  1968,  1973,  1977,  Zhang  1983).  The  distribution  of  the  neutral 
tone  is  in  general  limited  and  predictable,  so  it  is  not  regarded  as  a  phonemic  tone  (Tung 
et  al  1967).  There  are  several  environments  where  the  neutral  tone  can  occur  and  some 
turn  out  to  be  idiosyncratic  13  (R.  Cheng  1968,  1973,  1977).  Any  syllable  produced  in 
neutral  tone  loses  its  original  full  tone  and  has  a  low  falling  tone  like  /21/.  For  example, 
pronouns  at  sentence  final  object  position,  e.g.  /gua52/  T  in  [i33  ma-33  guall]  *he 
scolded  me'  and  directional  verbs  used  as  verb  complement,  e.g.  [kue31  lai213]  'pass 
come'  in  [tan31  kue21  lai21]  'throw  over  here'  have  the  neutral  tone.  Syllables 
immediately  preceding  the  neutral  toned  syllables  have  been  observed  not  to  undergo  any 
tone  sandhi  (R.  Cheng  1968,  1973,  1977).  I  collected  42  phrases  with  final  neutral  toned 
syllables  and  these  phrases  were  read  by  the  same  two  speakers.  The  full  toned  syllable 
immediately  preceding  the  neutral  tone  in  each  phrase  was  used  in  the  K-means  program 
and  these  syllables  were  transcribed  with  the  computed  citation  tones  in  the  citation  tone 
experiment.  Short  /42/  tone  with  a  /54/  sandhi  tone  is  represented  by  /42(hs)/  and  short 
/42/  tone  with  a  /21/  sandhi  tone  is  represented  by  /42Gs)/,  where  's'  stands  for  short  and 
'h'  and  '1'  stand  for  high  and  low  respectively.  Unfortunately,  four  of  the  neutral  toned 
syllables  spoken  by  speaker  A  had  very  low  amplitude  from  which  the  fundamental 
frequency  could  not  be  extracted.  So  in  total  only  80  tokens  were  used  in  the  clustering 
program.  The  clustering  result  based  on  the  combination  of  BF  (the  beginning  point 
frequency),  MP  (the  mid  point  frequency),  EF  (the  end  point  frequency),  BESL  (the 
beginning  to  end  slope),  BMSL  (the  beginning  to  mid  slope),  and  MESL  (the  mid  to  end 
slope)  and  the  converted  group  tone  values  are  presented  in  Table  8. 

I  list  the  citation  tones  and  the  tones  occurring  before  the  neutral  tone  derived  in 
Table  8  above.  The  long  citation  tones  /55/,  /44/,  and  /31/  were  produced  at  lower 
pitches  /44/,  /33/,  and  /21/  respectively.  An  examination  of  the  normalized  frequency 
values  shows  that  syllables  of  tone  /52/  started  at  a  much  higher  frequency  than  syllables 
of  tone  /55/.  The  beginning  point  frequency  (BF)  of  syllables  of  tone  /52/  was  interpreted 
as  5;  however,  the  BF  of  syllables  of  tone  /55/  did  not  fall  within  the  acoustic  range  for  5, 
and  the  BFs  of  syllables  of  tones  /44/  and  /3l/  did  not  fall  within  the  acoustic  ranges  for  4 
and  3  respectively.  Furthermore,  as  these  syllables  were  preceded  by  another  syllables,  it 
is  possible  that  the  preceding  syllable  tone  might  have  affected  the  frequency  lowering  of 
these  syllables.  Besides,  the  method  I  used  to  do  the  conversion  sets  an  absolute  rather 
than  relative  range  for  the  frequency  values.  As  a  result,  syllables  of  citation  tones  /55/, 
/44/,  and  /31/  had  lower  frequencies  before  the  neutral  tone  than  they  had  when  produced 
in  isolation.  There  is  phonetic  difference  but  phonologically  there  is  no  {^fference 
between  the  citation  form  and  the  form  before  the  neutral  tone.  I  would  like  to  assume 
that  phonologically  syllables  of  these  three  citation  tones  /55/,  /44/,  and  /31/  were 
produced  with  the  same  tones  before  the  neutral  toned  syllables.  Thus  citation  tones  /55/, 
/44/,  and  731/  still  remain  unchanged  before  the  neutral  tone. 
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Table  8.  Tone  grouping  of  the  syllables  before  neutral  tone  using  BF,  MF,  EF,  BESL, 
BMSL,  and  MESL 


i  group  # 

12             3             4             5             6                  i 

group  mean    |  BF 

98.554    76.552    61.476    61.412    41.954    40.343 

values 

MF 

65.182    76.552    42.427    51.004    29.770    42.409 

EF 

41.185    71.724    31.704    51.114    24.943    65.801 

BESL 

10.278    60.371     25  201    51.564    43.681     84.353 

BMSL 

25.977    85.909    37.741    61.913    59.590    86.253 

MESL 

17  227    41.039    30.325    55.075    37.193    82.255 

transcrip- 

number of  syllables  in  one  of  the  seven  tones  collected  in 

tion  tones 

each  group 

55 

0            8            0            10            0 

44 

0            0            0           11             10 

52 

12            0            0            0            0            0 

31 

0            0            1            0           10            0 

213 

0            0            1             1             0           10 

42(hs) 

0            0           10            0            2            0 

42(ls) 

0            0            1           11             0            0 

total  number 

12            8           13          24           13           10 

ofsyUables 

in  each  group 

converted 

52           44           32           33           21           24 

tone  values 

BF:  the  beginning  frequency;  MF:  the  mid  frequency;  EF:  the  end  frequency 

BESL:  the  beginning  to  end  slope;  BMSL:  the  beginning  to  mid  slope;  MESL:  the  mid 

to  end  slope 


citation 

computed 

tones 

tones 

55         -> 

44 

44     -— > 

33 

52     -— > 

52 

31     -"-> 

21 

213     -— > 

24 

42     — > 

32 

33 

As  for  the  short  tones,  10  syllables  of  tone  /42(hs)/  had  tone  /32/  before  the  neutral 
tone,  and  eleven  syllables  of  tone  /42(ls)/  were  produced  with  tone  /33/  before  the  neutral 
tone.  This  means  that  in  both  speakers  A's  and  B's  speech,  citation  tone  /42/  has  two 
phonetic  forms,  a  mid  level  /33/  and  a  low  falling  /32/,  before  the  neutral  tone.  In  the 
sandhi  context,  the  citation  /42/  tone  had  two  sandhi  forms  /54/  and  /21/.  To  assume  that 
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there  is  only  one  underlying  short  tone  could  not  explain  why  this  tone  would  have  two 
different  forms  in  the  above  situations.  Thus  I  proposed  that  there  were  two  underlying 
short  tones.  At  that  moment.  I  did  not  know  what  the  phonemic  representation  of  these 
two  tones  would  be.  Since  other  tones  like  /55/,  '44/,  /52/.  and  '31/  do  not  undergo  tone 
sandhi  before  the  neutral  tone,  it  would  be  less  reasonable  to  expect  only  short  tones  to 
undergo  tone  sandhi.  In  this  neutral  tone  experiment,  two  different  phonetic  forms,  a  mid 
level  /33/  and  a  mid  falling  /32/  tones,  were  found  with  the  short  toned  syllables.  Since 
tones  do  not  undergo  sandhi  before  the  neutral  tone,  the  two  phonetic  tone  values  '33/ 
and  /32/  shown  on  the  short  toned  syllables  can  be  said  to  be  the  surface  forms  of  the  two 
underlying  short  tones  from  which  the  sandhi  tones  are  derived.  One  of  the  underlying 
forms  of  the  short  tones  is  a  nud  level  tone  and  the  other  is  a  mid  falling  tone.  These  /33/ 
and  /32/  tones  are  different  from  the  sandhi  /33/  and  '32/  tones  derived  from  the  /55/  and 
/213/  citation  tones  as  the  former  are  perceptually  different  and  the  latter  are  perceptually 
the  same.  Besides,  the  pitch  of  the  short  /32/  and  /33/  tones  could  be  lowered  because  of 
the  preceding  syllable  tone  in  speech  production  just  like  the  1551,  /44/,  and  /31/  citation 
tones.  These  two  short  tones  could  have  started  at  a  higher  pitch.  If  Ho's  claim  (1982) 
that  the  sandhi  tones  are  the  underlying  tones  was  valid,  I  should  have  found  tones  154/ 
and  /21/  for  the  short  toned  syllables  before  the  neutral  tone.  However,  the  result  from 
this  experiment  proves  that  as  far  as  our  data  are  concerned,  the  general  sandhi  forms  are 
not  the  underlying  forms. 

Finally,  syllables  of  citation  tone  /213/  had  a  /24/  tone.  Since  all  other  tones  remain 
unchanged  when  followed  by  the  neutral  tone,  I  propose  that  /213/  is  actually  a  rising 
tone  phonologically.  The  falling  effect  in  /213/  when  it  was  produced  in  isolation  may  be 
due  to  the  fact  that  it  is  more  difficult  for  vocal  cord  to  start  at  the  lowest  pitch  in  a 
person's  pitch  range.  As  a  result,  this  tone  is  produced  at  a  higher  beginning  pitch  then 
dropped  down.  Maddieson  (1976)  states  that  quite  often  the  'extreme'  tones  of  a  system 
are  preceded  by  a  central  onglide.  For  example,  a  /1 1/  tone  is  produced  as  /2l/  and  a  /55/ 
tone  is  produced  as  /45/.  As  to  whether  this  rising  tone  is  a  high  rising  or  a  low  rising 
tone,  the  answer  will  be  given  in  section  4. 

As  the  tone  before  the  neutral  toned  morphemes  does  not  undergo  any  change,  e.g. 
/au44/  'after'  in  [au44  IU21]  'the  day  after  tomorrow',  I  take  the  phonetic  forms  before 
the  neutral  tone  as  the  basis  for  postulating  the  underlying  tones.  Based  on  these  forms  I 
establish  the  tone  inventory  in  Taiwanese.  In  order  to  facilitate  the  tone  transcription,  I 
will  still  use  the  number  system  to  transcribe  the  tones.  Thus  I  also  set  up  a  unified 
transcription  system  even  though  the  acoustic  variation  is  evident  in  the  experimental 
data. 

4.  TONE  INVENTORY  IN  TAIWANESE 

The  high  level  and  mid  level  tones  were  calculated  as  /55/  and  744/  in  the  citation 
tone  experiment,  and  744/  and  733/  in  the  neutral  tone  experiment  respectively.  The 
reason  is  that  high  falling  tone  usually  starts  at  a  higher  pitch  than  a  high  level  tone.  If 
the  pitch  of  the  high  falling  tone  is  considered  to  be  5  in  the  calculation,  then  the  pitch  of 
a  high  level  tone  might  fall  into  the  range  of  4  and  that  of  a  mid  level  might  fall  into  the 
range  of  3  depending  on  how  high  the  high  falling  tone  starts.  The  lowering  in  the 
neutral  tone  environment  could  be  influenced  by  the  preceding  low  toned  syllables.  So 
the  difference  is  only  a  matter  of  acoustic  variation.  In  order  to  make  these  two  tones 
look  more  distinctive  from  each  other,  7557  is  chosen  for  high  level,  and  7337  for  mid 
level.  They  have  general  sandhi  forms  7337  and  7217  respectively  in  the  sandhi  tone 
experiment,  and  therefore,  7337  and  7217  are  used  to  represent  the  sandhi  forms.  The  high 
falling  tone  always  had  a  7527  phonetic  form  in  the  citation  and  neutral  tone  experiments, 
thus  7527  is  used  to  represent  the  underlying  form  of  diis  tone.  Two  phonetic  forms  7317 
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and  llM  were  obtained  with  the  long  low  falling  tone.  Since  this  low  falling  tone  is 
perceptually  identical  to  the  general  sandhi  long  low  falling  tone  /21/  of  tone  /33/ 
obtained  in  the  general  tone  sandhi  experiment.  I  will  assign  /21/  to  this  low  falling  tone. 
The  general  sandhi  form  of  underlying  '52/  tone  was  calculated  as  44/  which  is  distinct 
from  the  '33/  sandhi  form  of  the  underlying  55/  tone  in  the  sandhi  environment.  I  have 
mentioned  above  that  because  of  the  extra  high  of  a  high  falling  tone,  in  the  conversion 
process  a  high  level  tone  can  have  a  '44/  rather  than  /55/  value  if  its  beginning  frequency 
does  not  fall  into  the  frequency  range  of  5.  This  calculated  /44/  is  higher  than  the  33/ 
general  sandhi  form  of  underlying  /55/  tone  and  is  considered  a  high  level  tone,  so  I 
decide  to  use  /55/  instead  of  /44/  to  represent  the  high  sandhi  level  tone.  The  general 
sandhi  form  of  low  falling  tone  /21/  is  /52/  in  the  general  tone  sandhi  experiment  and  I 
use  /52/  to  represent  its  sandhi  form. 

In  the  citation  tone  experiment,  I  obtained  a  falling-rising  contour  tone  /213/.  But 
syllables  in  this  tone  before  the  neutral  tone  were  calculated  to  have  a  /24/  tone.  Since  I 
treat  the  tones  before  the  neutral  tone  as  the  basis  for  the  underlying  forms,  I  say  that  the 
underlying  form  of  this  falling-rising  contour  tone  is  a  rising  tone.  The  initial  falling  is 
due  to  the  articulatory  factor  that  this  tone  starts  at  a  low  pitch,  thus  it  is  possible  to  have 
a  gliding  contour  in  the  beginning.  However,  it  is  somewhat  difTicult  to  determine 
whether  the  rising  tone  is  a  low  rising  /1 3/  or  a  high  rising  /24/.  In  Taiwanese,  the  first 
syllable  of  a  triple  adjective  reduplication  phrase  must  have  a  high  rising  /3S/  tone,  e.g. 
[ki35  ki33  ki2l3]  'extremely  strange'.  If  the  underlying  rising  tone  of /213/  is  perceived 
to  be  the  same  as  this  sandhi  tone  /35/,  I  can  postulate  the  rising  tone  to  be  high; 
otherwise,  it  is  low.  Thus  I  did  a  perceptual  test  asking  14  native  speakers  of  Taiwanese 
to  identify  whether  the  rising  sandhi  tone  of  the  first  adjective  syllable  sounded  the  same 
as  the  rising  tone  of  a  third  syllable  in  the  triple  adjective  reduplication  phrases.  The  test 
utterances  consisted  of  80  pairs  of  monosyllabic  words.  The  two  words  in  each  pair 
could  be  the  same,  i.e.  they  both  were  the  extracted  first  syllables  of  a  triple  adjective 
reduplication  phrase,  e.g.  [Iiau35]  'wrinkled',  or  they  both  were  the  extracted  third 
syllables  of  a  triple  adjective  phrases  whose  citation  tone  was  /213/,  e.g.  [Iiau213] 
'wrinkled*,  or  one  of  the  two  words  was  the  first  syllable  and  the  other  was  the  third 
syllable,  e.g.  [Iiau35]  and  [liau213].  These  80  pairs  were  randomized  in  the  test 
utterances  and  the  subjects  were  asked  to  judge  whether  the  two  words  in  each  pair 
sounded  the  same,  or  the  first  one  was  higher  and  the  second  one  was  lower,  or  the  first 
one  was  lower  and  the  second  one  was  higher.  The  result  shows  that  of  the  1120 
answers,  1119  indicated  that  the  first  syllable  sandhi  /35/  form  was  higher  than  the  third 
syllable  citation  /213/  form.  According  to  the  perception  result,  I  propose  the  rising  tone 
to  be  a  low  rising  /13/  underlyingly. 

Of  the  two  short  tones,  one  is  a  mid  level  and  the  other  is  a  mid  falling  tone.  The 
mid  level  was  calculated  as  /33/  in  the  neutral  tone  experiment  and  was  also  calculated  to 
have  the  same  sandhi  /21/  tone  as  the  long  tone  /33/  in  the  sandhi  context.  Besides,  when 
a  syllable  in  this  short  tone  lost  the  final  consonant,  its  tone  became  long  /33/.  Based  on 
the  similar  tonal  behaviors  of  these  two  tones  in  different  environments,  I  assume  that  the 
long  and  the  short  mid  tones  are  derived  from  one  underlying  mid  level  tone  /33/.  The 
length  difference  can  be  predicted  from  the  syllable  final  stop  consonant 

The  phonetic  mid  falling  short  tone  was  calculated  as  /32y  in  the  neutral  tone 
experiment.  Since  this  nud  falling  tone  is  lower  than  the  high  falling  tone  /52/  and  higher 
than  the  low  falling  tone  /21/  phonetically,  I  use  /32/  to  represent  its  phonetic  form.  This 
short  tone  /32/  was  calculated  to  have  a  general  phonetic  sandhi  tone  of  /54/  in  the  sandhi 
environment,  and  the  long  I2\l  tone  was  calculated  to  have  a  ISlI  sandhi  tone  in  the  same 
sandhi  environment  The  short  sandhi  /54/  tone  is  very  similar  to  the  long  sandhi  tone 
ISII  as  both  are  high  falling  tones.   I  would  like  to  assume  that  phonologicaJly  these  two 


60 

high  falling  tones  are  the  same.  The  end  frequency  difference  is  due  to  the  shortness  of 
the  checked  syllables,  i.e.  syllables  ending  in  stop  consonants,  which  makes  the  short 
sandhi  tone  /54/  not  to  drop  as  low  as  the  longer  /52/  tone.  Thus  these  two  tones,  long 
tone  '21/  and  short  tone  /32/,  were  calculated  to  have  the  same  sandhi  tones  in  the  sandhi 
context.  Moreover,  in  the  clustering  experiment,  a  syllable  m  this  short  tone,  after  losing 
its  final  consonant,  was  clustered  with  syllables  of  the  long  -^l/  tone,  both  of  which  had  a 
long  /52/  sandhi  tone.  This  means  that  when  this  shon  /32/  tone  becomes  a  long  tone,  it 
merges  with  the  tone  that  has  similar  sandhi  tones.  Based  on  the  similar  sandhi  behavior 
and  that  this  short  /32/  tone  merged  with  a  similar  long  tone  after  its  syllable  lost  the  final 
stop  consonant,  I  assume  that  the  long  /21/  tone  and  the  short  /32/  tone  are  derived  from 
one  underlying  tone  /21/.  On  the  surface  the  short  low  falling  tone  is  produced  with  a 
higher  pitched  /32/  tone  than  the  long  low  falling  /21/  tone  before  the  neutral  tone 
contexts. 

Since  the  long  and  short  /33/  phonetic  tones  are  derived  from  one  underlying  tone 
/33/  and  the  long  and  short  /21/  and  /32/  tones  are  derived  from  one  underlying  tone,  our 
Taiwanese  data  have  only  five  tones  phonologically.  They  are  /55/,  1511,  /33/,  /21/,  and 
/13/.  But  there  are  six  citation  tones,  five  long  tones,  1551, 1511, 1'h'il,  /IV,  and  /213/,  and 
one  short  /42/  tone.  The  two  short  forms  derived  from  underlying  /21/  and  /33/  tones 
surface  as  a  742/  tone  when  they  are  produced  in  isolation  or  at  the  final  position  of  a  tone 
sandhi  domain.  Before  the  neutral  tone  /21/,  these  two  short  tones  surface  as  tones  /33/ 
and  732/  respectively.  The  underlying  rising  tone  /1 3/  is  produced  as  a  falling-rising 
/213/  tone  in  isolation  and  a  rising  724/  tone  before  the  neutral  tone  /27/.  For  all  of  the 
other  tones,  the  underlying  and  citation  tones  are  the  same. 

5.  CONCLUSION 

In  this  paper,  I  introduce  a  method  which  incorporates  computer  techniques  and  a 
pattern  matching  algorithm  to  perform  the  classification  of  tones  in  Taiwanese. 
MacQueen  (1965:288)  states,  "the  purpose  of  K-means  is  not  to  find  some  unique 
definitive  grouping,  but  rather  to  simply  aid  the  investigators  in  obtaining  qualitative  and 
quantitative  understanding  of  large  amounts  of  N-dimensional  data  by  providing  them 
with  reasonably  good  similarity  groups."  The  success  of  this  algorithm  depends  on  the 
classification  groups  selected.  As  shown  in  this  paper,  first  the  classification  results  have 
helped  me  discover  the  tone  inventory  and  possible  phonetic  tone  sandhi  in  my 
Taiwanese  speech  data.  Second,  these  results  also  suggest  a  language  development 
process  which  gives  me  an  idea  of  how  the  short  tones  may  have  disappeared  in  some 
Chinese  languages,  e.g.  Mandarin,  where  short  tones  no  longer  exist  It  is  true  that  some 
linguists  could  have  derived  the  same  results  by  perception.  However,  it  is  also  true  that 
not  all  linguists  are  able  to  dictate  different  tone  patterns  just  based  on  jjerception.  The 
tool  I  describe  here  could  be  of  great  help  to  the  above  linguists  because  it  provides  them 
with  a  rather  dependable  tool  for  the  study  of  tones  in  addition  to  perception. 
Furthermore,  using  different  combinations  of  parameters  to  classify  tones  could  provide 
linguists  in  speech  synthesis  with  valuable  information  in  the  production  of  human-like 
tones.  Finally  from  the  classification  results,  we  have  found  that  some  syllables 
transcribed  with  short  tones  had  an  average  tone  duration  between  long  tone  duration  and 
short  tone  duration.  These  are  the  syllables  that  are  in  the  process  of  losing  syllable  final 
consonants  and  gradually  assume  a  long  tone.  This  kind  of  tone  length  change  is  not 
easy  to  detect  perceptually.  Using  this  program  in  the  study  of  Taiwanese  or  Southern 
Min  language,  the  linguists  can  have  an  idea  about  the  process  of  short  tones  becoming 
long  tones  in  different  Southern  Min  dialects.  As  mentioned  earlier,  I  do  not  claim  that 
the  tool  could  replace  perception  as  the  only  dependable  method  in  the  study  of  tones.  It 
is  rather  a  mechanism  to  help  those  linguists  with  less  confidence  on  their  speech 
perception  of  tones.  They  could  obtain  as  solid  evidence  towards  the  tones  by  using  this 
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tool  as  by  speech  perception.  And  what  I  have  described  here  is  just  the  beginning  of 
incorporating  some  non-linguistic  techniques  into  the  field  of  linguistic  study.  I  hope  in 
the  future  there  will  be  more  use  of  other  computational  and  statistical  techniques  to  aid 
the  study  of  linguistic  phenomena. 

NOTES 

ll  am  very  grateful  to  Prof.  C.  C.  Cheng  for  his  guidance  during  the  preparation 
of  this  paper.  I  also  would  like  to  express  my  gratitude  to  Mr.  Matthew  Ratta  for  his 
help  with  the  K-means  algorithm  and  the  computer  software  programming,  to  Ms. 
Dorothy  Evans  and  Mr.  Steve  Helmreich  for  their  valuable  suggestions  and  comments, 
and  to  Ms.  Rachel  Manwell  for  her  help  in  the  speech  recording. 

2Taiwanese  is  a  dialect  of  Southern  Min  language.  It  is  spoken  by  about 
10,000,000  people  in  Taiwan  (Zhang  1983). 

3The  actual  environment  for  the  sandhi  tone  should  be  the  non-final  position  in  a 
tonal  group.  For  detailed  description,  see  Chen  (1986). 

'♦Due  to  typographic  reason,  the  glottal  stop  PI  is  represented  by  111. 

5The  A/D  and  D/A  converter  DT2801 A  was  designed  by  Data  Translation  Inc. 

^According  to  Nyquist's  criteria,  the  sampling  rate  should  be  twice  the  highest 
frequency  of  the  voice  to  be  sampled.  As  female  voice  is  higher  than  male  voice,  the 
sampling  rates  for  male  and  female  voices  are  10  K  and  20  K  respectively. 

7For  these  three  functions,  please  refer  to  the  references. 

80f  course,  this  algorithm  only  works  for  monosyllabic  words.  For  poly-syllabic 
words,  the  program  included  a  splicing  function  which  could  extract  a  syllable  from  an 
input  signal  and  store  this  syllable  speech  signal  on  a  disk.  Later,  this  syllable  speech 
signal  was  loaded  and  the  seven  parameters  were  obtained  by  the  above  algorithm. 

9ln  R.  Cheng  (1977),  there  is  no  falling-rising  tone.  Our  experimental  data, 
however,  show  that  "13"  is  a  falling-rising  tone.  However,  I  still  use  13  to  represent 
this  tone  in  the  computer  program  and  transcription. 

lOxhe  normalization  equation  is  100  X  ((frequency  -  lowest  frequency)/  (.highest 
frequency  -  lowest  frequency)). 

1  iThe  tone  identification  performance  is  determined  as  follows.  When  I  prepared 
the  experiment  data,  there  were  20  monosyllabic  words  for  each  tone.  Each  of  the  two 
speakers  read  140  words,  so  there  were  40  words  for  each  tone.  If  each  of  the  seven 
groups  contains  only  one  tone  and  the  number  of  words  with  this  tone  is  40,  then  the 
identification  performance  is  equal  to  the  sum  of  all  words  in  the  seven  groups  divided 
by  the  total  speech  words  (280/280),  which  is  a  100%  perfect  grouping  performance.  If 
a  group  contains  words  in  one  tone  and  the  number  of  words  is  less  than  30,  then  the 
performance  is  not  good.  This  group  is  dropped.  If  there  are  words  in  different  tones 
included  in  one  group  and  the  number  of  words  in  one  of  the  tones  is  either  greater  than 
10  or  less  than  30,  this  group  is  discarded.  Otherwise,  the  number  in  each  group  is 
added  together  and  the  sum  is  divided  by  280.  The  result  is  the  performance 
percentage. 
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^^Duc  to  typographic  reason,  the  back  vowel  Ol  is  written  as  /o7. 

^^For  example,  in  the  phrase  'au44  lit42/  "after  day",  with  neutral  tone  on  lit42/ 
[au44  litll]  this  phrase  means  'the  day  after  tomorrow';  with  both  syllables  in  fiiU  tones 
[au2l  lit42J  this  phrase  means  'future'. 
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Donna   M.    Far  i  na 
MULTIWORD   LEXICAL    UNITS    IN    FRENCH 


This  paper  will  discuss  multiword  lexical  units  in 
French.  I  will  demonstrate  the  methods  used  to 
determine  whether  a  combination  of  words  does  in  fact 
behave  as  a  single  unit,  or  whether  it  is  a  simply  a  free, 
but  frequent  combination.  The  demonstration  will  use  as 
example  the  combination  chemin  de  fer  (along  with  other 
examples),  and  will  be  based,  for  the  most  part,  on  the 
nine  tests  for  multiword  units  given  in  Zgusta  (1971: 
138-159).  This  demonstration  has  two  purposes.  The 
first  is  to  show  in  detail  one  aspect  of  a  more  general 
lexicographical  problem,  that  of  determining  what  should 
or  should  not  be  a  separate  dictionary  entry.  In  other 
words,  the  question  being  asked  is:  how  does  the 
lexicographer  decide  whether  a  combination  will  have  its 
own  entry?  The  second  purpose  is  to  allow  for  a  more 
informed  appraisal  of  a  French  dictionary's  actual 
treatment  of  multiword  lexical  units.  Thus,  the  paper 
will  look  at  dictionary  entries  in  the  Petit  Robert  {^979). 
When  it  is  pertinent  to  do  so,  these  entries  will  be 
compared  with  those  of  the  one-volume  LittrS  (1958); 
but  the  main  object  of  interest  here  is  the  Petit  Robert. 
Lastly,  the  larger  six-volume  Robert  (1986)  will  be 
briefly  examined,  only  in  order  to  determine  whether  the 
necessity  to  save  space  played  a  role  in  the  one-volume 
Petit   Robert  's  treatment  of  multiword  lexical  units. 


I.     The  Nine  Tests. 

One  can  define  a  multiword  lexical  unit  with  respect  to  its 
function,  and  this  is  how  I  am  using  the  term  here.  A  multiword 
lexical  unit,  then,  is  a  lexical  unit  consisting  of  more  than  one  word, 
which  functions  in  a  sentence  in  the  same  way  as  do  lexical  units  of 
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a  single  word.  Thus  it  is  appropriate  that  the  following  tests  for 
multiword  lexical  units  (henceforth  abbreviated  MLU)  test  their 
function. 

The  combination  chemin  de  fer,  as  will  be  seen  by  the  tests  to 
follow,  is  a  clear  case  of  a  MLU.  In  the  Petit  Robert  (p.  298),  it  is 
given  its  own  entry,  separate  from  the  entries  of  both  chemin  and 
fer.  The  Petit  Robert  notes  the  two  earliest  occurrences  of  this 
combination;  first,  in  1787  as  a  translation  of  railway,  and 
secondly,  in  1823  when  the  railroad  between  Saint-Etienne  and 
Andrezieux  began  service.  This  information  shows  that  chemin  de 
fer  came  from  the  translation  of  an  English  single  word  lexical  unit 
consisting  of  two  morphemes,  each  of  which  may  also  occur 
independently. 

The  Petit  Robert  organizes  definitions  for  an  entry  in 
historical  order.  For  chemin  de  fer,  the  first  definition  is  marked 
Vx.  Vieux',  indicating  that  this  meaning  is  rarely  used  now.  This 
definition  reads:  "Chemin  form6  par  deux  rails  parall^les  sur 
lesquels  roulent  les  trains."  The  second  definition,  marked  Mod. 
'moderne',  reads:  "Le  moyen  de  transport  utilisant  la  voie  ferr§e; 
I'exploitation  de  ce  moyen  de  transport."  The  third  definition  is  the 
company  which  uses  the  railroad  lines  for  profit;  the  fourth,  a  toy 
train  set;  and  the  fifth  is  a  casino  money  game. 

The  significance  of  this  definition  is  twofold:  first  (as  was 
noted  above),  that  there  is  a  separate  entry,  indicating  that  the 
lexicographers  of  the  Petit  Robert  regarded  chemin  de  fer  as  a 
separate  unit;  and  secondly,  that  the  meaning  of  this  combination 
was  extended,  from  the  older  specific  meaning  of  'train  track'  to  the 
more  general,  modern  meanings  of  'means  of  transportation', 
'utilization  of  this  means',  and  'company  profiting  from  this  means'. 
Note  that  the  meaning  was  also  extended  metaphorically  to  the 
casino  game;  in  the  same  way,  English  guinea  pig  has  come  to  mean  a 
person  used  as  a  subject  for  experiments,  alongside  its  original 
meaning  of  a  species  of  animal  often  used  in  experiments  (cf.  The 
Pocket  Oxford  ^  97 8:  385;  The  American  Heritage  Dictionary  1969). 
It  will  be  seen  later  that  other  combinations  in  French  which  clearly 
seem  to  be  MLUs,  but  which  are  not  given  separate  entries  by  the 
lexicographers,  may  be  distinguished  by  the  lexicographers  from 
chemin  de  fer  in  part  by  apparent  lack  of  extended  meanings. 

The  first  test  for  multiword  lexical  units,  is,  of  the  nine  to  be 
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discussed,  one  of  the  two  most  important:  the  test  of  1  ) 
commutation  or  substitution.  In  order  to  discover  whether  a 
combination  is  simply  one  of  great  frequency  of  occurrence  in  the 
language,  or  whether  it  functions  as  a  unit,  one  can  replace  for  each 
part  of  the  combination  a  word  of  the  same  or  similar  meaning.  If 
the  new  combination  resulting  from  this  operation  has  a  different 
meaning  from  the  original  combination,  then  the  original 
combination  is  probably  a  multiword  lexical  unit.  For  chemin  de  fer, 
if  route  or  voie,  which  have  roughly  the  same  meaning  as  chemin 
'path',  are  substituted  for  chemin,  the  resulting  combinations  route 
de  fer  and  voie  de  fer  do  not  have  the  same  meaning  as  chemin  de  fer. 
These  could  mean  'road  of  iron'  if  they  mean  anything  at  all,  but  in  no 
case  could  they  mean  'railroad'.  Likewise,  when  names  of  other 
metals,  such  as  those  meaning  'steel'  and  'bronze',  are  used  in  place 
of  fer  'iron',  the  new  combinations  with  chemin  do  not  mean 
'railroad',  but  only  different  types  of  roads:  chemin  d'acier  'road  of 
steel',  chemin  de  bronze  'road  of  bronze'.  A  further  test  could  be  a 
substitution  of  something  with  the  connotative  meaning  of  de  fer, 
'of  strength  or  durability'  (note  the  English  nerves  of  steel.  Iron 
Horse).  But  here  too,  chemin  de  force  has  nothing  in  common  with 
the  meaning  of  chemin  de  fer.  Finally,  note  that  the  combination 
chemin  de  fer  could  itself  have  the  meaning  'road  of  iron'.  With  this 
meaning,  it  would  be  a  free  combination  (as  the  possibility  of  having 
an  equivalent  voie  de  fer  shows),  and  it  would  thus  contrast  with  the 
set  combination  of  the  same  form,  meaning  'railroad'. 

The  second  possible  test  for  a  MLU  is  2)  to  add  something 
to  the  combination  under  scrutiny.  Often,  but  not  always,  one 
cannot  add  to  a  MLU.  In  the  case  of  chemin  de  fer,  an  adjective  can 
only  modify  the  whole,  and  not  part  of  the  whole.  Thus,  the 
following  have  no  meaning  in  French: 


chen 
Chen 

lin  nsiil    de..ffii 
1         1 

neuf:    'new' 

1 

iin     de    fer    forg6 

forge:   'wrought 

1          1 

1 

1 

while  the  following  do  have  meaning: 
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chemin   de  fer    neuf 
chemin   de  fgr    m^tropQlitain 


Note  that  this  alone  is  not  a  sufficient  test  for  a  MLU.  For  instance, 
some  combinations  with  chemin  which  one  would  not  want  to  call 
MLUs  may  also  not  be  added  to,  like  chemin  de  terre: 


'chemin   d&  terre    abrupt 


"road  of  coarse  earth' 


But, 


Chemin   de  terre    ahnisl 


'rough  earthen  road* 


However,  ISabrusula  (1966)  makes  clear  that  the  real 
significance  of  this  test  is  not  necessarily  that  one  can  add  to  a 
given  combination  (or  where  one  can  add),  but  rather  how  what  is 
added  functions  with  respect  to  the  whole.  For  example,  ^abrusula 
cites  the  MLU  avoir  faim  'to  be  hungry'  in  the  sentence,  "L'enfant  a 
trds  faim,"  'the  child  is  very  hungry'  He  notes  that  the  combination, 
though  fixed,  is  not  so  fixed  that  it  prevents  an  adverb  like  trds 
from  being  inserted  between  the  two  words  of  the  combination,  in 
order  to  modify  the  combination  as  a  whole.  Thus,  "le  syntagme  trds 
faim  n'existe  pas  en  frangais  [as  in:  '(La)  trds  faim  des  enfants], 
pas  plus  que  a  trds.  Tr^s,  du  point  de  vue  de  la  selection,  peut  etre 
combin6  ^  une  unit§  onomatologique  complexe  remplagant  un  verbe." 
Thus,  this  test  should  be  used  with  prudence,  as  it  will  not  always 
be  applicable  to  all  MLUs. 

The  third  test  for  MLUs  again  will  not  apply  to  all  such  units. 
Here  one  determines  whether  3)  the  meaning  of  the  combination 
is    fully    derivable    from    its    parts.     If  it  is  not,  one  is  dealing 
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with  a  MLU.  Mathiot  (1967)  calls  this  gestalt  meaning:  "the  meaning 
of  the  whole  being  more  than  the  sum  of  the  meanings  of  the  parts." 
Chemin  de  fer  obviously  passes  this  test;  as  was  noted  earlier,  the 
whole  meaning,  'railroad',  is  not  derivable  from  the  meaning  of  the 
parts,  'road  of  iron'.  The  original  meaning  of  chemin  de  fer  had  as 
criterial  feature  a  road  made  of  two  parallel  rails;  even  if  these 
rails  were  a  blend  of  metals  (instead  of  just  being  made  of  iron),  one 
would  presumably  still  have  a  chemin  de  fer.  In  the  extended 
definitions  of  chemin  de  fer,  the  meaning  is  totally  divorced  form 
that  of  its  parts. 

Note  that  a  combination  whose  meaning  can  be  derived  from  its 
parts  is  not  ruled  out  as  a  possible  MLU.  This  criterion  therefore 
identifies  positively,  but  not  negatively,  MLUs.  (This  will  become 
important  later  in  the  discussion  of  chemin  de  halage.)  For  this 
reason,  in  her  work  in  compiling  a  Papago-English  dictionary, 
Mathiot  found  that  the  criterion  of  gestalt  meaning  alone  was  not 
operational  for  determining  the  units  of  the  lexicon.  She  noted  the 
necessity  of  structuralist  techniques  such  as  those  in  the  first  and. 
second  tests  above.  Thus,  this  third  test  can  be  a  corroborating  test 
for  MLUs,  but  may  not  be  used  by  lexicographers  as  the  basic  one, 
when  they  attempt  to  determine  what  the  entries  of  a  dictionary 
should  be. 

The  fourth  test  is  that  4)  one  of  the  words  of  a  MLU  may 
be  restricted  in  use,  and  this  does  not  apply  to  the  combination 
being  discussed  here,  chemin  de  fer.  This  means  that  a  word  within 
the  MLU  may  only  be  used  within  the  set  combination  of  which  it  is  a 
part,  or  it  may  be  used  there  and  in  only  a  few  contexts.  The  Manual 
of  Lexicography  (1971:  148)  cites  the  word  ma/din  English  old  maid 
as  an  example  of  this.  An  example  in  French  comes  from  Duneton 
(1978:  249).  The  expression  perdre  la  tramontane  means,  according 
to  the  Petit  Robert  (p.  1999),  "etre  desorient6,  perdre  le  nord."  In 
the  modern  language,  tramontane  means  "vent  du  nord".  Only  in  this 
expression,  however,  does  tramontane  retain  its  archaic  meaning  of 
'North  Star'.  And  that  this  older  meaning  is  really  unknown  to  the 
average  modern  French  person  can  be  assumed  by  the  expression's 
inclusion  in  a  book  attempting  to  explain  popular  expressions  for  the 
general  reader. 

On  the  other  hand,  neither  chemin  nor  fer  is  restricted  in 
usage.  With  chemin,  one  finds  such  free  combinations  as:  chemin 
neuf,  chemin  descendant,  passer  par  un  autre  chemin,  chemin   de 


montagne.     With   fer,   we   have:      fer     forg4,  fer     battu,    etc. 
Nevertheless,  this  test  can  be  a  very  useful  indicator  of  a  MLU. 

The  next  test  is  5)  to  find  a  synonynfi  or  near-synonym 
consisting  of  only  one  word.  For  chemin  de  fer,  there  is  the  one 
word  equivalent,  rail.  In  the  Petit  Robert  (p.  1596),  the  second 
definition  of  this  word  reads:  "Transport  par  voie  ferree."  This 
corresponds  to  the  second  definition  of  chemin  de  fer.  A  quotation 
given  by  the  Petit  Robert  under  rail:  «L'espece  de  competition  qui 
oppose  le  rail  k  la  route» ,  could  easily  have  les  chemins  de  fer  in 
place  of  le  rail,  according  to  a  native  French  speaker.  Thus,  chemin 
de  fer  functions  as  does  a  French  single  word  lexical  unit. 

The  sixth  test  is:  6)  to  find  a  paradigm  of  semantlcally 
related  expressions  where  MLUs  have  identical  status  with 
single  words.  The  quote  with  rail  above  gives  us  a  clue  to  such  a 
paradigm;  one  could  have,  for  instance,  a  paradigm  of  words  denoting 
means  of  transportation,  la  route  --  le  rail  -  I'air  --  les  chemins 
de  fer ....  Chemin  de  fer  is  of  equal  status  with  the  other  members  in 
the  paradigm,  all  of  which  are  single  words. 

The  next  test  is  to  go  from  the  language  of  study  to  foreign 
languages,  and  asks,  7)  are  there  single  word  equivalents  for 
the  combination  in  foreign  languages.  For  the  equivalent  in 
meaning  of  chemin  de  fer,  the  languages  English,  German,  and 
Rumanian  all  have  single  word  lexical  units.  English  has  the  older 
railway,  and  railroad.  German  has  Eisenbahn  {Eisen  'iron',  and  Bahn 
'path'),  Rumanian  has  drum-de-fier  {drum  'chemin';  and  fier  'fer'). 
Rumanian  has  as  well  the  set  combination  cale  ferata  with  the 
meaning  'chemin  de  fer'.  Russian  has  only  the  set  combination 
zeleznaja  doroga  for  chemin  de  fer.  Note  that  in  the  first  three 
languages,  the  single  words  consist  of  morphemes  wnich  may  also  be 
used  as  single  words  themselves.  This  criterion  is  like  that  of 
gestalt  meaning  (number  3),  in  that  it  is  a  positive,  not  a  negative 
criterion;  even  if  no  single  words  could  be  found  in  other  languages 
for  a  given  combination,  one  may  not  rule  out  the  possibility  that  it 
functions  as  a  single  unit. 

The  eighth  test  is  to:  8)  look  for  special  formal, 
grammatical  criteria  in  a  MLU.  This  is  not  applicable  to  chemin 
de  fer,  but  can  be  demonstrated  by  the  expression  mentioned  earlier, 
perdre  la  tramontane.   Sabrusula  (1966)  notes  that  the  Impossibility 
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of  passivizing  expressions  like  this  one.  which  seem  to  correspond 
to  the  model  verb  +  direct  object,  is  a  sign  that  one  is  dealing  with  a 
unit  which  is  indivisible  lexicologically  as  well  as  syntactically. 
Thus,  one  cannot  have:  *La  tramontane  est  perdue.  Another 
expression  functioning  in  this  way  is  tuer  le  temps;  as  in  English, 
one  could  not  say:    'time  is  l<illed. 

Martinet  (1970:  194)  cites  avoir  I'air  as  an  example  of  an 
expression  with  special  grammatical  properties.  He  attributes,  it 
should  be  noted,  "le  figement  du  syntagme"  to  the  fact  that  it  occurs 
with  a  high  frequency,  but  function  and  not  cause  is  of  interest  here. 
Martinet  notes  that  an  adjective  following  this  expression  will 
agree  with  the  subject  of  the  phrase  and  not  with  air  (masculine 
noun);  thus,  'elle  a  I'air  gentille'  'she  seems  nice'.  He  further  notes 
that  in  the  speech  of  children,  the  entire  expression  is  treated  like  a 
verb,  and  is  conjugated:  '"ga  m'alairait  bon"  is  heard  instead  of  the 
correct:  "ga  m'avait  I'air  bon."  Although  it  is  natural  that  children 
overgeneralize  when  they  are  learning  their  language,  this  particular 
overgeneralization  does  vouch  for  the  perceived  unity  of  the 
expression   avoir  I'air. 

The  ninth  criterion  is  closely  related  to  the  fifth  test,  that  of 
finding  a  one  word  synonym  for  a  combination.  This  last  criterion 
asks  the  question:  9)  does  the  combination  function  as  a 
single  word?  It  was  shown  above  that  cliemin  de  fer  can  indeed 
function  as  does  its  one  word  synonym,  rail:  Its  role  in  a  sentence  is 
the  same  as  the  role  of  a  single  word.  This  criterion,  as  the  Manual 
of  Lexicography  (p.  150-1)  notes,  is  used  to  distinguish  set  groups 
from  set  combinations,  thus,  an  expression  jeter  de  la  poudre  aux 
yeux  'to  fool*  (Duneton  1978:  306),  though  definitely  a  set 
expression,  does  not  behave  as  one  word  in  a  sentence,  while  chemin 
de  fer  and  avoir  I'air  do.  This  criterion  and  the  first  criterion 
(commutation)  are  the  two  most  fundamental  in  determining  the 
MLUs  of  a  language. 

To  end  this  section  on  'discovery'  techniques  for  MLUs,  I  would 
like  to  propose  a  tenth  criterion,  and  then  add  a  note  of  caution.  In 
the  Manual  (p.  144),  it  is  mentioned  that  "the  only  possible  change  in 
a  multiword  lexical  unit  seems  to  be  the  dropping  of  one  of  the 
constituent  elements  ...";  and  that  this  dropping  is  elliptical, 
occurring  only  in  disambiguating  contexts.  A  famous  example  of 
such  a  change  through  time  of  a  MLU  is  Martinet's  (1970:  187-8) 
chemin   de   fer   m6tropolitain,  which  became  first  le    m6tropolitain, 
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and  then  le  metro.  However,  it  often  happens  that  multiword  lexical 
units  are  used  In  some  contexts  in  their  full  form,  and  in  others  in 
an  elided  form  (when  none  of  the  contexts  are  ambiguous)  --  in  the 
same  synchronic  state  of  a  language  (This  was  most  probably  the 
case  with  chemin  de  fer  metropolitain  -  m^tropolitain,  and  later 
with  metropolitain  -  m4tro.)  For  example,  ctiemin  de  fer  can  be 
referred  to  as  simply  fer.  Under  fer  in  the  Petit  Robert  (p.  770)  this 
example  appears:  "Transport  par  fer  et  par  air.»  This  context  is 
unambiguous:  fer  here  means  cliemin  de  fer.  In  light  of  examples 
such  as  these,  I  propose  that  a  tenth  test  of  MLUs  would  investigate 
ellipses  of  parts  of  combinations.  If  these  were  found  to  exist,  this 
would  be  still  further  evidence  of  a  MLU,  if  the  combination  also  met 
some  of  the  criteria  mentioned  above  --  including  the  two  most 
important  criteria  (commutation  and  single-word  function  in  a 
sentence). 

As  was  said  earlier,  and  as  has  been  shown  by  the  above  tests, 
chemin  de  fer  is  a  very  clear  case  of  a  MLU.  But  the  reader  should  be 
warned,  for  it  is  not  too  strong  a  statement  to  say  that  chemin  de 
fer  is  for  this  reason  exceptional.  This  combination  was  chosen  here 
because  it  is  very  good  for  illustrative  purposes:  almost  every  test 
could  be  applied  to  it.  But  in  reality,  there  is  no  line  to  draw 
between  free  and  set  combinations,  and  making  a  decision  on  the 
status  of  a  combination  can  be  a  problem  for  the  lexicographer. 
Thus,  the  reason  for  having  many  different  ways  of  analyzing 
combinations  --  so  that  a  lexicographer  can  make  as  intelligent  a 
decision  as  possible  about  the  combination's  status. 


II.     Treatment  of  MLUs  in  the  Petit    Robert. 

The  first  section  of  this  paper  has  shown  how  to  determine 
what  are  the  MLUs  in  a  language,  and  has  indicated  the  difficulties 
that  a  lexicographer  can  face  in  determining  a  combination's  status. 
But  the  problem  of  determining  what  should  have  a  separate 
dictionary  entry  has  not  yet  been  addressed.  For,  even  if  a 
lexicographer  considers  a  combination  a  MLU,  this  does  not 
necessarily  mean  that  it  should  have  its  own  entry.  Among  other 
considerations,  there  is  the  problem  of  space,  and  more  entries  take 
up  more  space.  This  being  the  case,  several  questions  must  be 
raised:  if  a  lexicographer  decides  against  giving  a  MLU  its  own 
entry,  what  then  is  done  with  it?  Is  it  put  under  the  entry  of  the 
first  word  of  the  unit,   or  is  it  put  under  the    entry    of    the    most 
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important  word?  If  the  lexicographer  does  either  of  these  two 
things,  does  she  give  a  cross-reference  for  the  hapless  reader  who 
looks  in  the  wrong  place?  And  lastly,  wherever  she  puts  it,  how 
does  the  lexicographer  indicate  to  the  user  that  the  status  of  the 
MLU  is  different  from  that  of  a  free  combination  that  might  be  used 
for  an  example?  I  will  attempt  to  answer  these  questions  by 
examining  the  treatment  of  MLUs  by  the  Petit  Robert,  with  some 
comparisons  to  the  LittrS. 

First  of  all,  it  will  be  interesting  to  look  at  what  the  Petit 
Robert  itself  says  about  MLUs,  to  see  if  we  can  thus  discover  the 
lexicographer's  answers  to  the  above  questions.  In  the  introduction 
by  Alain  Rey,  the  traditional  treatment  of  MLUs  is  discussed  (p.  X). 
Rey  notes,  for  example,  that  a  set  combination  like  pomme  de  terre 
'potato'  will  usually  be  lost  under  the  numerous  examples  of  the 
word  pomme  'apple'.  That  this  is  indeed  the  case  is  easily 
discovered  by  examining  the  Littr6  (1958:  924).  Likewise  in  this 
dictionary,  chemin  de  fer  is  buried  under  the  entries  of  chemin,  with 
no  entry  of  its  own  (p.  206).  Given  the  problems  of  MLUs,  it  is  not 
surprising  that  a  dictionary  would  choose  to  treat  them  alike  across 
the  board,  by  reserving  entries  for  single  word  units,  and  thus 
avoiding  the  difficulty.  But  the  Petit  Robert  proposes  something 
different,  under  a  paragraph  entitled  "Ou  et  comment  trouver  un 
mot?" 

II  nous  a  paru  plus  raisonnable  de  donner  k  I'ordre 
alphab^tique  ces  vrais  composes  que  sont  pomme  de 
terre,  chemin  de  fer,  ou  point  de  vue.  De  tels  precedes  ne 
peuvent  pas  etre  appliques  syst^matiquement  dans  un 
dictionnaire,  sous  peine  d'augmenter  excessivement  le 
nombre  des  "mots"  trait6s  et  le  volume  de  L'ouvrage. 
Cependant,  quand  certaines  expressions  ou  formes 
verbales,  moins  Importantes,  constituaient  de  v^ritables 
mots  (on  dit  alors  qu'elles  sont  lexicalisees),  elles  ont 
6t6  present6es  en  capitales  dans  les  articles  ou  elles 
sont  incluses,  pour  attirer  I'attention  sur  leur  autonomie. 
Ainsi:  EXCfes  DE  POUVOIR  (^  I'article  excds),  la  locution  A 
L'EXCEPTION  DE  (sous  exception),  la  locution  COMME  IL 
FAUT,  qui  s'emploie  comme  un  adverbe  ou  comme  un 
adjectif  {k  falloir).  (p.  X) 

There  are  many  things  to  note  about  the  above  passage.  Although  at 
first  glance  there  appears  to  be  a  very  systematic  treatment  of  the 
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problem  of  MLUs,  there  is  no  explanation  of  what  criteria  allowed 
the  lexicographers  to  distinguish  between  'more'  and  'less'  important 
expressions  or  verbal  forms.  Without  this  information  ,  it  would  be 
hard  to  guess  why  pomme  de  terre  is  more  important  than  excds  de 
pouvoir,  perhaps  it  is  frequency  or  length  of  time  that  the  set 
expression  has  existed  in  the  language  that  distinguishes  the  two. 
But,  a  few  words  will  be  said  on  this  later. 

Another  criticism  that  one  can  make  is  that  the  information 
contained  in  this  passage  is  present  nowhere  else  in  the  dictionary: 
no  table  of  abbreviations  or  table  of  any  kind  explains  this  use  of 
capital  letters  to  designate  set  combinations.  Since  it  is  doubtful 
that  the  average  person  ever  reads  the  introduction  to  a  dictionary, 
this  information  is  close  to  useless.  But  the  situation  becomes 
more  complicated  in  light  of  additional  information  which  the  Petit 
Robert  includes. 

In  a  later  part  of  the  introduction  (p.  XV),  Rey  discusses  what 
he  calls  "syntagmes",  or  "groupes  de  mots".    He  states: 

On  comprendra  sans  peine  qu'il  faut  distinguer  entre  les 
exemples  d'auteurs  ou  les  phrases  anonymes  qui 
precedent  d'un  choix  n^cessairement  arbitraire,  et  les 
groupes  de  mots  qui  ne  laisse  aucun  choix,  aucune  liberty 
au  parlour  ou  k  I'^crivain. 

As  an  example  of  a  group  where  one  has  no  choice  of  words,  Rey 
cites  fidvre  de  cheval,  a  very  high  fever,  under  the  entry  fidvre.  He 
says: 

lei,  le  frangais  courant  n'a  plus  le  choix:  11  est 
impossible  de  parler  d'une  "fidvre  de  vache"  ou  "de 
poney";  impossible  memo  de  dire:  "il  a  une  fievre  de  petit 
cheval"  ou  "de  cheval  de  labour",  sauf  en  vue  d'un  effet  de 
style. 

Get  exemple  aidera  k  comprendre  la  difference  qui 
existe  entre  une  suite  de  mots  fr6quente  mais  modifiable 
...,  et  une  suite  de  mots  intangible  ...  aussi  indespensable 
k  connaitre  que  le  mot  lui-meme. 

From  this  information,  it  is  impossible  to  tell  if  Rey  regards  these 
"suites  intangibles"  as  the  same  type  of  entity  as  the  "composes" 
mentioned    earlier    in    the    introduction.      These    "suites    de    mots" 
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certainly  do  have  many  of  the  same  characteristics  as  the 
"composes";  for  example,  one  could  not  say  *orange  de  terre  instead 
of  pomme  de  terre.  But,  nevertheless,  the  dictionary  proposes  to 
treat  these  "suites"  in  a  manner  differing  slightly  from  that  of  the 
"composes"  discussed  earlier: 

Pour  la  commodity  du  lecteur,  il  faut  preciser  que  ces 
groupes  de  mots  intangibles  se  trouvent  le  plus  souvent 
mentionn6s  sous  leur  principal  6l6ment:  faire  fete,  k 
fete;  feu  de  joie  et  coup  de  feu,  ci  feu.  Quand  le  groupe 
aurait  pu  etre  classd  k  deux  ou  meme  k  trois  endroits,  le 
mot  ou  le  groupe  est  trait§  a  §t6  signal6  par  un 
asterisque.  Ainsi,  mettre  le  feu  aux  poudres*;  feu 
follet*,  dans  I'article  feu,  signifient  que  les  explications 
et  eventuellement,  les  exemples  se  trouvent  k  poudre  et 
a  follet.  (pp.  XV-XVI) 

This  information,  again,  is  stated  nowhere  else  in  the 
dictionary.  On  the  whole,  the  Introduction  renders  less  than  clear 
the  problems  of  MLUs.  (Note  also  that  there  is  no  distinction  made 
between  set  combinations  such  as  coup  de  feu,  and  set  groups  such 
as  mettre  le  feu  aux  poudres,  by  the  use  of  the  asterisk  notation.) 
Thus,  one  must  go  to  the  dictionary  itself,  to  see  if  the  treatment  of 
these  units  can  be  understood,  and  if  this  treatment  is  consistent. 
To  determine  this,  I  will  again  look  at  chemin  de  fer,  along  with 
other  expressions. 

The  first  entry  to  be  discussed  is  that  of  chemin  {Petit  Robert, 
p.  298).  Since  chemin  de  fer  has  its  own  entry,  it  is  not  mentioned 
anywhere  under  chemin.  The  entry  for  chemin  makes  use  of  the 
capital  letters  technique  once,  for  CHEMIN  DE  RONDE  'parapet  walk', 
and  the  asterisk  technique  twice,  for  Le  chemin  de  la  croix*  'way  of 
the  cross',  and  Le  chemin  des  dcoliers*  'long  way  around'.  One 
expression  which  has  neither  asterisk  nor  capital  letters  is  chemin 
de  halage  'towing  path',  defined  briefly  by  "sur  une  berge".  But  this 
combination  nevertheless  does  seem  to  show  characteristics  of  a 
MLU.  Under  halage  (note  that  there  was  no  asterisk  to  warn  us  to 
look  under  halage),  we  find  a  more  complete  definition:  'Chemin  de 
halage  et  ellipt.  Le  halage,  chemin  qui  longe  un  cours  d'eau  pour 
permettre  le  halage  des  bateaux."  (p.  908) 

Thus,  we  see  that  chemin  de  halage  has  an  elliptical 
equivalent.     When  we  look  up  two  words  noted  as  being  associated 
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with  chemin  de  halage,  we  find,  first  under  16:  "Largeur  d'un  chemin 
de  halage;  par  ext.  Ce  chemin."  (p.  1080)  So,  chemin  de  halage  has  a 
one  word  equivalent  in  French,  and  it  functions  in  a  sentence  in  the 
same  way  as  does  one  word,  or  as  does  chemin  de  fer.  Under 
marchepied,  also  indicated  in  the  definition  of  chemin  de  halage,  we 
find  one  definition  which  reads:  "Chemin  qui  longe  un  cours  d'eau  sur 
la  rive  oppos6e  au  chemin  de  halage."  (p.  1152)  Thus,  chemin  de 
halage  can  be  said  to  be  part  of  a  paradigm  with  one  word  lexical 
units:    chemin  de  halage  --  marchepied  -- 14. 

Note  that  chemin  de  halage  does  not  meet  the  third  criterion, 
of  gestalt  meaning:  the  meaning  of  the  whole  is  easily  derivable 
from  that  of  its  parts.  But,  as  was  said  earlier,  this  is  not  a 
criterion  of  primary  importance.  All  that  this  might  tell  us  is  that 
on  a  continuum,  we  might  want  to  rank  chemin  de  fer  as  being  'more 
MLU-like'  than  chemin  de  halage,  although  both  should  still  be 
considered   multiword   lexical   units. 

I  have  briefly  shown  that  chemin  de  halage  is  a  MLU;  however, 
this  is  not  meant  to  argue  that  it  needs  its  own  entry.  If  one 
measures  the  total  length  of  the  entries  chemin  and  chemin  de  fer, 
one  finds  that  they  take  up  25.6  centimeters  in  the  Petit  Robert.  On 
the  other  hand,  the  space  taken  up  by  halage  (the  word  that  chemin 
de  halage  is  defined  under)  and  chemin  de  halage  together  is  only  1 .3 
centimeters.  The  small  amount  of  space  required  by  these  two 
definitions  does  not  merit  two  separate  entries.  On  the  other  hand, 
one  can  and  should  argue  that  the  status  of  chemin  de  halage  as  a 
MLU  should  be  indicated  by  the  Petit  Robert;  furthermore,  under 
chemin,  there  should  be  a  cross-reference  to  halage,  using  the 
asterisk  notation.  All  this  would  only  be  a  consistent  realization  by 
the  lexicographers  of  what  they  claim  to  do  in  the  introduction. 

A  footnote  to  the  above  observations  may  be  added:  earlier,  in 
the  enumeration  of  the  definitions  of  chemin  de  fer,  it  was 
mentioned  that  perhaps  MLUs  without  separate  entries  in  the 
dictionary  lack  extended  or  multiple  meanings.  This  speculation 
brings  us  again  to  the  Petit  Robert  's  criterion  of  more  and  lesser 
importance  of  MLUs.  One  can  see,  when  one  compares  the  definitions 
of  chemin  de  fer  with  the  single  meaning  of  chemin  de  halage,  that 
this  indeed  seems  to  have  some  relevance;  point  de  vue  (p.  1470), 
mentioned  in  Rey's  introduction,  also  has  extended  meanings. 
However  pomme  de  terre  has  only  two  definitions,  although  these 
are   fairly   long   (once   examples   are   included).      To   finish   with   this 
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problem  of  relative  Importance,  I  would  like  to  conclude  that  the 
only  clear-cut  determiner  of  what  MLUs  will  become  entries  seems 
to  be  length:  other  considerations,  such  as  word  frequency,  number 
of  definitions,  extended  meanings,  social  importance,  etc.,  if  they 
were  considerations  at  all,  were  most  surely  secondary  to  length.  In 
any  case,  when  they  do  not  have  their  own  entry,  MLUs  should  be 
marked  as  such  for  the  dictionary  user  and  cross-referenced  when 
necessary. 

The  last  task  of  this  paper  is  to  check  the  six-volume  Robert 
(1966),  to  see  if  there  is  any  radical  difference  of  approach  with 
respect  to  MLUs  in  the  'unabridged'  dictionary.  Perhaps  what  seems 
to  be  inconsistency  in  the  Petit  Robert  is  only  due  to  considerations 
of  space:  perhaps  a  larger  dictionary  would  treat  chemin  de  halage 
in  a  different  way.  My  examination  of  the  Robert  proved  that  this 
was  not  the  case:  for  the  entries  I  was  interested  in  {chemin, 
chemin  de  fer,  halage,  14,  marchepied),  there  was  no  major  change  in 
the  large  Robert.  In  each  case  the  definitions  remained  unchanged, 
while  examples  and  quotations  were  present  which  had  been  left  out 
of  the  Petit  Robert.  Thus,  chemin  de  halage  was  not  recognized  as  a 
MLU  in  the  large  Robert,  either.  One  interesting  detail  gleaned  from 
the  larger  Robert  is  that  chemin  de  halage  is  used  metaphorically  in 
addition  to  its  designative  use:  as  we  saw  in  the  case  of  guinea  pig, 
perhaps  metaphorical  usage  is  also  an  indication  of  a  MLU.  Thus  I 
will  end  with  a  quote  from  Chateaubriand',  from  the  Robert  (V.3,  p. 
417): 

L'epoque  oij  nous  entrons  est  le  chemin  de  halage  par 
lequel  des  generations  fatalement  condamn^es  tirent 
I'ancien  monde  vers  un  monde  inconnu. 

In  conclusion,  this  paper  set  out  to  analyze  multiword  lexical 
units  and  some  of  the  difficulties  which  they  present  to 
lexicographers.  The  effort  in  modern  dictionaries,  such  as  the  Petit 
Robert,  to  deal  with  the  problems  systematically  (however 
imperfect  the  results  may  be)  is  one  which  stands  to  benefit  greatly 
the  dictionary  user:  not  only  will  information  searches  be  more 
fruitful  and  less  time-consuming,  but  also  the  status  of  MLUs  in  the 
whole  lexicon  will  be  made  clearer. 


76 

Bi  bl  i  ogr  aphy 

DUNETON,   Claude.      1978.      La   puce   ^   Toreille:      anthologie   des 

expressions  populaires  avec  leur  origine.   Paris:  Editions  Stocl<. 
GREIMAS,  A. -J.     1960.     "Idiotismes,  proverbes,  dictons."     Cahiers  de 

Lexicologie    2:    41-61. 
LANGENSCHEIDT.     1975.    German-English,  English-German  dictionary. 

New  York:    Pocket  Books. 
LITTRE,  Emile.     1958.     Dictionnaire  de  la  langue  frangaise,  abrege 

par  A.  Beaujean.     Paris:     Editions  Universitaires. 
MARTINET,  Andr6.     1970.     Elements  de  linguistique  g^nerale.     Paris: 

Librairie  Armand  Colin. 
MATHIOT,  Madeleine.     1967.    "The  place  of  the  dictionary  in  linguistic 

description."    Language    43:    703-724. 
MORRIS,  William,  ed.     1969.    The  American  heritage  dictionary  of  the 

English  language.    Boston:    Houghton  Mifflin  Company. 
ROBERT,    Paul.      1966.      Le   Robert:      dictionnaire   alphabetique   et 

analogique  de  la  langue  frangaise.    Paris:     Societe  du  nouveau 

Littr§    (v.1-6). 
ROBERT,  Paul.     1979.     Le  petit  Robert:     dictionnaire  alphabetique  et 

analogique  de  la  langue  frangaise,  redaction  dirigee  par  A.  Rey 

et  J.  Rey-Debove.    Paris:  Society  du  nouveau  Littre. 
SARA§,  Marcel.    1977.     Dicjionar  roman-francez.  Bucurepti:   Editura 

^tiinJificS    ^i   Enciclopedica. 
SARA^,  Marcel.     1978.     Dicjionar  francez-roman.   Bucure^ti:   Editura 

5tiin)ifica    ^i   Enciclopedica. 
STEINER,   Roger  J.      1978.      The   new  college   French   and   English 

dictionary.     Oxford:     Amsco  School  Publications,  Inc. 
SABRUSULA,  Jan.     1966.    "Un  probldme  de  la  peripherie  du  systeme 

morphologique:     A  propos  des  formations  premorphologiques." 

Travaux  linguistiques  de  Prague    2:     183-192. 
SYKES,  J.B.     1978.     The  pocket  Oxford  dictionary  of  Current  English, 

6th  ed.    Oxford:    Clarendon  Press. 
ZGUSTA,  Ladislav.    Manual  of  lexicography.  Prague:    Academia,  1971. 
ZGUSTA,   Ladislav.      1967.      "Multiword  lexical   units."      In:   Linguistic 

Studies  Presented  to  Andre  Martinet,    ed.  by  Alphonse  Juilland. 

Word  23:     578-587. 


SlfudiBB  in  ^he  Linguistic  Sci 
Volume  17,  Nuuber  S,  Fall  19B7 


SOME  REMARKS  ON  WHY  THERE  IS  IMPLICATURE 
Georgia  M.  Green 


"What  I  means  and  what  I  says  is 
two  different  things,"  the  BF6 
announced  rather  grandly.  (Roald 
Dahl,  The  BFG.  p.  AA,  New  Yorkj 
Farrar  Straus  GirouM.  1382) 


Apparently  it  is  a  commonly  held  belief  that  direct 
communication  is  more  effective  than  indirect.  Why  then  is 
implicature  so  pervasive  in  natural  discourse  of  all  genres, 
registers,  and  styles?  Perhaps  the  answer  is  that, 
conventional  wisdom  to  the  contrary,  implicature  is  in  fact 
quicker,  safer,  and  more  effective  than  "just  saying  exactly 
what  you  mean.  "  This  idea  is  compared  with  proposals  by  Brown 
and  Levinson  <1978),  Katz  <19ai),  and  Sperber  and  Wilson 
<19B&)  which  touch  on  some  of  these  issues,  but  the  following 
question  arises]  if  implicature  is  quicker,  safer,  and  more 
effective  than  plain,  literal  speech,  why  isn't  there  only 
implicature?  The  remainder  of  the  paper  is  devoted  to  a 
resolution  of  this  paradox. 


0.  Readers  who  are  addicted  to  reading  "Dear  Abby"  may  recall  that 
around  the  end  of  1986  a  number  of  her  readers  got  hot  under  the  collar 
about  such  "barbaric",  "violent",  "inhumane"  expressions  as  to  kill  two 
birds  with  one  stone,  and  more  than  one  way  to  skin  a  cat.  Ms.  Van  Buren 
thought  they  had  a  point,  and  attempted  to  close  the  discussion  with 
this  observationt 

DEAR  CAT  LOVER:  Communication  is  at  its  best  when  one  says  exactly 
what  one  means  in  the  most  direct  and  simple  way,  so  when  stating 
that  something  can  be  done  in  more  ways  than  one,  there  is  no 
reason  to  drag  a  cat  into  it. 

C"Dear  Abby",  Champaion-Urbana  News-Gazette.  11-19-86,  p.  C-23 

Apparently  it  is  a  commonly  held  belief  that  direct  communication 
is  more  effective  than  indirect.  Why  then  is  implicature  so  pervasive  in 
natural  discourse  of  all  genres,  registers,  and  styles?  One  can  get  some 
feel  for  just  how  pervasive  it  is  from  the  survey  of  texts  sketched  in 
Appertdix  I.  The  texts  cited  there  contain  an  average  of  a  little  more 
than  one  implicature  per  hundred  words  of  text,  with  implicatures 
comprising  an  average  of  IIX  of  the  words  in  the  texts,  counting  as 
implicature  any  intentional,  non-conventional  use  of  language  with  a 
reasonably  determinable  specific  intended  import.  Thus,  they  include, 
among  other  kinds  of  examples! 

1)  cases  of  directive  hints  <I  was  hoping  you'd  oive  we  a  ride  to 
implicate  <abbreviated  "I>")  'Please  give  me  a  ride*)  and  other 
Manipulative  maneuvers). 
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S)  rhatorical  Maneuvers,  e.g. 

a.  Rhetorical  quest iona  like  I«  the  Pope  Catholic?.   and  How 

could  I  give  vou  a  ride?  <1>  'I  couldn't  give  you  a  ride) 

b.  implied  challenges  like  The  car  isn't  fixed  vet?  (I>  'fire 
you  sure  the  car  isn't  fixed  yet?' /'What  do  you  mean,  "The 
car  isn't  fixed  yet"?'), 

c.  use  of  factive  constructions  to  (conventionally)  implicate 
a  presupposition,  e.g.  Do  vou  realize  your  liohts  are  on? 
<I>  'Your  lights  are  on') 

3)  explanations,  including 

a)  asserting  an  effect  to  implicate  its  cause  (The  little  girl 
said.  I  will  get  a  policeman  doll."  I)  'The  little  girl 
thought  a  policeman  doll  would  keep  vandals  away  from  the 
dollhouse' ), 

b)  asserting  the  antecedent  of  a  mutually  manifest  (cf.  Sperber 
and  Wilson  1986)  hypothetical  to  implicate  (by  modus  ponens) 
its  consequent  (The  car's  not  working  I)  'I  can't  give  you  a 
ride' ) } 

c)  similarly,  by  modus  tollens,  denying  the  consequent  of  a 
mutually  manifest  hypothetical  to  implicate  that  the 
antecedent  is  false  (Fred's  not  old  enough  to  get  a  driver's 
1 i cense  I)  'Fred  doesn't  have  a  (validly  obtained)  driver's 
1 i  cense' ) 

Still,  this  survey  gives  a  fairly  conservative  picture  of  the 
pervasiveness  of  implicature,  because  the  count  did  not  include  two 
kinds  of  utterly  pervasive  implicaturesi 

1)   very   obvious   implications   of   connection   between   two 
propositions,  for  example! 

a)  implications  of  temporal  sequence  (Claire  sat  down  heavily  at 
her  desk.  She  drank  the  contents  of  her  inkwell. ) 

b)  implications  of  causality  or  consequence  (I  will  rake  the 
leaves  that  have  fallen  on  Toad's  lawn.  He  will  be 
surprised. ) 

S)  metaphors  and  extensions  of  literal   meaning  1,   for  example,   a 
spark  of  anger,  to  sail  a  report  across  a  desk. 

Returning  to  the  question  of  why  implicature  is  so  pervasive, 
perhaps  the  answer  is  that,  "Dear  Abby"  and  conventional  wisdom  to  the 
contrary,  implicature  is  in  fact  quicker  (because  it  leaves  out  a  lot  of 
what  is  intended  to  be  understood),  safer  (because  what  is  intended  is 
not  said),  and  more  effective  than  "just  saying  exactly  what  you  mean" 
(because  it  depends  on  the  hearer  to  work  out  just  what  is  intended). 

1.  Quicker.  Obviously,  if  you  can  implicate  in  S  words  and  half  a 
second,  what  it  would  take  £5  words  and  two  and  a  half  seconds  to  spell 
out  or  explicate,  it  saves  time  and  effort  to  implicate  it. 2  For 
example,  suppose  that  fidele  has  just  asked  her  boyfriend  Bruce  how  she 
will  get  home  if  he  doesn't  pick  her  up,  and  he  tells  her  to  take  a  cab. 
She  may,  without  hesitation,  respond  with  (la),  intending  this  to  serve 
as  a  rational,  motivated  rejection  of  his  suggestion,  as  spelled  out  in 
<lb). 
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la.  I  don't  have  any  money. 

lb.   That's  no  good.   I  can't  take  a  cab  because  you  have  to  have 
money  to  take  a  cab,  and  I  don't  have  any  money. 

Of  course,  if  reconstructing  the  intended  implicature  requires  time- 
consuming  inferericing  by  the  addres«ee,  as  is  likely  in  examples  like 
those  in  <£>,  then  the  speaker  doesn't  save  anything  by  choosing 
implicature  over  directness,  except  perhaps  wear  and  tear  on  her  vocal 
cords. 3 

&a.   Don't  they  put  turn  signals  on  Chevrolets  anymore?   <I>... 
3a'.  You  have  inconvenienced  me  by  not  signalling  when  you  turned.) 

CfroM  Wilson  and  Sperber  1981}   1£5,  as  a  response  to  the  question, 
"Do  you  ever  talk  to  Charles?"3 

2b.   I  never  talk  to  plagiarists.    <I>... 
Sb*.  No,  because  he's  a  plagiarist.) 

But  often  enough,  the  assumptions  necessary  to  reconstruct  the  intended 
implicature  are  readily  available/immediately  accessible,  as  is 
presumably  the  case  in  <1>.^  In  such  cases,  the  implicature  may  be 
evoked^  almost  instantaneously.  Coriceivably,  inferring  the  implicature 
(lb)  from  (la)  may  take  more  effort^  on  the  part  of  the  addressee  than 
just  understanding  what  is  explicit  in  (lb),  but  as  long  as  it  doesn't 
take  a  noticeable  amount  of  time,  the  speaker  has  no  reason  to  be 
concerned — consciously  or  subconsciously,  and  in  fact,  the  amount  of 
time  required  to  actually  utter  (lb)  is  probably  several  times  greater 
than  the  amount  of  time  required  to  infer  it  from  (la).  Still,  I  think 
it  is  safe  to  say  that  sometimes  people  choose  to  implicate  messages 
rather  than  explicate  them  because  it  takes  less  effort^  and  less  time. 

Katz  has  speculated  (lS77i  S0)  that  "pragmatics  saves  us  from  C...3 
wasteful  verbosity"  and  "lengthy  utterances"  that  would  be  required  "if 
everything  we  wanted  to  express  had  to  be  spelled  out  explicitly  in  the 
grammar  Csicl  of  our  sentences, "  but  I  do  not  want  to  be  misunderstood 
as  subscribing  to  the  Katzian  principle  of  Effability,  which  entails 
that  every  implicature  could  be  expressed  explicitly  as  the  literal 
interpretation  of  some  sentence  (Katz  19611  £26),  because  I  don't  think 
this  is  true.  Sometimes  an  intended  implicature  cannot  be  adequately 
paraphrased  by  any  amount  of  additional  clauses  and  subclauses.  The 
implicatures  of  the  rhetorical  retorts  in  (3)  do  not  seem  to  me 
adequately  paraphrased  by  any  of  the  corresponding  sentences  in  (A). 

3a.   Do  chickens  have  lips? 

3b.   And  I'm  the  queen  of  Rumania. 

4a.   Obviously  not. 

4a'.  No,  arid  you  ought  to  have  known  it. 

Ab.   That's  a  ridiculous  assertion. 

Ab' .  That  assertion  is  as  ridiculous  as  the  claim  that  I'm  the 

queen  of  Rumania. 
Ab".  There's  no  more  evidence  for  that  assertion  than  there  is 

for  the  claim  that  I'm  the  queen  of  Rumania. 
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To  taUa  a  mora  contaxtualizad  axampla,  in  the  movia  Casablanca,  there* a 
a  Bcerte  tovnarda  the  middle  where  Claude  Rains  as  police  Captain  Louis 
Renault  aska  Humphrey  Bogart  <his  friend  the  cafe  owner,  Rick)  if  he  has 
some  stolen  letters  of  transit  which  will  enable  a  suspected  Free  French 
spy  to  escape  from  Vichy-controlled  Casablanca.  Rick  responds  with  <5). 

5.  Louis!  Ore  you  pro-Vichy  or  Free  French? 

The  Captain  immediately  understands  this  as  a  rebuke  for  asking  the 
question,  and  responds,  "Ha-ha!  Serves  me  right  for  asking  a  direct 
question.  The  subject  is  closed."  Could  Rick  have  achieved  the  same 
effect  by  answering  more  explicitly,  as  in  (6)? 

S.  You  shouldn't  ask  me  that.  If  I  answer,  it  will  change  forever 
the  nature  of  our  relationship,  and  we  won* t  be  able  to  get 
along  as  we  have  in  the  past. 

I  think  not.  I  think  if  Rick  responds  with  anything  more  explicit  than 
(7),  which  implicates  <£)  just  as  much  as  <5)  does,  then  the  damage 
referred  to  in  <£)  is  already  done. 

7.  Vou  don't  really  want  to  make  me  answer  that,  do  you? 

So  maybe  there  is  no  way  to  explictly  convey  exactly  what  <5)  implicates 
BO  well. 


2.  Safer.  Regardless  of  whether  iraplicature  is  quicker  or  easier 
than  being  explicit,  it  is  often  safer,  as  is  evident  from  even  a 
superficial  reading  of  Grice  <1975)j  since  implicatures  are  not  explicit 
or  entailed,  they  are  cancellable,  and  the  speaker  cannot  be  held 
responsible  for  committing  herself  to  whatever  a  hearer  infers  MIGHT  <or 
might  not)  have  been  intended  to  be  inferred. ^  For  example,  the 
rhetorical  question  in  <aa)  has  exactly  as  many  words  as  its  literal 
gloss  in  <8b),  and  only  one  more  syllable,  so  it  can  hardly  count  as 
being  easier  to  utter.  But  being  a  question,  it  can't  be  literally 
false,  so  uttering  <6a)  is  a  pretty  safe  way  to  convey  what  (Bb) 
expresses  literally,  without  risking  losing  face  by  being  contradicted. 

6a.  Uho  uriderstands  Joyce? 
8b.  No  one  understands  Joyce. 

Since  the  speaker  hasn't  made  an  assertion,  she  can't  be  shown  to  have 
made  a  false  one,  and  accused  of  ignorance,  deceit,  or  worse.  (Moral > 
don't  debate  with  someone  who  relies  on  such  rhetorical  figures.) 

There  are  lots  of  other  kinds  of  situations  in  which  it  is  safer 
to  implicate  than  to  "speak  plainly."  To  take  an  example  from  real  life, 
after  the  University  of  Pittsburgh  basketball  team  took  a  IS-point  half- 
time  lead  over  Georgetown,  one  of  the  Pitt  players  addressed  the 
compliment  in  (9)  to  the  Georgetown  team  captain,  saying  it  "like  it  was 
over",  according  to  the  Georgetown  coach. ^ 

9.  Nice  game. 
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The  Georgetown  team  took  the  insult  to  heart  and  won  the  game  65-5£,  but 
the  point  i*  that  however  obvious  the  Pitt  player's  insult  was,  it 
wasn't  "fightin'  words,"  because  it  wasn't  explicit.  Had  the  Pitt 
player  chosen  to  make  his  insult  more  explicit,  and  less  deniable,  he 
would  have  been  inviting  a  charge  of  being  a  poor  sport,  or  even 
physical  retaliation.  As  it  was,  the  worst  one  can  accuse  him  of  is  poor 
judgement. 

Indeed,  Brown  and  Levinson  <197a)  regard  conversational 
implicature  as  a  strategy  of  major  importance  in  situations  which 
constitute  a  serious  threat  to  the  addressee's  face,  and  thus  require 
extreme  tact.  The  tactic  of  hinting  for  something  desired  rather  than 
asking  for  it  is  one  of  the  clearest  examples  of  this.  If  Adele  choses 
<lda)  over  (10b)  or  (10c)  to  try  to  get  her  uncle  to  lend  her  his 
camera,  she  doesn't  risk  putting  him  on  the  spot  if  he  can't  or  doesn't 
want  to,  and  she  doesn't  risk  being  rejected  and  hurt  if  he  doesn't  mind 
saying  no. 

lOa.  I   want   to  take  pictures  at   the  game  tomorrow,   but  my 

camera' s  broken. 
10b.  Mill  you  lend  me  your  camera? 
10c.  Lend  me  your  camera. 

Adele's  uncle  doesn't  have  to  choose  between  acceding  or  being  an  ogre 
or  making  excuses  for  not  complying {  since  Adele  has  just  made  a 
statement,  he  can  decline  to  pick  up  the  implied  hint,  and  address  the 
issue  of  the  broken  camera.  Maybe  it  can  be  repaired.  Maybe  Uncle  once 
had  a  broken  camera.  If  he  doesn't  pick  up  on  the  hint,  Adele  may  infer 
that  he  doesn't  want  to  lend  her  his  camera,  but  neither  has  threatened 
the  other's  face,  or  done  anything  to  make  the  other  resentful. 

3.  More  effective.  Sperber  and  Wilson  note  that  metaphor  arid 
other  figures  of  speech  are  more  effective  than  their  literal  glosses, 1® 
saying  that  their  "poetic  effects  are  generally  destroyed  by  being 
explicitly  spelled  out"  (13a6t  56).  I  think  this  is  correct,  and  that 
their  greater  effectiveness  is  one  of  the  reasons  people  use  rhetorical 
figures  to  implicate,  rather  than  speaking  plainly.  (Indeed)  examples 
were  cited  above  which  showed  that  the  explicit  gloss  of  the  content  of 
an  implicature  could  completely  counteract  the  effect  achieved  by 
implicating  rather  than  asserting  that  content.  But  to  show  that 
implicatures  can  be  more  effective  than  their  literal  glosses,  it  is 
necessary  to  look  at  cases  where  the  gloss  expresses  the  same  content  as 
the  implicature,  just  not  as  forcefully.  Example  (Sb)  above  is  a  case  in 
point.  Example  (£b)  and  its  gloss  (Sb' )  have  exactly  the  same  number  of 
syllables,  so  (Sb)  isn't  quicker  to  say,  and  as  was  suggested  above,  it 
May  well  take  longer  for  A  to  compute. 

CThe  speaker  has  just  been  asked,  "Do  you  ever  talk  to  Charles?! 
Sb.   I  nmvar   talk  to  plagiarists. 
Sb'.  No,  because  he's  a  plagiarist. 

It  isn't  much  safer,  either,  because  although  it  doesn't  assert,  as 
(Sb' )  does,  that  Charles  is  a  plagiarist,  other  immediately  available 
interpretations  (e.g.  those  in  (ID)  are  hardly  less  risky,  and  none  is 
so  readily  available.  Thus,  in  the  same  context,  (Sb)  might  implicate 
(11a)  or  (lib) — or,   as  in  (lie),  that  for  some  reason,  the  speaker  does 


not  want  to  be  cooperative. 

11a.   Speaker  won't  answer  the  question  because  the  questioner  is  a 

plagiarist, 
lib.   Speaker   is  changing  the  subject   because  the  topic   is 

dangerous, 
lie.    Speaker   is   being   irrelevant   because   s/he   is   ignorant, 
hostile,  stoned,  ... 

Since  example  <£b)  is  not  particularly  "poetic",  it  is  not 
surprising  that  cleverer  figures  that  make  more  dernands  on  the  addressee 
are  even  more  effective  compared  to  the  literal  representations  of  their 
intended  interpretations.  The  examples  in  (12)  and  (13)  are 
representative  of  what  I  have  in  mind — not  very  subtle,  and  not  utterly 
original,  but  very  effective. 

l£a.   Christmas  Snow  is  about  a  widow  (Katherine   Helmond)   with 

two  children  who* s  going  to  be  evicted  from  her  home  and 

candy  store  right  after  Christmas.   Will  the  hard-hearted 

landlord   <Sid  Caesar)  get  the  holiday  spirit  before  it*s 

— >  too  late?  Did  the  glass  slipper  fit  Cinderella? 

(Parade  Magazine  12/14/86,  p.  9) 

!...)  12b.   Of  course  the  landlord  gets  the  holiday  spirit  before  it's 
too  late;  stories  like  this  have  to  have  a  happy  ending. 

13a.  The  Bulls  must  have  realized  by  now  that  they  can't  win 
many  ball  games  if  Michael  Jordan  has  to  do  all  the 
scoring.  What  they  can  do  about  it — the  rest  of  the 
players  look  at  double  digits  the  way  runners  once  looked 
at  the  four-minute-mile — is  a  mystery.  Cheer  up!  By  the 
time  you  read  this,  the  Bulls  will  have  traded  their  way 
— >  into  the  playoffs.  By  the  time  you  read  this,  Ed  Meese 
will  be  writing  a  column  for  Penthouse. 
(Chicago  Magazine  12/86,  p.  81) 

I)  13b.  That's  wishful  thinking,  of  course;  the  Bulls  are  no  more 
likely  to  get  to  the  playoffs  than  presidential 
pornography  commission  chairman  Ed  Meese  is  to  write  a 
column  for  the  skin  magazine  Penthouse. 

In  the  case  of  <12)  and  <13),  the  paraphrases  of  the  intended 
implicatures  do  not  make  explicit  anything  that  the  utterer  would  have 
good  reason  to  keep  implicit,  as  was  the  case  with  examples  <5),  (9), 
and  (10a),  which  demonstrated  the  content  and  safety  aspects  of 
implicature.  So  how  can  (12)  and  (13)  be  effective  rather  than 
uncooperatively,  or  even  annoyingly,  obscure?^*  Perhaps  I  need  to  say  a 
little  at  this  point  about  what  I  mean  by  "effective".  I  don't  pretend 
to  have  a  lot  of  statistical  data  that  experimental  subjects  are  more 
moved  or  persuaded  by  implicatures  of  this  sort  than  by  their  plain 
language  counterparts.  And  I'm  not  sure  people  have  intuitions  of 
effectiveness.  But  I  think  they  do  experience,  in  at  least  some 
situations,  a  feeling  of  satisfaction  at  producing  implicatures 
appropriate  to  their  social  needs,  and  the  "Hunh!  I  know  exactly  what 
you  mean"  reaction  on  inferring  them  in  understanding  discourse.  Why 
should  implicatures  provoke  these  reactions? 
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Mayb*  because  Mi  thin  limits,  »  message  that  takes  more  work  to 
process  makes  a  deeper,  more  permanent  impression!^,  and  speakers 
subconsciously  assume  this.  This  is,  after  all,  the  principle  that 
underlies  our  (Western?  American?)  assumption  that,  within  limits, 
people  learn  the  hows  and  whys  of  rules  and  principles  better  by  working 
out  (and  maybe  experiericing)  the  consequences  on  their  own  than  by  being 
told  what  the  consequences  are.  This  is  the  basis  for  teaching  geometry 
by  having  students  do  proofs,  anatomy  by  having  them  do  dissection,  arid 
linguistics  by  having  them  do  analysis  and  argumentation.  Uhat  I  am 
suggesting  is  that  when  a  person  works  out  an  implicature,  by  expending 
whatever  effort  is  required  to  do  that,  he  has  a  stake  in  the 
interpretation  he  (re) constructs  being  the  correct  one.  Narcissist  that 
he  is,  having  worked  hard  on  it,  he  is  inclined  to  accept  it.  This  does 
not  mean  that  he  believes  it,  of  course,  if  it  is  an  assertion,  but  it 
does  mean  that  he  has  entertained  and  evaluated  (at  some  level  of 
consciousness)  the  proposition  it  represents  and  the  ones  that  connect 
it  to  what  was  said,  and  has  considered  to  what  extent  the  speaker 
accepts  those  propositions.  In  other  words,  it  has  a  much  better  chance 
of  making  an  impression  on  him  than  a  "plain  language"  message  that  he 
merely  decodes. ^ 3  Probably  most  exploiters  of  implicature  have  never 
thought  about  why  it  is  an  effective  rhetorical  device,  but  they  may 
take  such  a  mechanism  as  this  for  granted  nonetheless. 

So  maybe  that*s  a  case  for  saying  that  (within  limits) 
implicatures  SHOULD  BE  more  effective  than  plain  speech,  because  they 
demand  more  of  the  hearer.  But  is  there  any  corroboration  for  this  idea 
that  deeper  processing  results  in  a  greater  effect?  As  a  matter  of  fact, 
there  is. 

Epstein,  Phillips,  and  Johnson  (1975)  (cf.  also  Phillips  and 
Epstein  1979)  found  that  when  experimental  subjects  were  asked  to  recall 
pairs  of  words,  subjects  who  were  instructed  to  focus  on  semantic 
similarities  or  differences  did  better,  on  recall  tasks,  than  subjects 
who  were  told  to  focus  on  orthographic  similarities,  arid  than  subjects 
who  were  simply  instructed  to  try  to  learn  the  pairs.  This  superiority 
held  both  for  related  and  unrelated  pairs  of  words,  and  on  delayed 
recall  as  well  as  immediate  recall.  Epstein  and  Phillips  (197&)  showed 
that  such  semantic  processing  facilitated  recall  over  the  other 
processing  modes  even  after  S  days  and  21  days. 

And  there  is  older  evidence  that  use  of  metaphor  has  a  significant 
persuasive  effect  (Bowers  and  Osborn  19&&).!^  In  other  persuasion 
experiments,  subjects  who  drew  the  conclusions  for  themselves  from 
stimuli  which  left  them  implicit  were  more  persuaded  thereby  (Katz, 
Sarnoff,  and  McClintock  1956). 

Thus  there  is  experimental  evidence  that  the  more  one  thinks  about 
the  meaning  of  what  is  said,  the  deeper  and  more  long  lasting  the 
impression  it  makes.  It  follows  that  implicatures  should  be  more 
persuasive  in  communicating  what  was  intended  (when  they  are  worked  out 
as  intended)  than  their  plain  language  counterparts,   precisely  BECAUSE 


they  require  more  effort  to  work  out. 


IS 


The   idea  that   implicature  should   be  more  effective  than  plain 
speech,   because  it  demands  more  work  by  the  hearer  is  contrary  to  what 
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Sparber  and  Wilson**  account  predicts.  Sperber  and  Wilson  assert 
<i98&tS16)  that  "a  speaker  aiming  at  optimal  relevance  will  leave 
implicit  everything  that  her  addressee  can  be  trusted  to  supply  with 
less  effort  than  would  be  needed  to  process  an  implicit  prompt."  This 
follows  from  the  definition  of  optimally  relevant.  which  aiflounts  to 
'generating  the  largest  number  of  contextual  effects  in  the  addressee 
while  requiring  the  least  processing  effort  by  the  addressee*  (196&:i53, 
158).  But  it  implies  that  implicatures  would  have  to  be  either  very 
obvious,  or  very  pregnant  with  meaning  to  even  be  candidates  for  being 
reconstructed  correctly,  and  that  speakers  and  addressees  should  judge 
implicatures  like  HZ)    and  <13>  to  be  more  trouble  than  they  are  worth. 

On  my  account,  part  of  what  makes  implicature  (and  figurative 
language  of  all  sorts)  effective  is  the  extra  processing  effort  it 
demands.  But  this  extra  processing  effort  should  make  it  (relatively) 
irrelevant  for  Sperber  and  Wilson;  since  understanding  <lSa)  and  <13a) 
as  intended  requires  a  significant  amount  of  inferencing,  while  the 
explicit  counterparts  in  <12b)  and  <13b)  are  not  so  much  longer  that 
they  would  seem  to  require  a  substantial  processing  effort,  utterances 
like  (iSb)  and  (13b>  ought  to  be  preferred  to  (ISa)  and  <13a)  by  an 
optimally  relevant  speaker,  according  to  the  Sperber  and  Wilson  account. 
Furthermore,  discourses  like  <lSa)  and  <13a)  ought  to  be 
incomprehensible  to  the  addressee,  given  the  availability  <to  the 
speaker)  of  <12b)  and  <13b).  I  doubt  that  either  is  the  case,  though  it 
is  not  clear  that  'optimal  relevance*  and  'speaker  aiming  at  optimal 
relevance'  are  empirically  vulnerable  notions. 


4.  Why  is  there  literal  speech?  If  implicature  is  quicker,  safer,  and 
more  effective  than  plain,  literal  speech,  why  isn't  there  only 
implicature?  Since  it  is  obvious  that  people  do  find  uses  for  "plain, 
literal"  speech,  I  will  discuss  three  reasons  why  this  is  not  an  anomaly 
for  the  explanation  proposed  here.  These  are,  of  course,  above  arid 
beyond  the  obvious  point  that  some  sense  of  literal  meaning  or  normal 
use  is  necessary  to  calculate  implicatures  from. 

First  of  all,  so-called  literal,  explicit  speech  is  considerably 
less  literal  and  more  figurative  than  unreflecting  judgments  would 
estimate,  as  Nunberg  <197B)  has  suggested,  as  I  have  described  elsewhere 
at  some  length  <6reen  In  press),  and  as  Sperber  and  Wilson  indicate  at 
several  points  < 1966: 193,  S30-S37).l&  In  the  unremarkable,  prosaic 
sentence  in  <1A),  at  least  three  referring  expressions  are  not  intended 
or  understood  in  their  most  (likely)  literal  senses. 

14.   This  newspaper  claims  that  the   president   bombed  the  hamlet 
because  North  Vietnam  would  not  come  to  terms. 

Newspaper  refers  to  a  reporter  or  an  editorial  staff,  not  a  paper  and 
ink  product!  bombed  refers  to  an  authorization  to  bomb,  not  the  actual 
loosing  of  explosive  materiel {  and  North  Vietnam  does  not  refer  to  a 
geographic  or  political  entity,  but  to  the  authorized  representative  of 
one.  Perhaps  as  much  as  20%  of  the  content  words  in  a  narrative  are 
extended,  metaphorical  or  metonymic  uses  that  are  not  listed  in 
dictionaries^^,  but  are  related  to  senses  that  are,  by  one  or  more 
regularly  recurring  referring  functions  (Nunberg  1978i29ff). 
Implicature   is  involved  in  the  determination  of  reference  here  in  that 
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th*  assumption  that  tha  speaker  is  saying  something  relevant  is 
necessary  to  enable  the  hearer  to  infer  what  is  being  said  "literally". 

Second,   sometimes  it  is  more  effective  to  be  more  explicit  BECAUSE 

it  takes  longet the  same  reason  that,   in  other  circumstances,   it   is 

more  effective  to  implicate  than  to  be  explicit.  Here  the  speaker 
guarantees  that  the  addressee  Mill  focus  attention  on  complex 
relationships  involving  the  subject  at  hand  by  elaborating  them  artd 
thereby  extending  the  discussion.  By  being  explicit,  the  speaker 
guarantees  that  the  addressee  will  know  what  conclusions  the  speaker 
intends  to  be  drawn.  As  teachers,  one  of  the  first  things  we  learn  is 
that  the  crisp,  tight  prose  we  learned  to  write  for  our  peers  and  our 
teachers  is  totally  inappropriate  for  lecturing  to  our  students,  because 
in  a  lecture  situation,  students  don*t  have  enough  control  over  the  flow 
of  content  to  ensure  that  they  have  time  to  interpret  and  absorb  what  is 
being  said,  the  way  they  would  in  informal  conversation  or  in  reading  a 
book. 

Third,  although  I  have  indicated  circumstances  when  it  is  safer  to 
implicate  than  to  be  explicit,  in  other  circumstances  implicating  may  be 
riskier,  not  safer,  than  being  explicit.  A  speaker  who  cannot 
confidently  estimate  how  many  of  his  assumptions  his  audience  shares, 
and  assumes  he  assumes  they  share,  risks  producing  discourse  that  will 
be  misinterpreted,  or  even  disregarded  as  too  much  trouble  to  interpret. 
All  of  us  surely  have  read  papers  where  it  just  didn't  seem  worth  the 
effort  to  try  to  figure  out  why  some  respected  colleague  was  making  such 
apparently  false  <or  unwarranted,  or  absurd)  claims,  or  what  some  writer 
assumed  the  connections  were  among  the  propositions  which  got 
represented  explicitly.  (Moral i  if  you  need  to  make  sure  someone  sees 
things  as  you  see  them,  spell  it  out.  This  is  Rule  One  of  didactic 
exposition. ) 

Implicating  can  be  socially  risky  as  well,  and  for  the  same  reason. 
While  ambiguity  is  what  makes  implicature  safe — giving  the  implicator  as 
well  as  the  addressee  a  reading  to  hide  behind,  it  is  also  what  makes  it 
risky  in  other  circumstances,  when  the  implicator  canrtot  confidently 
calculate  which  interpretation  will  be  inferred.  For  example,  suppose 
that  Adele,  who  wanted  to  borrow  her  uncle's  camera,  was  not  worried 
about  putting  her  uncle  on  the  spot,  but  instead  was  very  concerned  to 
let  him  know  she  needed  a  camera.  She  would  be  foolish  to  use  an 
implicature  like  <10a)  <l»lv  camera's  broken),  for  the  very  reason  that  it 
was  useful  in  the  other  situationi  if  the  uncle  interprets  her  saying, 
"My  camera's  broken"  as  just  an  observation,  and  fails  to  infer  that 
Adele  wants  him  to  lend  her  his  camera,  she's  wasted  an  opportunity. 


16 


Finally,  a  person  who  never  spoke  explicitly  and  "said  what  he 
meant",  but  only  implicated  it,  would  make  a  rather  uncongenial 
conversational  partner.  Imagine  trying  to  carry  on  a  serious 
conversation  with  someone  like  Kit  in  the  hypothetical  dialogue  in  (IS). 

15a.  Christ  Are  you  going  to  Kim's  party? 

15b.  Kit  I    Is  the  Pope  Catholic? 

15c.  Christ  Who  are  you  going  to  take? 

ISd.  Kitt   Lou  said  Sandy  was  going  to  be  in  town. 

ISe.  Christ  Do  you  think  Lou  would  mind  if  I  brought  Dana? 

iSf.  Kitt   Who  cares  what  Lou  thinks. 
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15g.  Chrie:  Did  you  see   this   morning's  papet Iran  is  calling  for 

their  people  to  overthrow  the  Saudi  royal  family. 

15h]  Kit  I  And  they  laughed  Hhen  the  Ayatollah  sat  down  to  play 
the  piano. 

A  person  who  spoke  only  in  implicatures  would  come  across  as  cold, 
suspicious,  cynical,  and  whatever  else  you  would  call  someone  who 
Meticulously  avoided  committing  himself  to  a  position. 

5.  Having  your  cake  and  eating  it  too.  There  is  another  paradox  (a 
double  paradox,  in  fact)  that  deserves  to  be  addressed.  On  the  one  hand, 
as  mentioned  above,  implicature  is  a  valuable  politeness  strategy  in 
situations  where  it  is  important  not  to  impose  on,  or  make  demands  of 
the  addressee.  Vet  I  have  claimed  that  implicature  is  effective  BECAUSE 
it  makes  demands  on  the  hearer.  On  the  other  hand,  because  use  of 
implicature  presumes  that  speaker  and  addressee  share  a  lot  of 
assumptions  and  values,  including  the  assumption  that  the  addressee  will 
be  willing  to  do  a  little  work  to  figure  out  what  the  speaker  wants  to 
convey,  implicature  is  also  a  useful  social  interaction  tactic  in  more 
intimate  situations  where  it  is  important  to  display  friendliness,  trust 
and  shared  values  (cf.  Lakoff  1S73  and  Brown  and  Levinson  1978iie&-13A). 
Implicature  imposes  processing  demands  on  the  addressee}  how  can  it  be  a 
politeness  strategy  in  situations  where  making  demands  is  inconsiderate, 
and  at  the  same  time  be  a  politeness  strategy  in  other  situations 
BECAUSE  it  makes  demands  on  the  addressee?  The  answer  is  that  in 
situations  where  it  is  important  to  avoid  imposing  on  the  addressee, 
speakers  seem  to  treat  the  processing  demands  of  a  polite  implicating 
speech  act  as  insignificant  compared  to  the  social  impositions  that 
being  direct  entails;  making  a  direct  request  puts  the  addressee  on  the 
spot  while  hinting  does  not  (because  its  vagueness  makes  the  interided 
inference  only  one  of  the  plausible  inferences  that  would  be  rational  in 
that  context).  But  an  implicature  like  (15h)  can  be  a  positive 
politeness  tactic  <Brown  and  Levinson  19781106-134)  among  intimates, 
because  it  saves  time  and  promotes  solidarity,  implying,  *we  share  so 
much  and  you  are  so  knowledgeable  and  alert  that  you  can  tell  what  I 
mean  even  if  I  only  allude  to  it  obliquely*.  Such  implicatures  are 
polite  BECAUSE  1)  to  the  extent  that  they  make  a  processing  demand,  they 
trade  on  it,  conveying,  'among  close  friends,  a  little  processing  demand 
is  insignificant',  and  2)  to  the  extent  that  the  speaker  and  addressee 
do  know  each  other  well,  such  implicatures  don't  make  as  much  of  an 
inferential  demand  as  they  would  if  addressed  to  a  stranger  or  a  mere 
acquaintance.  Implicatures  like  (15h>  would  be  inconsiderate  addressed 
to  a  stranger. 

This  paradox  of  implicature — that  it  can  be  at  the  same  tiHW 
ineffably  suited  to  particular  conversational  purposes,  and  singularly 
unsuitable  as  a  general  conversational  strategy — has  parallels  in 
certain  of  Darwin's  arguments  for  natural  selection.  Gould  (1985)  points 
out  that  Darwin  concentrates  not  on  the  "beautiful,  optimally  designed 
adaptations  of  organisms  to  their  environments"  <1985t42),  since  good 
design  is  what  should  be  expected  from  any  evolutionary  theory.  Rather, 
he  focusses  on  the  anomalies  that  are  only  attributable  to  selection 
based  on  reproductive  success — the  crux  of  natural  selection.  Gould 
points  out  that  the  most  striking  of  these  are  "those  structures  and 
habits  that  plainly  compromise  the  good  design  of  organisms  (and  the 
ultimate  success  of  species)   but   just   as  clearly    increase   the 
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r«productivB  prowBSB  of  individuals  b*aring  thsM" — a. g.  tha  tail 
faathars  of  paacocks  and  tha  iaunanaa  antlars  of  tha  axtinct  Iriah  alk. 
iMplicature  is  logically  an  unneceaBary  excreacenca  given  tha  aina  qua 
non  of  a  coMMunicativa  ayataMi  a  finite  recuraive  ayntax  and  a 
coMpoaitional  aaMantica.  And  in  the  long  run,  we  could  aurely  get  along 
without  it.  But  in  the  short  run,  how  could  we  give  up  a  tool  so  useful 
in  our  daily  struggles  for  conversational  auccass? 


Appendix  I 

As  indicated  in  Table  I,  saMples  (averaging  £70  worda  each)  were 
taken  froM  three  genres  of  discourses  children's  literature 
(specifically,  picture  books  intended  for  the  very  young),  expository 
prose  (an  article  frou  Newsweek  and  a  two-page  spread  from  a  linguistics 
Monograph  on  graimiatical  theory),  and  natural  apeech  (excerpts  froM  five 
narratives  in  Terkel  (i97S)  and  one  telephone  conversation).  The  texts 
were  not  selected  completely  at  random,  of  course,  and  they  may  not 
constitute  a  representative  sample  of  English-language  discourse. 
However,  texts  of  all  genres  can  undoubtedly  be  found  with  much  lower 
(and  much  higher)  incidence  of  implicatures  of  the  sort  counted.  More 
importantly,  none  of  the  texts  is  remarkable  on  account  of  its 
implicaturesi  examining  them  in  another  context,  the  reader  would  not  be 
likely  to  notice  any  of  tha  implicatures. 

The  number  of  implicatures  per  100  words  ranged  from  .&  to  2. S| 
the  percent  of  text  that  invited  implicature  ranged  from  7.5  to  19. 

TEXT  (length  in  words)      IMPLICATURES  UORDS  PER  100  WORDS  X   OF  TEXT 

Pry  YoM  Wy  Mo^h^f  (699) 
"Down  the  Hill"  (363) 
Two  Bad  Mice      (972) 

Brfinmatical  Thfory  (67A) 
Waiting  for  Muamnar  (961) 

telephone  convers.  (1293) 
Working  pp.  168-169  (766) 
Working  pp.  25S-259  (834) 
Working  pp.  682-683  (808) 


Table  Ii  Selected  implicatures  in  discourse 

The  sampling  below  of  implicatures  froai  the  texts  analyzed  should 
provide  a  sense  of  just  how  ordinary  the  sorts  of  implicatures  I 
enumerated  are|  the  ones  in  these  texts  are  not  especially  clever  or 
witty,  but  represent  a  comaK>nly  chosen  option  in  natural,  unaffected 
dii 
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133 

1.72 

19 

a 

59 

2.2 

16 

i» 

110 

1.03 

11 

5 

126 

.74 

19 

9 

96 

.94 

10 

14 

100 

1.00 

7.7 

6 

59 

.78 

7.6 

5 

63 

.60 

7.5 

9 

98 

1.11 

12 

From  Beatrix  Potter's  Ta^g  Qf  Twp  pfd  M^cei 

The  little  airl  that  the  doll-house  belonged  to. said.— "I  wil 


D«t  a  doll  dresBBd  Ilka  «  oolicewan. " 
I)  Th>  little  girl  thought  •  policauan  doll   could  k»«p  vandals 
out. 

But  the  nurse  said. — "I  will  oat  a  wou»B-trap!" 
I>  The  nurse  knew  the  vandals  were  nice,   and  that   a  doll 
wouldn't  deter  them. 


From  a  conversation: 

Ci  The  car's  not  running. 
K;  The  car's  not  running? 

I>  Are  you  sure  the  car's  not  running? 

Frow  P. D.  Eastman's  fire  You  Wv  Mother?! 

Cto  a  baby  bird]  I'm  not  your  mother.  I'm  a  dog. 
I>  I  couldn't  be  your  mother  because  I'm  a  dog   and  you're  a 
bird  and  a  dog  can't  be  a  bird's  mother. 

From  Studs  TerWel's  Working  (Janitor) i 

They  talk   about   heart   attacks  shoveling  snow.  In  one  of  my 

buildings  alone.   I  almost  had  a  block  of  snow  to  shovel — plus 
the  entrances,  plus  back  porches. 
I)  I  have  a  lot  of  snow  to  shovel. 

From  a  Newsweek  article  about  Khadaffii 

Despite  17  years  of  a  revolutionary  regime,  Libya  remains  in 
mnay  ways  a  traditional  Islamic  country;  Kaddaf i*  s 
liberalization  of  divorce  laws  and  draft  of  women  into  the  ftrmv 
passed  earlier  this  year  in  the  face  of  unprecedented 
opposition  from  the  regime's  building  blocks,  the  People's 
Congresses. 
I>  The  fact  that  revolutionary  moves  were  bitterly  opposed  is 
evidence  that  Libya  remains  a  traditional  Islamic  country. 
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X  II 

Tabl*  II  baloM  indicatas  tha  avaraga  parcant  of  Nouna,  Varbs,  and 
Adjactivas  in  nina  arbitrarily  salactad  S00-««ord  passagas  of  Modam 
British  and  Amarican  fiction  usad  in  aanaaa  not  liatad  in  larga  daak 
dictionariaa. 


Taxt  nuMbar  Parcant  of  N,  V,  A  whoea  aanaai 

Mara  not  liatad 

1  29 

2  <Ton  Robbins)  17 

3  <8aul  Ballow)  8 

4  9 

5  IS 

6  17 

7  a 

a  KTolkian)  9 

9  (Hanry  Jaitaa)  S4 

Tabla  III  Content  words  used  in  unlistad  sanses  in  salactad  texts 


^For  arguMants  that  these  represent  iMplicatura,  cf.  Grice  1975, 
Nunbarg  1978,  Qraen  1984,  Green  in  press. 

£Cf.  Horn's  (1984)  Q-Principlei  Say  as  much  as  you  can. 

3flnd  her  brain,  if  it  takes  more  effort  to  (decide  to)  use  an 
implicature  than  to  use  a  corresponding  explicit  utterance.  Because  of 
all  the  (unconscious)  decisions  that  are  involved  in  planning  ANY 
utterance  (cf.  Green  1982),  it*s  not  at  all  obvious  that  it  necessarily 
Mould.   In  any  case,  it's  an  ei«pirical  question. 

Ladislav  Zgusta  (p.c. )  points  out  that  (2b)  is  also  More  informative 
than  (2b'),  in  that  it  implicates  that  the  reason  for  not  talking  to 
Charles  is  the  speaker's  (stated)  policy  of  never  talt-.ing  to  (any) 
plagiarists.  But  (2b' >  implicates  the  existence  and  the  consequence  of 
such  a  policy  as  well,  so  it  seems  valid  to  ignore  this  in  comparing 
Mhat  the  two  utterances  would  be  intended  to  convey.  Each  asserts  a 
different  one  of  tha  two  premises  (Charles  is  a  plagiarist |  I  never  talk 
to  plagiarists)  in  the  syllogistic  answer,  and  requires  the  other  to  be 
inferred!  <2b)  also  requires  the  conclusion  to  be  inferred. 

^Levinson  (198Si4)  notes  that  "appropriate  responses  to  prior 
utterances  and  their  associated  implicatures  are  often  produced  with 
lass  time  lapse  in  speaker-transition  than  is  required  to  produce  a 
vowel"  — under  200  milliseconds. 

^By  using  this  term  I  mean  to  imply  that  the  reasoning  involved  in 
reconstructing  implicatures  is  not  conscious,  though  it  may  well  involve 
deductive  logic  of  a  familiar  sort  (cf.  Sperber  and  Milson  1986i  93- 
103). 
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^It  is  hard  to  ses  how  one  could  measura  this,  and  compara  the 
amount  of  work  involved  in  decoding,  disambiguating,  and  inferring 
senses  and  references  necessary  to  understand  <ib)  "literally"  to  the 
amount  of  work  involved  in  inferring  (lb)  from  (la)  in  context,  although 
Sperber  and  Wilson  (198&>  SIB)  assume  that  such  comparisons  are 
sensible! 

fl  speaker  aiming  at  optimal  relevance  will  leave  implicit 
everything  her  hearer  can  be  trusted  to  supply  with  less  effort 
than  would  be  needed  to  process  an  explicit  prompt. 

7l  am  referring  here  to  the  physical   effort   involved   in  the 

mechanical    aspects   of   producing   an  utterance   (movements  of  the 

diaphragm,  pharynx,  mandible,  velum,  tongue,  lips,  vocal  cords,  etc.  (or 
hand(s)  and  arm(s)). 

SSperber  and  Wilson  (19a&:S&)  suppose  that  it  is  obvious  "that  what 
is  implicitly  conveyed  by  an  utterance  is  generally  much  vaguer  than 
what  is  explicitly  expressed,"  but  it  seems  to  roe  that  neither  the 
intended  implication  nor  the  implication  inferred  is  vague.  Rather,  what 
is  SAID  only  vaguely  suggests  what  is  intended  to  be  inferred,  and 
strictly  speaking,  probably  doesn't  even  determine  a  finite  number  of 
implicatures.  Sperber  and  Wilson  go  on  to  say  that  "when  the  implicit 
import  of  an  utterance  is  explictly  spelled  out,  it  tends  to  be 
distorted  by  the  elimination  of  this  often  intentional  vagueness," 
(198&:5&).  Perhaps  this  is  too  nice  a  distinction,  but  it  may  be  that 
the  distortion  comes  less  from  "eliminating  vagueness"  as  a  general 
property,  and  more  from  speaking  the  unspeakable — propositions  or 
expressions  which  offend  or  insult  or  entail  some  face-threatening  act. 

'^Champaign-Urbana  News-Gazette.  Feb.  19,  1987,  p.  B-3,  from  the 
Associated  Press. 

^^This  seems  to  contradict  Sperber  and  Wilson's  claim  a  few  pages 
later  (198&t  £0)  that  implicit  communication  (e.g.  implicature)  is 
generally  weakly  communicated,  apparently  because  what  is  said  is  vague 
(and  this  is  why  implicating  may  be  safer).  Of  course,  what  is  said  is 
not  the  same  as  what  is  implicated  (or  it  wouldn't  be  implicated — it 
would  be  said),  so  it  may  happen  in  context  that  something  very  specific 
is  readily  inferred  (and  understood  to  be  intended  to  be  inferred)  with 
reasonable  certainty  from  some  fairly  vague  utterance,  and  such  an 
implicature  will  constitute  communication  essentially  as  strong  as  what 
is  intended  by  some  more  literally  meant  precise  and  explicit  utterance. 

llOf  course,  (13)  might  be  annoyingly  obscure  to  someone  who  didn't 
know,  and  couldn't  guess,  that  Meese  was  strongly  associated  with 
positions  advocating  censorship  and  a  very  broad  definition  of 
pornography,  and  that  Penthouse  is  considered  a  soft-core  pornographic 
magazine. 

1^1  realize  that  these  claims  bear  a  certain  resemblance  to 
advertising  slogans,  but  I  think  these  are  exactly  the  right  words. 

^^Of  course,  "merely  decoding"  an  utterance,  arriving  at  a 
representation  of  its  intended  sense  and  reference  is  no  small  feat,  and 
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irivolves  implicature  too  <cf.  Nunberg  1976,  Green  In  press  (Chs.  3  and 
S)  for  some  explication  of  exactly  what  is  involved).  The  point  is  that 
the  processing  involved  in  interpreting  an  utterance  as  inviting  a 
particular  implicature  is  all  in  addition  to  what  is  required  for 
decoding  it. 

^^McGuire  <19&9:3da>  notes  that  "Aristotle  arid  Cicero  suggested 
that  metaphor  would  add  to  persuasive  impact  by  increasing  the  perceived 
intelligence  of  the  speaker"  and  that  "Quint ilian  suggested  that 
metaphor  would  add  to  persuasive  impact  by  entertaining  and  pleasing  the 
audience.  "  Obviously,  I  agree  with  their  conclusion  that  metaphor  should 
be  effective,  though  not  with  their  assumptions  about  what  makes  it 
effective. 

ISQreen  <1980:  598)  speculates  that  the  effectiveness  of  one  use  of 
non-canonical,  inverted  syntax  arises  from  its  requiring  non-automatic 
processing. 

I^Sperber  and  Wilson  do  not  take  as  extreme  a  position  here  as 
Nunberg  does  <cf.  1986:175). 

^7fls  evidence  for  the  pervasiveness  and  creativity  of  extended 
uses,  a  survey  of  nine  arbitrarily  chosen  200-word  passages  from  works 
of  Modern  British  and  American  fiction  revealed  that  an  average  of 
around  15%  of  the  nouns,  verbs,  and  adjectives  in  them  were  used  in 
senses  that  are  not  covered  by  the  entries  in  a  large  desk  dictionary. 
<The  range  was  7*  to  53%.      See  appendix  2. ) 

IBin  fact,  she  has  also  made  it  awkward  to  try  again  right  away, 
because  that  would  risk  drawing  the  uncle's  attention  to  this  misfire, 
and  his  part  in  it,  and  generally  increase  the  social  tension  in  this 
somewhat  face-threatening  situation. 
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SHANGHAI  TONES:  A  NONLINEAR  ANALYSIS  * 
Zhiji  Lu 


This  article  presents  a  detailed  analysis  of  Shanghai  tones  within 
the  frameworic  of  current  phonological  theories,  especially  the  recent 
development  of  lexical  phonology  and  theory  of  underspecification.  I 
show  that  only  two  level  tones  /H/  and  lU  are  necessary  in  the  underlying 
forms  of  Shanghai  tones  and  all  the  surface  forms  for  both  citation  form 
and  sandhi  form  arc  derived  by  a  set  of  phonological  rules,  which  include 
the  default  rules  supplied  by  Universal  Grammar  (UG)  and  some  other 
language-specific  rules.  On  the  basis  of  the  analysis,  I  discuss  some 
related  issues  such  as  the  role  of  segmental  information  for  tonal 
derivation,  the  tonal  typology  of  Chinese  dialects,  the  asymmetry  of  the 
tonal  features  and  the  feature  underspecification.  Finally,  in  the  light  of 
this  study,  I  argue  that  tone  features  are  coplanar  and  only  the  coplanar 
model  is  proved  optimally  adequate  for  tonal  representation. 

1.  BACKGROUND 

1.1  Data 

The  tonal  phenomena  studied  here  refer  to  the  standard  speech  of  metropolitan 
Shanghai  which  belongs  to  the  Wu  Dialects  spoken  in  eastern  China,  one  of  the  seven 
major  dialect  groups  of  Chinese.  In  traditional  analysis,  there  are  five  tone  classes 
assigned  to  the  monosyllabic  morpheme  or  word  in  citation  form.  The  phonetic  values  of 
these  five  tone  classes  are  as  follows:  (M'  represents  the  raised  mid  tone) 


(1)  Tone 


HL  Examples:        sa  HL  (wound);  fon  HL(wind) 
MM'  tisin  MM'  (enter);  eia  MM'  (small) 

LM'  zu  LM*  (sit);  Re  LM'(salt) 

M'  ha?  M'  (black);  Vd  ?  (put  ofO 

LM  zd?  LM  (thief);  filo?  (fall) 


Tone  sandhi  occurs  when  these  monosyllabic  morphemes  or  words  with  different 
tones  in  citation  form  are  combined  into  disyllabic  or  polysyllabic  words  or  compounds. 
For  example: 

(2)  HL   +   HL    —    H   ML 

sa  foi)  sa   foQ 

(wound)  (wind)  (catch  cold) 

Based  on  my  phonetic  experiment  (Lu  1986),  the  phonetic  values  of  the 
disyllabic,   trisyllabic    and   four-syllabic    words   or  compounds   from   the   different 
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combinations  of  these  five  tones  are  given  below  (Xs  represent  the  tones  of  the  syllables 
after  the  leftmost  syllable): 

(3)  2-syl  3-syl  4-syl 


AX(XX): 

H-ML 

H-M-ML 

H-M-M-ML 

BX(XX): 

MM- 

M-H-ML 

M-H-M-ML 

CX(XX): 

L-M' 

L-H-ML 

L-H-M-ML 

DX(XX): 

M-M' 

M-H-ML 

M-H-M-ML 

EX(XX): 

L-LM 

L-L-LM 

L-L-L-LM 

In  Shanghai  dialect,  stress  always  appears  on  the  initial  (or  leftmost)  syllable  of 
the  word  or  compound. 


1.2  Tones  and  the  segmental  properties 

There  exist  close  relationships  between  the  tone  class  and  the  properties  of  the 
segmental  sequence  within  the  tone-bearing  syllable  in  Shanghai.  Traditionally,  Chinese 
syllable  structure  is  described  in  terms  of  tones,  initials,  medials,  nuclei,  and  endings. 
Thus,  the  segmental  sequence  in  the  syllable  of  Shanghai  may  contain  the  consonant 
initial,  the  medial  (glide),  main  vowel  or  nuclei,  and  the  consonant  ending.  The  internal 
constituency  of  syllable  structure  is  illustrated  below: 


(4) 


A 

X  X  X  X 

C  G  V  C 

The  crucial  points  here  are  the  properties  of  consonants  in  the  initial  and  final 
positions.  There  are  33  initial  consonants  in  surface  form  in  Shanghai  (Xu  et  al.  1981). 
They  can  be  divided  into  two  groups:  Group  A  consists  of  voiceless  stops,  affricates, 
fricatives,  and  nasals/laterals  with  a  preceding  voiceless  glottal  stop;  and  group  B 
contains  voiced  stops,  affricates,  fricatives,  and  nasals/laterals  with  a  preceding  voiced 
glottal  fricatives.  The  feature  [+murmur]  is  used  to  mark  the  initials  of  Group  B,  as  in 
several  previous  studies  of  the  Shanghai  tones,  and  the  initials  of  Group  A  are  left  as 
unmarked  ones. 

There  exist  three  consonants  as  the  endings  for  the  closed  syllables:  [n],  [g],  and 
glottal  stop  [?]. 

The  relations  between  the  tone  classes  and  the  groups  of  the  initials  and  the 
endings  are  as  follows:  (a)  Tones  A,  B  and  D  only  occur  in  the  syllables  with  the  initials 
of  group  A;  while  tones  C  and  E  are  only  with  the  initials  of  group  B.  (b)  Tones  A,  B  and 
C  are  only  related  to  the  syllables  without  a  glottal  ending,  and  tones  D  and  E  only  occur 
in  the  syllables  containing  a  glottal  ending.  They  can  be  shown  below: 
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Tone 

C      G 

V      C 

A 
B 
C 

D 
E 

[+mur] 
[+mur] 

[+glotta]] 
[+glottal] 

(5) 


Having  a  glottal  ending,  tones  D  and  E  are  short  in  duration.    They  are  called 
Entering  tones  in  traditional  Chinese  phonology. 


1.3  Previous  studies 

Sherard  (1972)  first  claims  that  the  contour  of  polysyllabic  words  depends  on  the 
tone  of  the  initial  syllable.  Ballard  (1977)  states  that  "in  Shanghai  apparently  the  only 
sandhi  that  occurs  is  of  the  type  I  shall  call  'right  spreading"'(p.l2).  This  claim  was 
modified  by  Zee  and  Maddieson  (1979,  henceforth  Z  &  M)  who,  based  on  the  results  of 
their  phonetic  experiment,  suggested  that  two  major  processes  are  involved,  "the 
spreading  of  the  tone(s)  of  the  first  syllable  of  a  compound  to  the  second  syllable,  and  the 
replacement  of  the  tone(s)  on  third  and  subsequent  syllables  by  Low  tone"  (p.  127).  In  Z 
&  M  (1979),  an  autosegmental  analysis  was  first  given  to  the  Shanghai  tonal  phenomena, 
and  four  underlying  forms  were  proposed: 

(6)        Tone 


A: 

/HU 

B: 

/MH/ 

C,E: 

/LH/ 

D: 

/H/ 

A  reanalysis  is  presented  in  Yip  (1980).  Yip  proposes  for  tone  a  hierarchical 
feature  system  in  which  two  binary  features  together  define  four  tonal  levels:  one  is  the 
Register  feature  [upper]  which  splits  the  overall  pitch  range  into  two  registers;  the  second 
feature  is  Tone  [high]  which  divides  each  register  into  two  sub-registers.  She  claims  that 
each  of  these  features  is  potentially  an  independent  autosegmental  tier,  and  "only  the 
feature  Tone  may  occur  in  sequences  underlyingly",  while  registers  do  not  occur  in 
sequence:  "Register  remains  constant  over  the  syllable" (p.  196).  Thus  the  tonal 
representation  looks  like  the  following: 

(7) 


[+Upper] 

1 

or  (for  brevity:) 

[-HUpper] 

<r 

L   H 

A 

L   H 

Under  her  Register  system,  she  reduces  the  tone  inventory  of  Shanghai  into  three 
tones: 

(8)       Tone    A:  [+Upper,HL) 

B,  D:  [+Upper,  L] 

C,  E:  [Upper,  LH] 

For  the  compounds,  she  assumes  that  the  syllables  after  the  first  one  are  toneless, 
thus  abandoning  the  mechanisms  of  tone  deletion  as  Z  &  M  did.  The  following  is  an 
example  of  her  analysis  (p.208): 


(9) 

A: 

+U 
1 

AX: 

+U 

a 

o  a 

K 

1    .' 

H   L 

H  L 

96 


AXX(X): 


+u 

-U 

r^^. 

r^-. 

<y         a 

a         (o) 

1      ,-' 

U"" 

HL 

L 

(L-insertion) 

For  the  same  data,  Wright  (1983)  provides  metrical  analysis  with  the  notion  of  the 
moraic  timing  device.  She  proposes  that  syllables  in  strong  or  isolated  positions  have 
two  moras  while  syllables  in  weak  positions  have  only  one  mora,  and  mora  is  assumed  as 
tone  bearing  unit  instead  of  syllable  as  tone  bearing  unit.  Thus,  under  her  approach  of  the 
timing  device,  two  Entering  tones  of  Shanghai,  tones  D  and  E,  have  one  mora,  and  the 
weakly  stressed  syllables  within  the  trisyllabic  and  four-syllabic  sandhi  forms  are  all 
reduced  to  a  single  mora.  I  do  not  enter  into  the  details  of  her  analysis  here  since,  in  my 
view,  the  analysis  of  Shanghai  tones  can  be  carried  out  without  this  special  timing  device. 


2.  THE  ANALYSIS 

2.1  The  theoretical  assumptions 

I  assume  the  major  theoretical  claims  made  in  the  current  development  of  lexical 
phonology  (Mohananan  1982,  Kiparsky  1982,  Pulley  blank  1986  and  others). 

The  tonal  feature  system  used  here  is  taken  from  Yip  (1980)  and  Clements 
(1981).  Four  basic  pitch  levels  are  shown  in  (10)  below  under  this  hierarchical  system 
(see  Pulley  blank  (1986)): 

(10) 


Register 

Subregister 
+raised 

H 

+upper 

-raised 

M' 

+raised 

M 

-upper 

-raised 

L 

For  the  tonal  representations,  following  Archangeli  (1985)  for  analysis  of  vowel 
features,  I  propose  a  coplanar  model  in  which  the  tonal  melody  is  formed  by  a  single 
plane  with  internal  organization  of  two  feature  tiers:  the  register  tier  which  anchors  on  the 
tone-bearing  unit  (T),  and  the  subregister  tier  which  anchors  on  the  register  tier,  as 
illustrated  telow: 

(11)  Coplanar  model 


tone 
melody 


[upper]  • 

I 
[raised] 


register  tier 
subregister  tier 
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Thus,  the  full  specifications  of  the  four  basic  level  tones  under  this  model  arc 
given  below:  (henceforth  [U]  for  [upper]  and  [R]  for  [raised]) 

(12)  T  T  T         T 

I  I  I  I 

[+U]     [+U]     [-U]      [-U] 

I       I       I       I 

[+R]     [-R]      [+R]     [-R] 

H         M'       M        L 

I  also  assume  that  the  associations  between  the  feature  matrices  (or  melody  units) 
and  their  anchors  follow  the  Association  Conventions  which  map  a  sequence  of  free  (or 
floating)  melody  units  onto  a  sequence  of  anchors  from  left  to  right.  Of  course, 
associations  either  by  the  Association  Conventions  or  by  rule  cannot  violate  the  Well- 
Formedness  Condition  (i.e.  association  lines  cannot  across). 


2.2  Underlying  forms,  rules  and  derivations 

Let  us  turn  to  the  data.  The  surface  forms  of  the  monosyllabic  words  of  the  five 
different  tone  classes  and  disyllabic  words  or  compounds  with  the  combinations  of  these 
five  tones  are  given  as  follows  with  the  full  feature  specification  on  the  register  tier: 

(13) 

A:  a  AX:         &     & 

/\  I       I 

[+U][-U]  [+U]  [-U] 

[HL]  [H   ML] 

B:  a  BX:         a     o 

/\  I      I 

[-U][+U]  [-U]  [+U] 

[MM']  [M   M'] 

C:  a  CX:         a     (7 

/X  I      I 

l-U][+U]  [-U]  [+U1 

(LM'l  [L  M'l 

D:  a  DX:         a    a 

I  I      I 

[+U]  [-U1  [+U] 

[M*]  [M   M'] 

E:  a  EX:         a     a 

I  I      I 

[-U]  [-U]  [-U] 

[LM]  [L   LM] 

It  is  easy  to  see  that  within  the  domain  of  a  word  (or  compound),  either 
monosyllabic  or  disyllabic,  on  the  register  tier,  only  those  with  tone  A  as  the  accented  or 


leftmost  syllable  begin  with  [+U]  while  the  others  all  have  [-U]  at  the  beginning  position 
except  a  monosyllabic  word  of  tone  D  which  is  specified  as  [+U].  However,  the 
disyllabic  word  with  tone  D  as  the  first  syllable  also  begins  with  [-U],  and  tone  D,  as 
mentioned  above,  is  associated  only  with  a  syllable  of  short  duration  ending  in  a  glottal 
consonant.  Therefore,  we  have  reason  to  assume  that  for  a  monosyllabic  word  of  tone  D 
there  still  exists  contour  tone  beginning  [-U]  in  the  early  stage  of  the  derivation  and  that 
this  [-U]  is  delinked  during  the  derivation  caused  by  contour  simplification  on  short 
syllables.  In  the  light  of  this  observation,  I  propose  that  there  are  only  two  level  tones  for 
the  underlying  representation  in  Shanghai,  namely  /H/  and  lU.  /H/  is  the  underlying 
form  for  tone  A  while  lU  is  the  underlying  form  for  tones  B,  C,  D,  E.  Of  course,  the  H  is 
lexically  specified  as  [+U]  and  the  L  is  lexically  [-U].  With  the  theory  of 
underspecification  proposed  in  Kiparsky  (1982),  Archangeli  (1984),  Pulleyblank  (1986) 
and  Sohn  (1987),  I  assume  that  only  the  H  is  specified  as  [+U]  and  the  L  is  left 
underspecified.  The  specification  of  [-U]  for  the  register  tier  in  the  unmarked  case  can  be 
provided  by  the  following  default  rule:  1 

(14)  Default  [-U]-insertion: 


[-U] 

Now,  let  us  first  deal  with  monosylabic  or  disyllabic  words  related  to  tones  A,  B, 
C,  D  as  the  accented  or  leftmost  syllable,  and  leave  those  of  tone  E  for  a  while.  The 
contours  can  be  observed  on  the  register  tier  for  these  words.  I  assume  that  they  are 
directly  derived  by  the  following  Contour  Formation  rule: 


(15)  Contour  Formation 


0  -*    [-«U]  /     a 

[«U] 

Take  a  monosyllabic  word  of  tone  A  as  an  example.  The  underlying  form  of  tone 
A  is  /H/  as  proposed  above.  /H/  is  specified  as  [+U]  lexically.  The  contour  on  the 
register  tier  of  the  syllable  is  directly  generated  by  applying  the  Contour  Formation  rule 
(15).  Contour  Formation  inserts  [-U]  immediately  after  [+U],  and  then  this  floating  [-U] 
is  linked  to  the  tone-bearing  unit  by  the  Association  Conventions,  as  follows: 

(16)  A: 

*  *  * 

0  — -        a        — —        o 

1  I  As 

[+U]  [+U][-U1         [+U][-U] 

/H/ 

Consider  those  with  l\J  as  the  underlying  form.  Take  a  monosyllabic  word  of 
tone  B  as  an  example.  The  underlying  form  of  tone  B  is  /IV  as  proposed  above.  Since 
lU  is  left  underspecified,  the  default  rule  (14)  provides  the  specification  of  [-U]  on  the 
register  tier.  Then  the  Contour  Formation  rule  (15)  gives  the  word  a  contour  tone,  as 
illustrated  below: 
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(17)  B: 


(14)     I  (15)  I  A. 

[-U]  [-U][+U1         [-U][+U] 


lU 


Obviously,  the  Default  [-U]-insertion  rule  should  be  applied  at  the  beginning  of 
the  lexical  cycle  if  no  marked  underlying  form  of  the  accented  syllable  occurs.  Then,  the 
Contour  Formation  rule,  as  a  language-specific  rule,  is  applied  to  form  the  contour  tones. 

For  disyllabic  or  polysyllabic  words  or  compounds  in  Shanghai,  I  assume  that 
only  the  accented  or  first  syllables  (or  morphemes)  within  the  domain  of  these  words 
keep  their  inherent  tones  while  the  unaccented  syllables  are  toneless  in  the  underlying 
representation.  Of  course,  the  unspecified  tone-bearing  units  will  be  specified  by  the 
default  rule  (14).  Here,  I  show  the  derivation  of  BX  as  an  example: 

(18)  BX: 

*  «  *  * 

a  a     — "       a  o      ►  a         a      — -      o-       a 

(14)         !     !       (15)  II  \      /^ 

[-U][-U]  [-U][+U][-U]  [-U][-hU][-U] 

The  output  does  not  exactly  match  what  is  shown  in  (13)  since  the  second  syllable 
is  linked  with  both  [+U]  and  [-U].  Observing  the  data  carefully,  we  can  find  that  there  is 
no  contour,  on  the  register  tier,  linked  to  a  syllable  if  the  syllable  is  unaccented  or  if  it  has 
a  glottal  ending.  I  propose  a  general  constraint  rule  of  this  dialect:  i.e.  the  Contour 
Simplification  rule  on  the  register  tier,  which  delinks  an  already  linked  [U]  from  the  tone- 
bearing  unit  when  another  floating  [U]  links  to  it  by  the  Association  Conventions.  Now, 
the  output  of  BX  in  this  stage  will  be: 

(19)  BX: 

«  *  * 

a         o       ►        a        o       ►       o      a 

I       /\  I      /X  II 

[-U]  [+U][-U]  [-U]  [+U][-U]  [-U]  [+U1 

This  Contour  Simplification  rule  on  the  register  tier  also  helps  to  derive  the 
correct  form  displayed  in  (13)  for  monosyllabic  words  of  tone  D  which  have  a  glottal 
ending: 

(20)  D: 

*  •  «  *  * 

a       — ►       a      — — *-       <r       — ►       a      — »-     a 
(14)         !         (15)        I  ><^^.  I 

[-U]  [-U][+U]         [-U][+U]  [+U] 

Now,  let  us  consider  the  monosyllabic  words  of  tone  E  or  the  disyllabic  words 
with  tone  E  as  the  leftmost  syllable.  There  is  no  contour  on  the  register  tier  for  them.  If 
the  Contour  Formation  rule  applies  to  them,  we  cannot  get  the  correct  output  as  in  (13) 
even  by  using  the  Contour  Simplification  rule.  Therefore,  our  only  recourse  is  to  modify 
the  Contour  Formation  rule  (14)  by  adding  the  condition  that  it  cannot  apply  to  a  syllable 
having  both  murmured  initial  consonant  and  glottal  final  consonant.   Thus,  the  Contour 
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Formation  rule  becomes  a  syllable-sensitive  rule  which  only  applies  to  the  accented 
syllable  of  tones  A,  B,  C  and  D. 

We  are  now  ready  to  deal  with  feature  specifications  on  subregister  tier.  I  assume 
that  all  feature  specifications  on  the  subregister  tier  can  be  predicted  or  derived  by  rule. 
The  following  is  the  surface  representation  fully  specified  in  both  register  and  subregister 
tiers  for  monosyllabic  and  disyllabic  words: 

(21) 

A:  o-  AX:         a  a 

/\  I        I 

[+U][-U]  [+U]  [-U] 

II  I    /\ 

[+R][-R1  [-R]  [+R][-R1 

[HL]  [H    ML] 

B:              a  BX:         a    a 

/\  I      I 

[-U][+U]  [-U1  [+U] 

II  II 

[+R][-R]  [+R]  [-R1 

[MM']  [M    M'] 

C:              a  ex.         &    (T 

/\  I      I 

[-U][+U]  [-U]  [+U] 

II  II 

[-R][-R]  [-R1  [-R1 

[LM']  [L    M'] 

D:  a  DX:         a    a 

I  I      I 

[+U]  [-U]  [+U] 

I  I     I 

[-R1  [+R]  [-R] 

[M'l  [M    M'] 

E:  a  EX:         a      a 

I  I       I 

[-U1  [-U]  [-U] 

Av  I    A 

[-Rl[+R]  [-R]  [-R][+R] 

[LM]  [L    LM] 

It  can  be  observed  that  accented  or  leftmost  syllables  are  specified  with  [-R]  in  the 
beginning  position  only  if  they  have  murmured  initial  consonants  (i.  e.  those  of  tones  C 
and  E).  We  can  explain  it  on  phonetic  grounds  that  the  voiced  consonant  at  the 
beginning  of  the  syllable  may  lower  the  tone  pitch.  Thus  a  Lowering  rule  is  formulated 
to  capture  this  fact: 
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(22)  Lowering  (syllable-sensitive) 

a  —   a       /  CGVC 

I  I  I 

[  U]      [  U]         [+munnur] 

[-R] 

Take  a  monosyllabic  word  of  tone  C  as  an  example: 

(23)  C: 

*  *  «  *  * 

a       — *      a        — -     <j  — ►      a         — ►       a 

(14)        !         (22)       I  (15)        I  /"^. 

[-U]  [-U]  [-U][+U]  [-U][+U] 

I  I  I 

[-R)  [-R]  [-R] 

The  Lowering  rule  is  applied  before  the  Contour  Formation  rule. 

Up  to  this  stage,  the  second  position  on  the  subregister  tier  associated  with  the 
[-i-U]  is  still  unspecified.  I  propose  another  default  rule  to  fill  in  this  position  with  a  [+R] 
by  the  end  of  lexical  cycle.  This  default  rule  is  formulated  as  follows: 

(24)  Default  [+R]-insertion: 
[Ul      —      [U] 

[+R1 
By  applying  this  default  rule  (24),  the  output  of  (23)  will  be: 

(25)  C: 

*  * 

a  o- 

/\  A 

[-U][+U]  [-U][+U] 

I  I     ! 

[-R]  [-R][+R] 

Obviously,  this  is  not  the  desirable  result  --  cf.  (21).  It  is  observed  from  (21)  that 
words  with  tones  A,  B,  C,  D  as  the  accented  or  leftmost  syllable  all  end  with  [-R]  on  the 
subregister  tier.  This  is  exactly  the  opposite  value  of  what  is  shown  in  (25).  Thus,  I 
propose  a  post-lexical  rule  named  [R] -attachment  rule  to  derive  the  correct  form: 

(26)  [R]- Attachment 

[U]     ]#  (pause) 

[orR]     [-«R] 

Now,  [-R]  will  be  attached  to  the  right  edge  of  the  output  of  (25)  by  the  [R]- 
attachment  rule  on  the  p»ost  lexical  cycle: 
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(27)  C: 


a      — ►       a     ]#       — ► 

or 

[-U][+U]          [-U][+U] 

[-U1  [+U] 

1     i           1      hv 

1      /\ 

[-R][+R]          [-R][+R]  [-R] 

[-R][+R][-R] 

It  can  be  observed  that  no  anchor  on  the  subregister  tier  specified  with  [+U]  is 
associated  with  a  contour.  Thus  there  exists  another  constraint  rule  for  contour 
simplification  on  the  subregister  tier  which  delinks  the  already  linked  [R]  from  [+U] 
when  another  floating  [R]  attaches  to  it  Now,  the  output  of  (27)  will  be: 

(28)  C: 

*  * 

o-      — ►       o- 

/\  /\ 

[-U]  [+U]         [-U][+U] 

I    ^       II 

[-R][+R][-R]      [-R][-R] 
[MM'] 

This  is  precisely  the  result  we  need.  Take  disyllabic  words  with  tone  A  or  tone  B 
as  the  leftmost  syllable  as  further  examples.  The  whole  derivations  can  be  illustrated  as 
follows: 

(29)  AX: 

a  a]# 
I  ( 

[+U] 


(26) 

[+U]    [-U] 
I       A. 
[+R][+R][-R1 
[H  ML] 

(30)  BX: 

*  *  ♦  * 

aa]#  a    o]#        ►      o         a  ]# ►      a    o  ]# 

(14)        !      !  (15)        I        /V     (24)         I     I 

[-U][-U1  [-U][+U][-U]  [-U][+U] 


*                      * 

♦ 

14) 

.     a    a]# — ►  a          a  ]# 
1     !     (15)     1        /X 
[+U][-U]         [+U][-U][-U] 

*■      a    a  ]# 

(24)        1      1 
[+U][-U] 

4i 

[+R1[+R1 

or 

1 

O]. 

[+R][+R] 
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►       o-    a  ]#      ► 

o-    a 

(26)         1      1 

1      1 

[-U1[+U1 

[-U1L+U] 

1  =r-. 

1      1 

[+R][+R]    [-R] 

[+R][-R] 

[M   M'l 

Now,  consider  the  derivation  of  words  an  accented  syllable  of  tone  E.  The 
monosyllabic  word  of  tone  E  is  derived  by  the  same  set  of  rules  mentioned  above: 

(30)  E: 

*  *  *  * 

a]#   ►      o    ]# al#    ►     a  ]# 

(14)        !         (22)       I        (26)        I 

[-U]  [-U]  [-yi 

[-R]  [-Rl[+R] 

[LM] 

But  there  is  problem  for  disyllabic  words  with  tone  E  as  the  leftmost  syllable: 

(31)  EX: 

*  *  *  ♦ 

a  a]#    .►    a     o]#        ►     a  a]#         ►      a  a  ]# 

(14)      !      :  (22)       I     I  (24)        I     I 

[-U][-U1  [-U][-U]  [-U][-U] 

!  I    1 

[-R]  l-R][+R] 

— -      a    a  ]# 
(26)         I      I 
[-U][-U] 

[-R][+R]  [-R] 
[L  ML] 

This  is  not  the  correct  result  The  problem  is  caused  by  the  application  of  the 
default  [+R]-insertion  rule  which  assigns  [+R]  to  the  unmarked  position  associated  with 
the  second  [-U].  If  this  position  is  filled  with  [-R]  before  applying  this  default  rule,  this 
problem  can  be  avoided.  Thus,  a  rule  named  [-R]-spreading  is  proposed  to  solve  this 
problem: 

(32)  [-R]-Spreading 

[-U]  [;U] 
[-R] 

This  is  a  lexical  rule.  The  application  of  this  rule  is  ordered  before  the  default 
[+R]-insertion  rule.  Now,  the  derivation  of  EX  after  the  application  of  the  Lowering  rule 
will  be: 
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(33)  EX: 


o-  a  ]#  — -    cr  o  ]if        ►       a  a  ]# 

I     I       (32)      I     I  (26)         I     I 

[-U][-U]  [-U][-U1  [-UIMJ] 

[-R]  [-R]  [-R]    [+R] 

[L  LM] 

To  summarize,  only  two  underlying  tones  are  proposed  here  for  Shanghai,  namely 
/H/  and  /U:  the  H  is  assumed  for  tone  class  A  which  is  lexically  specified  with  [+U], 
while  the  L  is  assumed  for  tone  classes  B,  C,  D  and  E  which  are  left  lexically 
unspecified.  In  polysyllabic  words  or  compounds,  only  the  accented  or  leftmost  syllable 
will  keep  its  inherent  tone;  others  will  be  toneless.  In  all  cases,  the  surface  forms  can  be 
derived  by  the  default  rules  ([-U] -insertion  rule  and  [+R]-insertion  rule)  and  a  set  of 
language-specific  rules.  The  set  of  language-specific  rules  contain  the  lexical  rules,  the 
post-lexical  rule  and  constraint  rules: 

(34)  Lexical  rules  (ordered  as  follows): 

a.  Lowering  rule 

b.  Contour  Formation  rule 

c.  [-R]- spreading  rule 

The  Lowering  rule  and  the  Contour  Formation  rule  are  syllable-sensitive  rules: 
the  former  is  applied  for  syllables  with  a  murmured  initial  consonant  while  the  latter  is 
not  applied  for  syllables  having  both  a  murmured  initial  consonant  and  a  glottal  final 
consonant. 

The  post-lexical  rule  is: 

(35)  [R]-attachment  rule 

The  post-lexical  rule  is  only  involved  with  feature  specification  in  the  subregister 


The  constraint  rules  (i.e.  Contour  Simplification  rules)  that  disallow  the  linking  of 
a  contour  on  the  register  tier  to  an  unaccented  or  short  syllable,  and  also  disallow  the 
linking  of  a  contour  on  the  subregister  tier  to  a  single  [+U]. 

With  the  above  sets  of  rules,  the  surface  forms  of  words  or  compounds  of  more 
than  two  syllables  can  also  be  derived.  To  save  space,  only  the  derivations  of  BXXX  and 
EXXX  are  shown  below: 

(36)  BXXX: 

«                                     *  * 

crao-a]#      — *■        o-    a    a    a]#  •-  o-        o-       a     a    ]# 

(14)        !     !     I     I  (15)  I       /'X     I      I 

[-U][-U][-U][-U]  [-U][+U][-U][-U][-U] 
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^      o-    a     o-    a    ]#  ►     o-     a     a     o   ]# 

(24)        I      I      i      I  (26)        I       I       I       I 
[-U][+U][-U][-U]  [-U][+U][-U][-U] 

!    ;    !    :  III    r-^. 

[+R][+R][+R][+R]  [+R][+R][+R][+R]  [-R] 


[M    H   M  ML] 


(37)  EXXX: 


a  a  o  a]#      ►      cr     o     o     cr  ]if     "     a     a     o     a]# 

(14)        111:  (22)        III! 

[-U][-U][-U][-U]  [-U][-U][-U][-U] 

[-R] 


"     a    o-    a     a  ]#     ^     o-    o-    o-    cr  ]# 

(32)        I       I      I      I  (26)        I      I      I      I 

[-U][-U]L:U]L-U]  [-U][-U]I-UmJ] 


[-R]  [-R]  [+R] 

[L  L  L  LM] 


3.  Discussion 

3.1  Tonal  typology 

3.11  Segmental  information 

Compared  with  the  tonal  phenomena  in  African  languages,  those  in  Chinese  seem 
more  complex  with  regard  to  the  tone  inventory.  In  African  languages,  two-tone  systems 
are  very  common,  and  most  of  languages  have  less  than  four  level  tones.  But,  in 
Chinese,  the  number  of  tones  in  surface  form  may  go  up  to  nine  or  ten  (as  in  Cantonese). 
In  the  Wu  Dialects,  the  number  of  tones  in  citation  form  are  from  five  to  eight  However, 
as  shown  in  section  2,  wi\h  the  principle  allowing  only  unpredictable  properties 
underlyingly,  I  have  reduced  the  tone  inventory  of  Shanghai  to  only  two  tones  in 
underlying  representation.  Obviously,  the  segmental  information  plays  an  important  role 
in  tonal  derivation. 

Concerning  the  historical  development  of  Chinese,  it  is  assumed  that  Chinese  has 
undergone  the  shift  from  some  segmental  distinctions  to  tonal  distinctions.  In  Ancient 
Chinese  (around  600  A.D.),  it  is  generally  held  that  there  were  four  tone  categories. 
Conditioned  mainly  by  the  contrast  of  voiceless  and  voiced  initial  consonants,  tonal  splits 
occurred.  Some  dialects,  which  have  eight  tones  in  citation  form  (e.g.  Wenzhou  dialect, 
a  subdialect  of  the  Wu  dialects),  neatly  reflect  such  splitting.  Thus,  tone  properties  to  a 
large  extent  are  related  to  the  properties  of  segments  within  the  syllable.  Of  course,  some 
dialects,  especially  in  Mandarin  groups,  have  already  finished  this  shifting  process  and 
lost  the  contrast  on  the  segmental  plane  regarding  the  tonal  contrast  by  the  processes  of 
simplification  of  sound  systems.  For  instance,  in  Mandarin,  there  are  two  tone  categories 
in  the  "Even  Tone":  Yin  Ping  (Yin  Even  Tone)  and  Yang  Ping  (Yang  Even  Tone).  The 
former  is  originated  from  the  syllables  of  the  Even  Tone  with  the  voiceless  initial 
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consonants  in  Ancient  Chinese  while  the  latter  from  the  syllables  of  the  Even  Tone  with 
voiced  initial  consonants,  though  the  voiced  initial  consonants  have  already  disappeared 
now.  In  contrast  with  Mandarin,  many  dialects  in  southern  and  eastern  Qiina  are  still  in 
the  late  stage  of  this  shifting  process,  keeping  in  different  degrees  of  the  distinction  at 
both  levels:  tone  plane  and  segmental  plane.  In  my  view,  Shanghai  is  among  them.  This 
makes  it  possible  to  reduce  the  tone  inventory  of  these  dialects  by  efficiendy  using  the 
segmental  information.  Thus,  only  two  level  tones  /H/  and  fU  are  proposed  for  the 
underlying  representations  of  Shanghai  tones,  and  the  application  of  the  lexical  rules,  to 
derive  the  surface  forms,  in  most  cases,  uses  the  segmental  information. 

3.12  From  the  syllable-based  tone  melody  to  the  word-based  tone  melody 

In  traditional  analyses  of  Chinese  tonal  phenomena,  the  tones  in  citation  form  are 
regarded  as  basic  form  and  the  tones  in  sandhi  form  are  considered  as  changed  or  derived 
form.  Therefore,  the  tonal  analysis  always  breaks  into  two  stages:  first,  to  determine  the 
tones  in  citation  form,  and  then,  to  derive  the  sandhi  form  from  the  citation  form.  My 
analysis  here  is  different  from  the  traditional  analyses  as  well  as  most  of  previous  studies 
in  Western  Literature,  since  I  assume  that  the  derivation  of  tones  either  in  citation  form  or 
in  sandhi  form  shares  the  same  phonological  processes. 

Wright  (1983)  has  noted  that  "Shanghai  appears  to  be  moving  toward  a  pitch 
accent  system"  (p.  182).  Her  observation  can  be  strongly  supported  by  the  fact  that  here 
only  the  accented  syllable  has  an  underlying  tone  melody  as  shown  in  my  analysis.  If 
Shanghai  is  still  a  "pure  tone  language",  there  will  be  25  possible  prosodic  patterns  for 
disyllabic  words  and  125  for  trisyllabic  ones,  since  five  tones  are  observed  in  citation 
form.  In  fact,  the  data  show  that  only  five  patterns  are  relevant  to  each  of  the  different 
length  of  words.  Each  pattern  is  strictly  related  to  the  inherent  tone  and  syllable  structure 
of  the  accented  (or  leftmost)  syllable  within  the  word  or  compound  domain.  This 
provides  the  evidence  that  the  lexical  tonal  pattern  of  Shanghai  is  undergoing  a  shift  from 
the  syllable-based  tone  melody  to  the  word-based  tone  melody.  That  is  to  say,  the  word 
or  compound,  monosyllabic,  disyllabic  or  polysyllabic,  composes  a  unique  unit  for  a 
lexical  prosodic  pattern.  This  kind  of  shifting  can  be  highly  clarified  if  we  relate  it  to  the 
general  tendency  of  lexical  change  in  the  development  of  Chinese.  In  archaic  Chinese, 
most  words  in  lexicon  were  monosyllabic,  but,  today,  the  majority  words  arc  disyllabic  in 
Modem  Chinese.  The  burden  of  differentiating  the  homophones  of  monosyllabic  words 
has  been  reduced,  and  sound  systems  have  been  greatly  simplified.  Thus,  the  syllable- 
based  tone  melody  is  being  gradually  replaced  by  a  word-based  tone  melody.  This  kind 
of  shifting  can  also  be  observed  in  other  Wu  dialects  in  different  degrees.  It  is  with  this 
new  perspective  that  I  propose  the  same  phonological  processes  to  derive  the  surface 
forms  for  either  citation  forms  or  sandhi  forms. 


3.13  Melodic  units:  reconsidering  the  data  of  Wuxl 

Based  on  the  analysis  of  tone  data  of  three  Chinese  dialects,  i.e.  Wuxi,  Amoy  and 
Danyang,  Yip  (1987)  recendy  argues  that  tonal  contours  must  be  regarded  as  melodic 
units,  just  like  the  treatment  of  segmental  affricates  as  units.  Thus  a  branching  structure 
within  the  tonal  feature  node  is  proposed,  parallel  to  the  segmental  ones,  as  illustrated 
below: 
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(38)  a.  V       instead  of:  V 

I  /\ 

a  L  H 

A 
L  H 

b.  C       instead  of:         C 

I  A 

a  t  s 

A 

t   s 

Though  both  in  (38a)  have  a  single  syllabic  position  associated  with  an  ordered 
sequence  of  elements,  only  the  representation  on  the  left  side  allows  reference  to  that 
sequence  as  a  melodic  unit  which  can  be  spread  as  whole.  For  example,  tone  pattern  A  in 
Wuxi,  suggested  by  Yip,  is  /L(LH)/  underlyingly,  which  has  a  level  part  L  and  a  contour 
part  (LH).  By  spreading,  each  part  anchors  as  a  melodic  unit  to  one  edge  of  the  domain: 

(39)  Pattern  A: 

a  a     a  aao-  <r  cr  a  a 

A  I      I  I        I  II 

LLH  LLH  LLH  LLH 

Then  phonetic  interpolation  from  L  to  L  will  give  continuous  low  pitch  as 
observed  tone  pattern.  As  a  consequence,  from  a  typological  point  of  view,  she  points 
out,  "Chinese  languages  make  widespread  use  of  tonaJ  affricates,  whereas  African 
languages  tend  to  lack  them"  (pp.  22-23). 

Wuxi,  spoken  in  Wuxi  city  about  80  miles  north-west  of  Shanghai,  it  shares  some 
properties  of  Shanghai.  By  reconsidering  the  data  given  in  Yip  (1987),  I  will  propose 
another  analysis  which  is  similar  to  what  I  did  above.  For  tone  pattern  A  of  Wuxi,  I 
would  rather  propose  only  /U  as  the  underlying  representation  and  add  a  post-lexical  rule 
applying  at  the  post-lexical  cycle  of  the  derivation,  Tone- Attachment  rule,  which  attaches 
a  tone  with  the  opposite  value  of  its  preceding  one  to  the  edge  of  the  lexical  domain, 
similar  to  the  [R] -Attachment  rule  presented  in  (26).  The  derivation  of  Pattern  A,  now, 
will  be: 

(40) 

a  ]#  — ►     a  1#  <T  or  ]#         ►      <r  a]» 

I  K  t''  I    rv 

L  L  H  L  LLH 

a  o  a]#       — ►       a  a  a]#         aa<T<j]#       — «-      aoraa]# 

L'-'  I      N.         L'-'  I       K 

L  LLHL  LLH 

The  output  will  be  the  same  as  what  is  derived  from  (39). 

The  same  analysis  can  be  applied  to  the  other  three  tone  patterns  in  Wuxi.  Yip 
assumes  that  Pattern  B,  C,  and  D  have  the  underlying  forms  /(LH)L/,  /H/,  and  /L(HL)/ 
respectively.  The  derivations  of  them  are  given  by  her  as  follows: 
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(41)  Pattern  B: 

a 

A 

LH  L 

0  0- 

1  1 
LH   L 

0  o  a 

1  1 
LH       L 

a  o-  a  a 

1             1 

LH            L 

Pattern  C: 

A 
H 

1/ 
H 

a  o-  a 
H 

a  a  cr  a 
H 

Pattern  D: 

o 

A 

L  HL 

0-    cr 

1      1 
L   HL 

0  o    <y 

1  1 
L         HL 

a  a 

1 

L 

o  cr 

1 
HL 

Different  from  Yip's  analysis,  I  will  propose  that  Pattern  B  has  the  underlying 
form  /LH/  and  both  Pattern  C  and  D  share  the  same  underlying  form  /H/.  The  segmental 
property  of  the  initial  syllables  within  Pattern  C  and  Pattern  D  is  different:  the  former  has 
a  voiceless  initial  consonant  while  the  latter  has  a  voiced  one  (see  chart  (13)  in  Yip 
(1987)).  Therefore,  a  lowering  rule,  similar  to  the  rule  (22),  to  lower  the  tone  in  the 
leftmost  position  can  be  proposed  for  Pattern  D.  After  the  spreading  of  H  to  the  right 
edge,  the  H  in  the  leftmost  position  can  be  replaced  by  a  L  tone  when  applying  this 
Lowering  Rule,  then,  the  Tone-attachment  rule  is  applied  at  the  post-lexical  cycle: 

(42)  Pattern  D: 

a  ]#  ^  a  ]#  —  a  1#  a  a]#  -*  a  a]#   —  a  o-]# 

A  A  /r^.  1/  II  /A 

H  LHLHL  H  LHLHL 

a  o  o-]#  — *  a  a  a]#        — ^      a  a  ol# 

I  II  I     r\ 

H  L     H  L     H  L 

o  a  <T  a]#    — "      cr  o  a  a]#     — *     o  o  a  ol# 

u--^  II  I        rv. 

H  L         H  L  H  L 

For  Pattern  C,  according  to  the  original  pitch  patterns  shown  in  (14)  of  Yip's 
paper,  there  is  a  fall  near  the  end  in  disyllabic,  trisyllabic  and  quadrisyllabic  forms.  Thus 
the  application  of  the  Tone-attachment  rule  will  give  a  more  delicate  output  than  what  is 
shown  in  (41). 

Now,  let  us  consider  the  Pattern  B.  The  original  pitch  pattern  of  Pattern  B  shown 
in  (14)  of  Yip's  paper  shows  that  the  high  points  in  polysyllabic  forms  arc  on  the  second 
syllable  instead  of  on  the  first  syllable.  When  I  propose  a  rising  tone  /LH/  for  Pattern  B,  I 
assume  that  only  the  level  tone  can  spread  the  whole  domain  such  as  in  Pattern  A,  C,  and 
D,  i.e.  only  level  tone  can  associate  to  the  two  ends  of  the  domain  as  proposed  by  Yip. 
For  the  rising  tone,  the  spreading  of  H  will  be  blocked  on  the  third  syllable  of  the  whole 
domain.  This  kind  phenomenon  can  be  observed  in  many  Chinese  dialects.  For 
example,  the  data  shown  in  (3)  of  Shanghai,  the  H  does  not  occur  on  the  syllables  after 
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the  second  one  when  it  is  a  contour  tone  covering  the  whole  domain.     Thus,  the 
derivation  of  Pattern  B  will  be: 

(43)  Pattern  B: 

a]#    — ►      a]#  (f  a]#  — ►      a  o-]# 

A  An  II  I     r^ 

LH  LHL  LH  LHL 

a  a  a]#  — ►ao-a]*  a  a  a  a]#    — ^      cr  cr  (t  o-]if 

II  I    I   \  II  I    I        \ 

LH  LHL  LH  LHL 

Obviously,  there  is  no  need  to  propose  the  tonal  contours  as  melodic  units  in  the 
underlying  forms  of  Wuxi.  I  think  that  the  main  part  of  the  data  in  Danyang  can  be  given 
the  same  analysis  as  I  did  above.  Of  course,  as  I  pointed  in  Section  3. 12  that,  in  the  view 
of  tonal  typology,  modem  Giinese  dialects  seem  on  the  way  from  the  syllable-based  tone 
melody  to  the  word-based  tone  melody,  there  exist  the  phenomena  that  the  contour  tones 
are  fixed  on  the  syllable  instead  of  on  the  domain  of  word  when  some  dialects  still 
remain  in  the  early  stage.  Even  in  these  cases,  I  do  not  see  the  reason  to  take  contours  as 
melodic  units  since  they  have  already  been  considered  as  the  syllable-based  tone  melody. 

3.2  Feature  underspeciflcation 

As  noted  in  Sohn  (1987),  "a  phonological  system  is  more  valued  if  it  encodes  aU 
and  only  unpredictable  properties  in  underlying  representation"  (p.l).  Crucially 
depending  on  the  asynmietry  in  phonological  phenomena,  the  theory  of 
underspeciflcation  provides  a  systematic  way  of  eliminating  overall  redundant 
specifications  in  the  underlying  form. 

In  Shanghai,  the  asymmetrical  behavior  of  the  tonal  features  can  also  be  detected. 
First,  with  respect  to  the  specification  of  the  underiying  representations,  there  exists,  of 
course,  the  contrast  between  [+U]  and  [-U],  but  there  is  no  contrast  between  the  syllable 
specified  by  [-U]  and  the  toneless  one.  This  is  the  evidence  to  characterize  the 
asymmetry  between  [+U]  and  [-U].  Secondly,  concerning  subregister  features,  within  the 
lexical  level,  no  language-specific  rules  may  refer  to  the  feature  [+R].  [+R]  can  be 
specified  at  the  end  of  lexical  domain.  Thinlly,  the  tone  of  the  syllable  such  as  in  the 
third  position  of  AXXX,  BXXX,  CXXX  and  DXXX,  to  which  no  language-specific  rules 
apply,  always  surfaces  as  [M]  which  is  specified  by  [-U]  and  [+R].  Obviously,  the 
existence  of  such  asymmetjy  of  the  tonal  features  in  Shanghai  gives  the  possibility  of 
taking  [-U]  and  [+R]  as  default  values.  As  a  consequence,  to  gain  the  minimal 
specifications  in  underlying  representation,  only  the  H  is  marked  as  [+U]  while  the  L  is 
left  unspecified  in  my  analysis.  As  proposed  in  Pulleyblank  (1986),  [-U]  is  the  default 
value  assigned  by  UG  for  two-tone  system,  and  [+R]  is  the  default  value  also  assigned  by 
UG  in  realization  of  three  or  four  tones.  Following  his  suggestion,  it  is  reasonable  to 
assume  that  the  default  rules  formulated  above  are  supplied  by  UG,  and  the  default  [-IJj- 
insertion  rule  (14)  applies  immediately  after  the  entering  of  the  marked  underlying  form 
of  the  accented  or  leftmost  syllable  (or  morpheme),  while  the  default  [+R]-insertion  rule 
(24)  applies  at  the  end  of  lexical  cycle. 

The  result  of  allowing  minimal  necessary  information  in  the  underlying 
representation  under  the  feature  underspeciflcation  system  simplifies  the  phonological 
rules.  There  is  no  need  to  propose  the  tone  deletion  rules  as  in  Z  &  M  (1979)  or  the  rule 
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of  L-insertion  as  a  language-specific  rule  both  in  Z  &  M  and  Yip  (1980). 

3.3  Coplanar  representations  for  tone  melody 

Roughly  speaking,  there  are  two  models  of  tonal  representation  used  in  the 
current  theory.  One  is  "multiplanar"  model;  another  is  "unitiered"  model.  Here  I  use  the 
terms  "multiplanar"  and  "unitiered"  as  well  as  "coplanar"  as  strictly  defined  by 
Archangel!  (1985)  in  her  analysis  of  vowel  features  in  Yokuts  harmony.  Namely,  in  the 
unitiered  model  the  features  arc  in  a  single  matrix  on  a  single  plane,  and  with  the 
multiplanar  model  the  features  are  in  single  matrices  on  separated  planes,  while  the 
coplanar  model  is  that  the  features  are  on  the  same  plane  but  in  separated  tiers.  In  this 
article  1  use  the  coplanar  model  for  tonal  representation.  Drawing  on  data  from 
Shanghai,  I  argue  here  that  tonal  features  are  coplanar. 

The  analysis  of  Shanghai  tones  shows  that  the  register  feature  [upper]  and 
subregister  feature  [raised]  are  relatively  independent.  The  phonological  processes  may 
only  refer  to  either  of  them,  for  example,  only  register  feature  is  involved  in  application 
of  the  rule  of  Contour  Formation  (15),  and  only  subregister  feature  is  involved  when  [R]- 
attachment  rule  (26)  is  applied.  Thus,  that  [upper]  and  [raised]  are  assumed  as 
autosegments  and  are  arrayed  on  separate  tiers  may  capture  such  characteristic  of  the 
phonological  processes.  On  the  other  hand,  there  exists  some  sense  of  dependency 
between  these  two  tonal  features.  Logically,  register  feature  is  primary,  comparing  to  the 
subregister  feature,  to  describe  the  pitch  value  of  certain  tone  melody  since  it  stands 
higher  than  subregister  feature  in  the  feature  hierarchy  and  it  can  be  defined  without  the 
specification  of  subregister  feature.  That,  in  Shanghai,  only  register  features  are  used  to 
specify  the  underlying  representation  of  tone  melody  can  be  viewed  as  evidence  of  that 
On  the  contrary,  the  assignment  of  subregister  feature  may  only  refer  to  the  values  of 
register  feature.  For  example,  [-R]-spreading  (32)  takes  the  feature  values  of  the  register 
tier  as  its  condition.  Thus  these  two  tiers  form  a  single  plane  with  register  tier  anchors  on 
the  tone-bearing  unit  and  subregister  tier  anchors  on  the  register  tier,  which  may 
represent  the  internal  structure  of  these  two  different  features  within  tone  melody  plane. 

Now,  let  us  compare  this  coplanar  model  with  other  two  models  in  handling  the 
data  of  Shanghai.  Yip's  (1980)  proposal  can  be  viewed  as  a  multiplanar  model  since 
register  feature  and  subregister  feature  are  arrayed  in  the  separate  planes  as  shown  in 
(11).  With  regard  to  the  analysis  of  Shanghai  data,  there  exist  at  least  two  serious 
problems.  First,  her  model  does  not  allow  the  value  of  the  contour  tone  crosses  the 
middle  line  of  the  pitch  range  within  a  syllable  because  she  assumes  that  only  subregister 
feature  (Tone)  may  occur  in  sequence  while  Register  remains  constant  over  the  syllable. 
Therefore,  she  failed  to  derive  the  desirable  output,  for  example,  [HL]  for  tone  A  and  [H- 
ML]  for  AX  since  they  are  not  limited  in  each  half  of  the  pitch  range.  She  noted  it  as 
"nothing  but  a  non-phonological  prolongation  of  the  essential  tone"  though  it  "actually 
falls  right  down  to  the  lowest  possible  pitch"  (p.201).  But,  anyway,  such  kind  "cutting"  is 
not  satisfactory  for  the  description  of  tone  melody  in  Chinese.  Secondly,  the  multiplanar 
representation  cannot,  in  formal  sense,  capture  the  internal  relation  between  these  two 
different  features  as  noted  above.  For  example,  the  rule  of  [-R]-spreading  (32)  expressed 
in  coplanar  model  shows  neatly  that  this  phonological  process  is  dependent  on  the  values 
of  register  tier  and  irrelevant  to  the  sequence  of  the  tone-bearing  units,  while  under  the 
multiplanar  model  the  tone-bearing  units  are  always  involved  and  thus  blur  the  picture,  as 
illustrated  below: 


(44)  Coplanar:  Multiplanar: 

[-U]  [-U]  [-U1  [-U] 

L'-'  I        I 

[R]  o-      o- 

[-R] 

Most  studies  on  tone  phenomena  in  cuirent  theory  take  the  unitiered  model  for 
tonal  representation,  in  which  tone  features  are  assumed  to  occur  in  the  feature  matrices 
which  comprise  a  single  plane  or  a  single  tier,  and  "all  tonal  specifications  in  such  an 
approach  would  be  on  a  single  tier"  (Pulleyblank  1986,  p.  15).  There  is  no  difference 
between  unitiered  model  and  coplanar  model  if  only  register  features  are  used  to  specify 
the  tonal  values  in  a  language.  But,  if  the  subregister  features  are  involved  in  the 
phonological  processes,  the  formulation  in  terms  of  unitiered  model  is  far  more  clumsy. 
For  example,  under  this  model,  the  rule  of  [R]- attachment  will  be  presented  as  below: 

(45)  Unitiered: 
&      ]# 

PU        /JU 

«rJ  [-«r 

Obviously,  since  the  unitiered  model  fails  to  represent  the  internal  organization  of 
features,  the  rule  in  (45)  is  costly,  comparing  to  (26). 

Based  on  the  discussion  above,  I  conclude  that  the  coplanar  model  is  preferred  for 
tonal  representations  as  evidenced  by  the  Shanghai  tones,  and  furthermore,  the  argument 
here  may  also  be  considered  to  support  the  suggestion,  made  in  Archangel!  (1985),  that 
"coplanar  representations  are  the  universal  default  representations". 


NOTES 


I  would  like  to  express  my  gratitude  to  Professor  Michael  Kenstowicz  who  read 
the  manuscript  of  this  paper  and  gave  me  several  valuable  comments  and  suggestions.  I 
have  also  received  much  benefit  from  discussions  with  Professor  Charles  Kisseberth, 
Hyang-sook  Sohn,  Xing-ping  Zhou  and  Rick  Treece  during  preparation  of  this  paper.  Of 
course,  all  errors  are  mine. 

1  The  formal  notations  used  in  this  article  are:  *a  '  as  syllable  for  the  tone- 
bearing  unit;  '*'  as  accent  marker,  ']#'  as  word  or  compound  boundary;  *|'  means  linking 
and  '4='  delinking. 
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RULE  EXPANSION  ON  THE  FLY: 
A  GPSG  PARSER  FOR  JAPANESE/ENGLISH 
USING  A  BIT  VECTOR  REPRESENTATION  OF  FEATURES  AND  RULE  SCHEMAS 


Tsuneko  Nakazawa 
Laura  Neher 


We  have  implemented  a  Japanese/English  parser 
based  on  Generalized  Phrase  Structure  Grammar. 
Syntactic  categories  (sets  of  features)  and  rule 
schemas  are  represented  internally  by  bit  vectors, 
which  allows  efficient  expression  of  rule  schemas 
on  the  fly  during  parsing,  according  to  the 
principles  of  GPSG,  avoiding  the  combinatorial 
explosion  of  expanded  rules. 


1.  INTRODUCTION 

We  have  implemented  a  Japanese/English  parser,  based  on 
the  linguistic  framework  of  Generalized  Phrase  Structure 
Grammar  (GPSG),  as  presented  in  Gazdar  (1985).  The  focus  of 
this  paper  is  on  the  internal  representation  of  syntactic 
categories  and  rule  schemas,  and  the  incorporation  of  the 
representation  into  a  modified  chart  parser.  An  efficient 
representation  using  bit  vectors  has  been  devised,  which 
allows  rule  schemas  to  be  expanded  on  the  fly  during  parsing 
in  accordance  with  the  principles  of  GPSG.  Because  of  the 
representation,  operations  on  features  are  reduced  to 
logical  bit  operations. 

The  program  has  several  parts.  The  first  takes  a  set  of 
feature  value  pairs  as  input  and  produces  a  bit  vector 
representation  of  them  as  output.  A  lexicon  and  grammar  are 
then  translated  into  a  corresponding  bit  vector 
representation.  Finally,  the  categories  which  are  used  in 
the  grammar  and  lexicon  are  organized  into  a  discrimination 
net  and  lattice  to  facilitate  retrieval  and  matching  during 
parsing. 


2.  RULE  EXPANSION 

Our  decision  to  expand  rule  schemas  during  parsing  is 
motivated  by  the  desire  to  avoid  the  combinatorial 
explosion  which  would  result  if  all  of  the  rules  of  the 
grammar  were  fully  expanded  before  parsing.  In  GPSG, 
syntactic  categories  are  represented  as  sets  of  features; 
within  rule  schemas,  the  values  of  some  or  all  of  these 
features   may    be    left   unspecified.   The  set  of  possible 
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values  for  these  unspecified  features   is   constrained 
general   feature  instantiation  principles. 


by 


To  illustrate  the  combinatorial  explosion  which 
may  result  from  instantiating  rule  schemas,  consider  a  rule 
schema  with  four  syntactic  categories,  each  of  which  has 
three  unspecified  features.  Assume  that  each  feature  can 
take  two  possible  values.  The  number  of  possible  rule 
instantiations  for  this  schema  would  be  2i2.  (However, 
some  of  these  would  be  blocked  by  feature  instantiation 
principles.)  For  this  reason  we  have  avoided  expanding  the 
grammar  prior  to  parsing. 


3.  FEATURE  AND  CATEGORY  REPRESENTATION 

GPSG  features  are  of  two  types:  atomic-valued 
features  and  category-valued  features.  Atomic-valued 
features  are  those  whose  values  are  not  decomposable, 
while  category-valued  features  have  categories,  and  thus 
sets  of  features,  as  their  values.  Both  kinds  of  features 
are  represented  by  bit  vectors. 

Let  us  consider  a  simple  set  of  features: 


( 1 )  feature 
V 
H 

BAR 

SUBCAT 

PAST 

AGR 


values 


0,1,2 
1,2,3 


( category-value ) 


The  feature  representation  program  first  counts  up  the 
number  of  distinctive  feature-value  pairs  and  the  number 
of  category-valued  features,  to  determine  the  size  of  the 
vector.  For  this  example,  a  vector  of  thirteen  ones  is 
created,  and  each  feature  value  pair  is  assigned  a  bit  value 
in  the  vector,  as  indicated  below: 


(2) 


»*1111111111111 


(V  +) 

(V  -) 

(N  +) 

(N  -) 

(BAR  0) 

(BAR  1) 

(BAR  2) 

(SUBCAT  1). 
(SUBCAT  2). 
(SUBCAT  3). 

(PAST  +) 

(PAST  -) 

(AGR    ) 
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The  order  of  bit  assignment  is  arbitrary,  but  is  fixed,  and 
the  size  of  the  vector  is  fixed  given  the  specific  grammar. 

Next,  the  category  representation  program  converts 
syntactic  categories  in  the  grammar  to  feature-value 
representations  and  then  to  vectors.  VP,  for  example,  is 
converted  to  [(V  +)(N  -)(BAR  2)],  and  the  inapplicable  bits 
are  turned  off  in  the  corresponding  vector: 

(3)  <»*1001001111111 

(V  -) I 

(N  +) 


(BAR  0) 
(BAR  1). 


In  this  example,  the  values  of  the  bits  for  SUBCAT,  PAST  and 
AGR  features  are  one,  indicating  they  are  unspecified. 

When  a  category  has  a  category-valued  feature 
specified,  the  bit  corresponding  to  the  feature  is 
turned  off  and  its  category-value  is  represented  by  the 
second  vector: 

(4)     VP(AGR  NP)   1**1001001111110 

AGR 

NP  «*0110001111111 

Finally,  for  each  syntactic  category,  a  unique  symbol 
of  the  form  CATn  (n  is  an  integer)  is  internally  created. 
The  first  vector  is  associated  with  the  symbol  as  a  property 
called  BIT,  and  the  second  vector,  if  any,  as  a  property 
AGR,  whose  existence  is  indicated  by  the  zero  value  for  the 
AGR  bit  in  the  BIT  vector.  If  more  than  one 
category-valued  feature  is  specified,  vectors 
corresponding  to  each  category-value  are  associated  in  the 
same  manner. 


3.  FEATURE  OPERATIONS 

Gazdar  (1985)  defines  two  feature  operations  which  are 
important  in  parsing:  extension  and  unification.  An 
extension  is  defined  as  follows:  category  A  is  an  extension 
of  category  B  if  and  only  if  (1)  all  of  the  feature-value 
pairs  of  category  B  are  present  in  category  A,  and  perhaps 
more,  and  (2)  all  of  the  category-valued  features  of  B  have 
the  same  values,  or  extensions  of  those  values  in  A.  VP, 
VP(PAST  +)  and  VP(AGR  NP)  are  all  extensions  of  VP. 

Extensions  can  be  defined  in  terms  of  bit  vectors: 
Category  A  is  an  extension  of  category  B  iff 

(A's  BIT  value)  v  (B's  BIT  value)  =  (As  BIT  value). 
This   is  implemented  by   a   primitive   function   log-ior 
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(inclusive  or)  which  takes  two  vectors  as  arguments.  If 
either  category  contains  a  category-valued  feature,  the 
extension  operation  is  repeated  for  the  vectors  associated 
with  the  category-valued  feature. 

The  second  feature  operation  is  unification.  The 
unification  of  two  or  more  categories  is  the  smallest 
category  which  is  an  extension  of  each,  if  one  exists.  For 
example,  the  unification  of  [(N  +)  (V  -)  (BAR  0)]  and 
[(N  +)  (SUBCAT  1)]  is  [(N  +)  (V  -)  (BAR  0)  (SUBCAT  1)].  The 
unification  of  [(N  +)  (V  +)  (BAR  0)]  and  [(N  +)  (V  -)]  is 
undefined  because  of  the  conflicting  values  for  V.  In 
our  bit  vector  representation,  unification  involves 
comparing  the  values  of  categories  for  each  atomic  feature, 
using  log-and.  If  the  result  is  zero,  the  operation  fails. 
Otherwise,  the  result  becomes  the  new  value  for  that 
feature,  and  the  operation  is  repeated  for  the  next 
feature  in  the  vector.  Finally,  the  unification  of  category 
valued  features  involves  unifying  each  atomic  feature 
within  the  category  vectors. 


4.  THE  PARSER 

Our  implementation  uses  a  chart  parser,  Earley  (1970), 
a  bottom-  up,  non-deterministic  device.  The  input  directs 
which  rules  are  tried.  All  completed  nodes  are  recorded  in 
the  chart,   and   all  intermediate  results  are  saved. 

After  the  rule  schemas  of  the  grammar  are 
represented  in  bit  vector  format,  they  are  organized 
into  a  network  which  facilitates  bottom-up  parsing.  The 
first  constituent  of  each  rule  schema  is  given  a  PREDICTS 
property.  The  value  of  that  property  is  a  list  of  the 
remaining  sisters  in  the  rule  schema,  followed  by  the 
mother.  If  a  category  is  the  first  constituent  in 
more  than  one  rule,  then  the  PREDICTS  property  for  that 
category  contains  a  list  of  such  elements.  The  parser  adds 
new  predictions  to  the  chart,  and  attempts  to  match  the  new 
predictions  to  the  previous  ones,  until  all  of  the 
input   is  processed. 


5.  LATTICE  OF  EXTENSIONS 

Because  we  are  dealing  with  rule  schemas  in 
parsing,  in  determining  which  categories'  predictions  to 
consider,  note  that  any  categories  to  which  the  input 
category  is  an  extension  are  candidates.  To  provide  an 
efficient  mechanism  to  retrieve  all  underspecif led 
categories  given  an  extension,  we  organize  all 
categories  into  a  lattice.  Following  the  directed  arcs 
from  an  input  category  through  all  possible  paths  retrieves 
all  and  only  the  categories  to  which  an   input  category   is 
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an  extension, 


6.  FEATURE  INSTANTIATION  PRINCIPLES 

GPSG  proposes  feature  instantiation  principles, 
restrictions,  and  default  values  which  we  discuss 
below.  The  feature  instantiation  principles  are  invoked 
when  a  rule  is  about  to  be  added   to  a  parse  tree. 


7.  THE  HEAD  FEATURE  CONVENTION 

When  the  head  of  a  rule  schema  is  matched  with  an 
extension,  the  head  features  of  the  extension  must  be 
"passed  up"  to  the  mother.  If,  for  example,  the  head  of  a  VP 
expansion  rule  matches  with  VO(PAST  +),  the  mother,  VP, 
must  be  marked  as  (PAST  +)  as  well,  since  PAST  is  a  head 
feature.  Mapping  the  head  features  onto  the  mother 
involves  several  additional  pieces  of  information.  The 
first  is  a  bit  vector,  initially  of  ones,  which  indicates 
which  features  are  head  features.  This  is  created  when 
the  feature  representation  is  created,  by  turning  off  those 
features  marked  in  the  input  as  head  features.  This  vector 
is  associated  with  a  global  variable  *head-feature*  as  a  BIT 
property . 

A  second  vector  associated  with  each  category 
(called  BIT2),  screens  out  the  inherited  features  of  the 
mother,  since  they  are  not  subject  to  the  HFC.  In  the  BIT2 
vector,  all  bits  that  correspond  to  inherited  features 
are  turned  on.  The  BIT2  vector  for  VP  is  represented  as: 


(5) 


«*11  11  111  000     00     0 
V   N   BAR  SUBCAT  PAST   AGR 


Now  assume  that  the  head  of  the  rule  schema 
(VP  ->  VO  NP)  matches  with  V0(PAST  +).  The  program  alters 
the  mother  VP  to  VP(PAST  +)  by  the  following  sequence  of 
logic  operations! : 

(6)   BIT  of  head  (VO  (PAST  +))  »*1001100000101  1 

BIT  of  mother  VP  0*1001001000111  2 

BIT2  of  mother  VP  <»*1111111000000  3 

BIT  of  *head-feature*  «*0000000111000  4 


3  V  4 

5  V  1 

6  "  2  =  VP(PAST  +) 


«*1111111111000  5 
1»*1111111111101  6 
#♦1001001000101   7 


When  the  head  feature  modifies  an  existing  vector,  a 
new  category  symbol  is  created  if  necessary.  All  of  the 
categories  of  the  grammar  are  stored  in  a  binary 
discrimination  tree  for  efficient  retrieval. 


120 


8.  FOOT  FEATURE  PRINCIPLE 

The  Foot  Feature  Principle  (FFP)  governs  the  agreement 
properties  of  unbounded  dependencies.  All  foot  features  are 
category-valued.  The  FFP  constrains  the  instantiation  of 
these  features  in  the  following  manner:  the  unification 
of  the  foot  features  instantiated  on  the  daughters  must 
also  be  instantiated  on  the  mother. 

The  implementation  of  the  foot  feature  principle  is 
straight-  forward.  Inherited  feature  specifications  and 
non-foot  features  are  screened  out  of  the  vector  of  each 
daughter  in  a  similar  manner  to  the  HFC.  The  unification 
of  the  resultant  vectors  of  all  daughters  is  then  obtained; 
the  unification  and  the  BIT  vector  of  the  mother  is  and-ed 
to  turn  off  bits  for  instantiated  foot  features  on  the 
mother;  and  the  category  under  the  relevant  property  of  the 
daughter  is  copied  onto  the  same  property  on  the  mother . 


9.  FEATURE  COOCCURRENCE  RESTRICTIONS 

Feature  cooccurrence  restrictions  (FCRs)  provide 
further  constraints  on  possible  categories  by  indicating 
two  or  more  feature  values  which  cannot  coexist  in  a 
category.  FCRs  typically  take  the  form  of  logical 
implications;  for  example  (PAST  +)  -->  (V  +)  states  that  any 
syntactic  category  which  has  the  value  +  for  PAST  must  also 
have  a  +  value  for  V. 

In  our  implementation,  FCRs  are  formulated  as  illegal 
bit  patterns  in  BIT  vectors.  First,  note  that  the  above  FCR 
is  equivalent  to  "[(PAST  +)  "  ~(V+)3.  (V+)  is  represented 
as  1**1011111111111  (assuming  our  original  feature  set)  and 
~(V  +)  is  represented  as  *»*0011111111111  v  *#01111111111H 
V  1**1111111111111.  Since  (PAST  +)  is  represented  by 
1»*  111111111 10 11,  the  above  FCR  is  then  represented  by 
the  following  three  bit  patterns:  ~lt*0011111111101 , 
"11*0111111111101  and  "11*1111111111101.  Syntactic 
categories  which  exhibit  these  patterns  (checked  with 
log-and)  are  blocked  as  ungrammatical . 


10.  FEATURE  SPECIFICATION  DEFAULTS 

Finally,  feature  specification  defaults  (FSDs)  are 
applied  before  the  parse  tree  is  accepted.  These  are 
default  values  for  specific  features  which  are  valid  if  and 
only  if  the  values  of  the  features  involved  have  not 
been  specified  after  all  instantiation  principles  have  been 
applied.  Invoking  defaults  is  carried  out  with  log-and  as 
well. 
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11.  JAPANESE  PARSER  AND  MORPHOLOGICAL  COMPONENT 

Given  an  appropriate  grammar  and   lexicon,  our  parser 

processes   English  and  Japanese   sentences   in  a  similar 

manner;  however,  the  morphological  component  has  specif icly 

been   developed   for   Japanese.   The   concept  of  word  has 
little  reality  in  agglutinative  languages  like  Japanese,  and 

an   input   sentence   in  Japanese  is  a  string   of  morphemes. 

The  task  of  the  morphological  component  is  to  segment  an 

input   string    into  morphemes  and  to   label  them  with 
categories  prior  to  parsing. 

Entries  of  the  Japanese  lexicon  consist  of  morphemes 
with  their  categories  and  conjugation  classes.  Since 
morphological  alterations  of  morphemes  are  complex  but 
regular  with  few  exceptions,  only  one  form  of  each  morpheme 
is  listed  in  the  lexicon,  and  it  is  altered  during  the 
segmentation  of  an  input  string. 

The  morphological  component  is  provided  with 
context-free  rules  that  represent  morpho-phonological 
constraints  on  sequences  of  morphemes:  ie.  the  rules  admit 
a  particular  allomorph  depending  upon  the  category  and 
conjugation  class  of  the  morpheme,  and  also  upon  those  of 
the  immediately  following  morpheme.  Although  the  basic 
mechanism  is  that  of  a  context-free  parser,  a 
theoretical  distinction  should  be  drawn  between  a  syntactic 
parser  and  a  morphological  parser:  while  the  former  assigns 
constituent  structures  to  an  input  sentence,  the  latter 
checks  the  morpho-phonological  alterations  of  morphemes  with 
respect  to  adjacent  morphemes.  After  morphemes  are 
identified  and  labeled  with  syntactic  categories,  the 
morphological  information  irrelevant  to  syntax  is  dropped 
and  the  syntactic  parser  is  not  concerned  with  specific 
forms  of  morphemes . 

A  linguistic  question  is  the  deterr-ination  of 
syntactic  units:  strictly  speaking,  morphemes  are  the 
unit  of  morphology  and  not  necessarily  of  syntax.  The 
decision  employed  in  the  implementation  is  to  list  the 
morphemes  in  the  lexicon  which  play  syntactic  roles  in 
determination  of  syntactic  structures  of  sentences:  eg. 
morphemes  that  determine  subcategorization  frames  ("want 
to"  -tai-)  and  morphemes  that  iteratively  appear  in  a 
sentence  (causative  -sase-,  passive  -rare-).  The  other 
morphemes  are  handled  in  terms  of  features  in  categories  of 
those  listed  morphemes,  as  was  traditionally  the  case  with 
the  English  plural  noun  morpheme  -s. 
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12.  EXAMPLE  PARSES 

The  example  grammar  for  Japanese  passive  and  causative 
constructions  contains  17  schematic  phrase  structure  rules, 
and  each  syntactic  category  consists  of  9  features  which 
take  the  total  of  24  possible  values.  The  complete 
instantiation  of  the  rule  schemas  would  expand  the  grammar 
of  17  rule  schemas  to  at  least  3ii  instantiated  rules,  which 
would  require  a  considerable  amount  of  search  space  and  time 
for  parsing.  The  parser  instantiates  unspecified  features 
in  the  rule  schemas  only  as  necessity  arises,  according  to 
the  principles  of  GPSG  and  information  associated  with  the 
input  string  in  the  lexicon. 

The  following  input  sentence  is  two-way  ambiguous,  and 
two  parse  trees  are  produced  by  the  parser.  The  rule 
schemas  are  instantiated  to  no  more  than  30  rules  in  the 
course  of  parsing,  which  takes  approximately  1.1  seconds. 

(7)  Tarou  ga   Tsuneko  ni    sakana  o   tabe  sase  rareta 
Tarou  SDBJ  Tsuneko  to/by  fish   OBJ  eat  CAUS  PASS 
'Tarou  was  made  to  eat  fish  by  Tsuneko.' 
'Tarou  was  made  to  feed  fish  to  Tsuneko.' 


TAROU  GA  TSUNtKO  MI  SAKANA  O  TABE  iAit  RARBTA 


«it«  of  iniwen 
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TAROU      GA      T3UNEX0 


13.  CONCLUSION 

Others  working  within  the  framework  of  GPSG  have 
adopted  a  similar  approach,  among  them  Kay  (1979),  who 
proposes  expanding  metarules  on  the  fly  while  parsing,  and 
Barton  (1985),  who  examines  the  implications  of  parsing 
with  the  separation  of  the  immediate  dominance/linear 
precedence  (ID/LP)  relation.  Our  parser  takes  a  grammar  in 
the  form  of  context-free  rules:  metarules  and  the  ID/LP 
separation  have  not  been  incorporated,  nor  has  the  Control 
Agreement  Principle.  As  described  in  this  paper,  our  main 
emphasis  has  been  on  representing  features  and  rule  schemas, 
and  on  feature  instantiation  during  parsing. 


NOTE 
1  In  this  example,  SUBCAT  is  not  a  head  feature. 
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GEMINATES  AND  INALTERABILITY  IN  MALAY* 


Boon  Seong  Teoh 


In  Malay,  an  Austronesian  language  spoken  in 
Malaysia,  there  is  a  very  general  rule  that 
glottalizes  /k/  to  [?]  by  removal  of  the 
point  of  articulation  features  in  syllable 
codas,  e.g.  /masak/  'to  cook'  [masa?].  Malay 
also  has  a  pair  of  vowel  initial  suffixes 
which  geminate  a  preceding  consonant  and 
begin  with  a  homorganic  glide  after  high 
vowels.  A  natural  analysis  posits  an  empty 
X-slot  for  these  suffixes  which  links  to  the 
preceding  segment.  With  stems  ending  in  a 
velar  we  incorrectly  predict  blockage  of 
glottalization  by  geminate  inalterability. 
The  purpose  of  this  paper  is  to  attempt  a 
solution  to  the  above  problem. 


1 . 0  Introduction 

Recent  work  in  phonological  theory  has  centered  largely  on 
the  nature  of  phonological  representations.  Issues  of 
phonological  representation  presented  a  very  fertile  area  of 
research  in  the  past  decade.  This  work  has  suggested  that  the 
essentially  linear  representations  of  standard  generative 
phonology  need  to  be  expanded  in  various  non- linear  ways  to 
include  the  organization  of  segments  into  syllable  'trees'  and 
metrical  structures  as  well  as  autosegmental  tiers.  The 
development  of  richer  notions  of  phonetic  structure  implies  a 
richer  notion  of  phonological  structure  as  well.  But  as 
Kenstowicz  (1985)  aptly  observes,  such  development  'counts  as 
real  progress  only  if  we  have  not  simply  shifted  complexity  and 
indeterminacy  from  the  rule  system  to  the  representational 
system.  Thus  crucial  to  the  entire  enterprise  has  been  the 
search  for  principles  constraining  the  structure  of  phonological 
representations  and  the  ways  in  which  rules  may  refer  to  and 
manipulate  such  representations'. 

In  this  paper  we  examine  the  general  rules  of  glottal 
formation  and  gemination  and  see  how  phonological  representation 
can  restrict  rule  application  and  account  for  the  phonetic  facts 
in  Malay. 


2 . 0   Data 
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Malay  has  19  consonantal  phonemes  which  are  as  follows; 
(1) 


stop 

affricate 

continuant 

liquid 

nasal 

glide 


P  b 


t  d 

c  j 
s 

1  r 

n  n 

h  y 


k  g 


and  a  vowel  inventory  of  six  underlying  vowels  /a.i.u.e.o,  and  e/ 
(e  =  schwa).   Malay  has  three  suffixes: 


(2) 

/-an/ 
/-i/ 

/-kan/ 


nominal  suffix 

verbal    suffix    (semantic   role   of  goal, 

benef active) . 

verbal  suffix   ( transitivization  and  semantic 

role  of  accusative,  benef active) . 


We  find  the  following  alternations  in  the  language: 


(3) 

/masak/ 
/masak  +  kan/ 
/masak  +  an/ 


/masak  +  i/ 


/galak/ 
/galak  +  kan/ 
/galak  +  an/ 


/masuk/ 

/ke  +  masuk  +  an/ 


/masuk  +  kan/ 


to  cook' 
cook'  (imp. ) 
cooking,  dish' 


cook  for' 


to  encourage' 
encourage! ' 
encouragement ' 


to  enter' 
entrance' 


enter! '  (imp. ) 


[masa? ) 

[masa?kan] 

[masa?kan] 

*lmasa?an] 

*[masakan] 
[masa?ki ] 

*[masa?i] 

*[masaki] 

[gala?] 
[ gala?kan] 
[gala?kan] 
*[gala?an] 
*[galakanj 

[maso? ] 

[kemaso?kan] 
*[kemasokan] 
*[kemaso?anl 

(maso?kanl 
*[masokan] 
*[maso?an] 
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/gerak/ 
/gerak  +  kan/ 
/gerak  +  an/ 


to  move' 
move! '  (imp. ) 
movement ' 


[gera?] 
Igera?kan] 
(gera?kan] 
♦igerakan] 
*[gera?anl 


/hendak/  'to  want'  [henda?] 

/di  +  ke  +  hendak  +  i/    'needed'  (pass) [dikehenda?ki ] 

♦Idikehendaki) 
*[dikehenda?i] 


/ber  +  ke  +  hendak  +  an/ 


'need  for  something' 

[ bekehenda?kan ] 
* [ bekehendakan ] 
*(bekehenda?anl 


/rosak/ 
/meN  +  rosak 


kan/ 


/ke  +  rosak  +  an/ 


/di  +  rosak  +  i/ 


'spoilt' 

'to  spoil'  (act, 


spoilt 


' spoilt' (pass. ) 


[ rosa?) 

[merosa?kan] 
*Imerosakan] 
*[merosa?an] 

[kerosa?kan] 
*[kerosakan] 
*[kerosa?an] 

[dirosa?ki ] 
*[dirosaki ] 
*[dirosa?i] 


In  the  above  data  we  note  that  the  velar  stop  in  syllable 
coda  position  glottalizes  to  [?].  In  loanwords  too  this 
glottalizing  effect  seems  to  be  at  work. 

(4) 


[bE?  or  bEk"*  ] 

•beg' 

[ra?bi] 

' rugby ' 

[ba?ti] 

'service 

[ta?de] 

'fate' 

Note:  (E  =  e  ) 


deed 


The  following  are  further  examples  of  consonant  stem  final 
words  when  suffixed  with  /-an/,  /-i/  and  /-kan/. 


(5) 

slow  speech 

fast  speech 

(a)   /letup/ 

/letup  +  an/ 
/letup  +  i/ 
/letup  +  kan/ 

'explode'       [ letop^  ] 
explosion'     (letop^pan) 
to  explode'    [letop^pi] 
explode'  (imp) ( letop^ kan] 

[  letop^  ] 
[letopan] 
I letopi ] 
lletop^kan] 

(b)   /ikat/ 

/ikat  +  an/ 
/ikat  +  i/ 
/ikat  +  kan/ 

tie'        (?ikat  or  ikat] 
ties'         [ikat' tan] 
'to  tie'       [ikat'  ti] 
'tie'          (ikat^kan) 

(?ikat  -  ikat] 
[ikatan] 
[ikati] 
[ikat'  kan] 
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(c)   /kenal  +  an/    'acquaintance'  [kenallan] 
/di  +  kenal  +i/  known'       [dikenalli] 


(d) 


/rumus  +  an/ 
/asas  +  i/ 


formula' 
fundamental 


[ rumossan] 
[ asassi] 


(e)   /tanam  +  an/    'plants'       [tanamman] 
/di  +  tanam  +  i/  'planted  at'  [ditanammi] 


[ kenalan] 
[ dikenali ] 

( rumosan] 
[ asasi ] 

[ tanaman] 
(ditanami ] 


(f) 

(g) 

(h) 


/per  +  tandirj  +  an/  'competition' 

[petandei3i}anl  [petanderjan] 
/meN  +  tandirj  +  i/   'to  compete'  [  menander}i}i  ]  [mSnandegi] 

/per  +  jalan  +  an/   'journey'  [pejalannan]    [pejalanan] 
/meN  +  jalan  +  i/    'to  undergo'   [menjalanni]  [mejalani] 


/putar  +  an/ 
/meN  +  putar 


'revolutions'   (putarran] 
i/  'to  revolve'  [memutarri] 


[putaran] 
[ memutari ] 


(i)   /pilih  +  an/    'choice'       [pilehhan]      [pilehan] 

/meN  +  pilih  +  i/   'to  choose  for'  [mSmilehhi]  [mSmilehi] 

In  the  above  data,  we  note  that  when  consonant- final  stems 
are  suffixed  with  vowel-initial  suffixes  the  final  consonant  of 
the  stem  is  geminated.  In  slow  speech  this  is  very  obvious.  In 
the  so-called  fast  speech.  however,  there  seems  to  be 
degemination. 

Further  data  reveal  that  when  vowel  final  stems  are  suffixed 
with  /-an/  and  /-i/  we  have  the  following: 

(6) 


/ke  +jadi  +  an/ 
/pandu  +  an/ 
/temu  +  i/ 
/suka  +  i/ 
/ke  +  suka  +  an/ 


'happening' 
'guide' 
'to  meet' 
'to  like' 
'happiness' 


[kejadiyan] 
[panduwan] 
[ temuwi ] 
( suka?i] 
[ke suka? an] 


The  data  above  show  that  vowel- final  stems  surface 
differently  when  suffixed  with  /-an/  and  /-i/,  i.e.  a  homorganic 
glide  is  found  after  high  vowels  and  a  glottal  stop  after  the 
non-high  vowel. 

Data  from  two  other  dialects  in  Malaysia,  namely  the 
Kelantan  and  the  Terengganu  dialects,  reveal  the  following: 


(7) 

/masak/  'to  cook' 

/masak  +  an/  'dish' 

/masak  +  kan/  'cook' 


Standard  Malay  Kelantan  Terengganu 

[masa?]  [masa?]^       [masa?] 

[masa?kan]  [masa?kE]  [masa?kai}] 

[masa?kan]  [masa?kE]  [masa?kar}] 
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/ikat/         'to  tie'       [ikat  ]     [ika?]         [ika?] 
/ikat  +  an/    'ties'         [ikatan]    [ika?tE]    [ika?tai}] 
/ikat  +  kan/    'tie!'         [ikat'kan]  [ika?kE]    [ika?kai}] 

/letup/         explode'       [  letop""  )    [  letu?  ]  _,    ( letu?  ] 
/letup  +  an/  explosion'       (letopan)   [letu?pE]   [letuPpag] 
/letup  +  kan/  explode!'      [  letop' kan]  [  letu?kE]   [letu?kar)] 

The  data  above  illustrate  that  glottalization  not  only 
applies  to  velar  stops  in  the  Kelantan  and  Terengganu  dialects, 
but  all  voiceless  stem-final  obstruents  are  affected  as  well. 
From  the  data  above  we  can  establish  the  stem  final  /k/  to  be  the 
underlying  form  of  words  in  /masak/,  /masuk/,  /hendak/  etc.  In 
these  dialects  the  stem  final  /k/  resurfaces  after  the  glottal 
stop.  If  we  take  the  position  that  the  stem  final  is  an 
underlying  /?/,  the  learner  would  then  have  to  memorize  which 
allomorphs  of  the  suffix  /-pan/,  /-tan/  and  /-kan/  each  stem 
takes.   Such  an  analysis  lacks  descriptive  adequacy. 

3.0  Analysis  and  discusaion 

Given  the  phonological  representation  that  is  available  in 
Sagey  (1986),  we  might  represent  a  voiceless  velar  stop  as 
follows. 

(8)   Representation  of  a  voiceless  velar  stop 

X 


root 


Laryngeal  tier 
I -voice] 


supralaryngeal 
tier 


place 


dorsal 
(+back] 

The  above  is  a  representation  of  the  voiceless  velar  stop 
[k]  in  Malay.  In  this  representation,  the  supralaryngeal  tier 
dominates  the  place  node,  which  in  turn  dominates  the  terminal 
node  Dorsal,  which  is  specified  as  [+back].  The  laryngeal  tier, 
where  voicing  is  specified,  has  the  feature  [-voice].  In  this 
representation,  the  manner  of  articulation  is  independent  of  the 
supralaryngeal  tier  (cf.  Clements  1985). 
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We  might  then  formulate  the  glottal  formation  rule  (GFR)  for 
Malay  as  one  which  delinks  the  supralaryngeal  node  of  a  /k/ 
segment  in  syllable  coda  position.  Thus  a  root  with  {+cons., 
-cont. j  manner  of  articulation  and  with  [-voice]  specified  at  the 
laryngeal  tier  without  a  supralaryngeal  tier  will  be  realized  as 
a  glottal  stop  [?].  The  GFR  is  thus  seen  as  the  result  of  the 
delinking  of  the  supralaryngeal  node  of  the  velar  stop  (and  for 
all  voiceless  stops  for  the  Kelantan  and  Terengganu  dialects) 
from  the  root-node  at  syllable  coda  position  leaving  only  the 
laryngeal  node  linked  to  the  root  node. 

(9)   Glottal  Formation  Rule  (as  delinking) 


R 


X/ 


With  the  above  rule  we  can  then  account  for  stems  in  Malay  which 
end  in  a  voiceless  velar  stop  and  in  general  for  voiceless  velar 
stops  in  syllable  coda  position. 

We  shall  also  at  this  juncture  crucially  assume  that  vowel- 
initial   suffixes  in  Malay,   i.e.   /-an/  and  /-i/  possess  an 
underlying  representation  with  an  extra  empty/featureless 
X-slot  as  shown  below: 


(10)   Underlying  representation  of  suffixes  in  Malay 
(a)       an        (b)       i         (c)       kan 


XXX 


XXX 


XX 


A  natural  analysis  that  ensues  from  these  underlying 
representations  is  to  say  that  the  final  segment  of  the  stem  then 
links  to  the  empty  X-slot  of  the  suffix.  With  this  we  would  then 
account  for  all  the  geminated  forms  of  stems  ending  in  a 
consonant  other  than  a  voiceless  velar  stop  when  suffixed  with 
/-Xan/  or  /-Xi/. 
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(11)   kenal 


kenal 


XXX 


XXX 


The  homorganic  glide  which  appears  in  stems  ending  with  a 
high  vowel  can  be  explained  by  the  same  feature  spreading  rule. 
In  this  case,  however,  only  the  features  of  [ +high  oback) 
features  of  the  supralaryngeal  node  of  the  final  vowel  of  stem 
spread  on  to  the  available  empty  X-slot  of  vowel  initial 
suffixes.  Since  the  melody  which  is  realized  in  the  onset  of  the 
syllable  it  appears  as  a  glide. 

(12)   Spreading  of  [-t-high  aback]  features 

Ike[jadi]„  +   anJi   "happening"   [[bantu  ]„  +   an],   "aid" 


XXX 
11/ 


OR 

\! 


xxxxx 

1111 

OR  OR 

\!   \! 


Feature  spreading 

+high 
aback 

l\ 

(ke[jadi   yan] 


XX  XXXX  XXX 

mill  IK 

OR  OROR  OR 

\l   \!\!  \! 

a      a   a  a 


To  account  for  the  manifestation  of  the  glottal  stop  in 
words  like  [kesuka?an]  and  [suka?i),  we  note  that  the  stem  ends 
in  a  [-high]  vowel  /a/.  Evidently  the  spreading  of  features 
unidirectionally  spreads  only  high  features  from  left  to  right. 
Since  nothing  spreads  from  stems  ending  with  low  vowels,  we  would 
invoke  a  default  rule  to  account  for  the  glottal  stop  in  the 
empty  X-slot  of  vowel  initial  suffixes  which  essentially  says 
that  no  X-slot  shall  remain  unspecified. 


(13) 
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Unspecified  empty  X-slot  shall  by  default  be  realized  as  [ ? ] 
(14)   Glottal  formation  by  default 
[ke  [suka]„    an)i       >      [kesuka?an] 


XX  XXXX 


OR   OROR 

\!  \!\! 

a        a    a 


XXX 
I!/ 


OR 

\! 


XXXXXXXXX 

!/ 


OROROROR 

\l\!\!\! 
a    a   a    a 


Given  the  fact  that  gemination  is  seen  as  spread  how  would 
one  fit  this  analysis  into  Hayes  (1986a,  1986b),  Schein  & 
Steriade  (1986)  with  regards  to  their  observation  about  the 
interaction  of  phonological  rules  with  geminates?  In  Steriade  & 
Schein  (1984)  proposals  about  how  geminates  can  be  represented 
within  a  non- linear  framework  were  made.  Distinctions  were  made 
between  a  monosegmental  geminate,  i.e.  a  segment  with  multiple 
associations  to  the  skeleton,  and  a  bisegmental  heteromorphic 
geminate  or  adjacent  identical  cluster.  Such  a  distinction  is 
deemed  necessary  as  increasing  evidence  (Kenstowicz  &  Pyle 
(1973),  Steriade  (1982),  Hayes  (1986a,  1986b)  shows  the  failure 
of  segments  forming  halves  of  geminates  to  undergo  rules  they 
would  a  priori  be  expected  to  undergo.  The  standard  example  from 
Tigrinya  by  Schein  (1981)  and  Kenstowicz  (1982)  shows  how  when  a 
velar  stop  forms  the  first  half  of  a  monosegmental  geminate,  (or 
true  geminates  k   la  Hayes)  postvocalic  spirantization,  (i.e. 

/k/  -->  [x]  /  V )  fails  to  apply;  heteromorphemic  geminates  (or 

false  geminates)   do,   however,   permit  spirantization.   Thus  we 
have  the  following  data  from  Tigrinya: 


(15) 

a.  /?akaleb/  -- 

b.  /fakkar  -  a/ 


[?axaleb] 
>   [ fakkara] 


"dogs" 

(cf.  [kalbi]  "dog") 

"he  boasted" 

* [ faxxara ] 

*[ faxkara] 


In  the  above  data,  (15a),  the  obstruent  /k/  spirantizes  to  [x], 
but  in  (b)  the  geminate  /kV  fails  to  undergo  the  postvocalic 
spirantization  rule.  The  geminates  are  neither  wholly  nor 
partially  spirantized,  thus  respecting  the  integrity  of 
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geminates.     This  phenomenon  has   also  been  called  "geminate 
inalterability"  by  Hayes  (1986a). 

It  would  seem  that  the  application  of  GFR  violates  the 
integrity  of  geminates  and  should  be  blocked  by  the 
inalterability  condition  in  the  cases  of  stems  with  final  velar 
suffixed  to  /-Xan/  and  /-Xi/:  /masak  +  Xan/  and  /masak  +  Xi/  are 
realized  as  [masa?kan]  and  [masa?ki].  The  inalterability 
condition  does  not  block  application  of  GFR  to  the  word  /masak  + 
kan/  "cook!"  (imp.)  found  in  (7)  above  because  underlyingly  both 
identical  segments  are  a  result  of  morpheme  concatenation  and 
thus  no  multiply- linked  representation  appears. 

The  palatalization  in  English  su[ks]ede,  a[ks]ede  poses  a 
similar  challenge  to  recent  autosegmental  theory.  The  most 
natural  analysis  completely  assimilates  the  prefixal  consonant, 
creating  a  multiply-linked  geminate,  as  in  (16). 

(16) 


sub+kede 

sub+kede 

1  1 

-> 

/! 

X  X 

XX 

But  then  by  inalterability  (Hayes  1986a)  we  expect  either 
that  palatalization  will  block  on  the  geminate  or  that  it  will 
apply  across-the-board  to  yield  a  long  [s].  The  palatalization 
of  just  the  second  half  of  the  geminate  is  thus  problematic. 
Steriade  &  Schein  (1986)  point  out  that  Italian  palatalizes  the 
entire  geminate  ( su( ccjedere)  and  appeal  to  the  fact  that  Italian 
preserves  consonant  gemination  while  English  does  not  to  explain 
the  palatalization  facts.  Since  the  English  palatalization  rule 
is  a  rather  marginal  phenomenon,  however,  one  might  legitimately 
wonder  whether  the  su[ks]ede  facts  are  a  serious  problem  for 
autosegmental  theory. 

In  this  paper  we  can  draw  an  analogous  contrast  between  two 
languages  of  the  Pacific,  i.e.  Malay  and  Toba  Batak  where  there 
can  be  no  doubt  as  to  the  productivity  of  the  rules  and  hence  the 
legitimacy  of  the  problem.  We  have  seen  that  in  Malay  the  GFR 
glottalizes  /k/  to  [ ? ]  by  removal  of  the  point  of  articulation 
features  in  syllable  codas.  Recall  from  (7)  that  in  the  Kelantan 
and  the  Terengganu  dialects  the  GFR  has  been  extended  to  the 
remaining  stops  /t,p/  and  we  find  [?t]  and  [?p]  in  the  gemination 
context 

Toba  Batak  (Hayes  1986b)  has  at  least  two  separate  rules 
that  create  geminates  by  spread  (17). 

(17)      a.      n  +  Ci       ->      CiCi 

b.       [-sonJi    +  h  ->    ( -sonJi [ -sonli 

marisap  hita  "let  us  smoke" 
IP  Pl 
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In  both  cases  the  geminates  escape  the  Batak  rule  glottalizing 
preconsonantal  stops  (18). 

(18)   [-cont]  C  ->  ?C 


lap  pingOl 

"wipe-off   ear' 

[?   p] 

1 

f]ote:     (0 

=  0) 

Hayes  cites  these  Batak  data  in  support  of  the  inalterability 
phenomenon  as  a  diagnostic  for  multiple  linking  and  hence  that 
assimilation  is  autosegmental  spreading,  as  originally  proposed 
by  Goldsmith  (1981). 

Thus,  Malay  and  Batak  contrast  as  English  and  Italian  do 
with  respect  to  inalterability.  Crucially,  Malay  and  Batak  also 
contrast  with  respect  to  gemination.  Malay  has  no  morpheme 
internal  geminates  while  Batak  does  (19). 

(19)  lOppa   "cook",   pittu  "door",   nakkin   "just  now" 

We  might  explain  these  facts  by  supposing  that  all  grammars 
respect  the  OCP  morpheme-internally  and  thus  bar  successive 
occurrences  of  the  same  phoneme  but  that  English  and  Malay  have 
an  additional  constraint  barring  geminates  (20). 

(20)  *  C 

|\ 
X  X 

When  spreading  rules  create  representations  that  violate 
this  constraint  (20)  they  are  repaired  by  separating  the  derived 
geminate  to  yield  single  linking,  whence  the  su[ks]ede  and 
masa[?k]an  facts  follow.  We  have  the  following  derivation  for 
the  word  /masak  +  Xan/  'the  cooking,  dish'. 

(21)  (a) 

masak  +   an    >   masak  +an  --> 

1 1  M  I     II  II 1 1 1\  1 1 

XXXXX   XXX  xxxxxxxx 

(b) 

masak  +kan   >   masa?   +kan 

Mill    III         Mill    III 

XXXXX     XXX  XXXXX    XXX 

In  Malay,  this  would  then  allow  our  GFR  to  apply  and  account  for 
[masa?kan]  "the  cooking,  dish"  and  [masa?ki]  "to  cook  for". 
These  results  thus  raise  the  more  general  theoretical  question  of 
how  phonological  constraints  control  rule  application.  An 
illegal  representation  created  by  spreading  is  repaired  by 
minimal  readjustment  of  the  autosegmental  linkings. 
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4.0   Conclusion 

In  our  analysis  we  are  able  to  account  for  the  glottal  stop 
in  [masa?kan]  for  both  the  words  with  underlying  structures  as 
/masak  +  Xan/  and  /masak  +  kan/,  and  [masa?ki]  for  /masak  +  Xi/ 
in  Malay. 

In  analyzing  Malay  we  found  that  it  does  not  in  general 
allow  geminates  and  a  constraint  like  (20)  above  bars  them  from 
surfacing.  However,  the  language  has  empty  X- slots  in  the 
suffixes  which  allow  spreading  and  create  linked  structures  or 
geminates.  Thus,  the  general  question  is  raised  as  to  how  Malay 
like  English  can  resolve  in  an  efficient  manner  the  question  of 
gemination  created  by  spread  and  overcome  the  general 
phonotactics  of  the  language  which  bars  such  structures.  This 
would  then  allow  for  phonological  rules  like  the  GFR  to  apply 
which  would  otherwise  be  incorrectly  blocked  from  applying. 
We  have  given  an  explanation  that  relies  on  a  minimal 
readjustment  in  the  autosegmental  linkings  to  make  the 
representation  conform  to  the  Universal  Grammar  and  language- 
particular  constraints. 
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Mirror  of  Language:  The  Debate  on  Billnaualism 
By  Kenji  Hakuta.     New  York:    Basic  Books,  1986.    Pp.  268.     $9.95. 


Reviewed  by  Cher-leng  Lee 


This  book,  as  Hakuta  says,  has  been  written  for  two  reasons(p. 
ix.x):  first,  to  help  parents,  students,  professionals,  politicians  and 
other  concerned  people  decide  about  whether  or  not  bilingualism  is 
good(see  p.  ix).  second,  to  remind  colleagues  in  the  social  and 
behavioral  sciences  of  the  breadth  of  the  research  agenda  of 
bilingualism,  and  to  emphasise  the  importance  of  studying  research 
problems  related  to  bilingualism  in  their  widest  contexts.  An 
additional  expectation  of  Hakuta  is  that  the  book  will  serve  as  an 
inspiration  for  research  to  students  in  a  variety  of  disciplines. 

The  nine  chapters  of  the  book  (including  an  introduction,  pp.  3  - 
13,  and  a  concluding  chapter:  reflections  on  bilingualism,  pp.  231- 
241,)  cover  the  following  topics:  bilingualism  and  intelligence  (pp. 
14-44);  childhood  bilingualism  (pp.  45-72);  the  bilingual  mind  (pp 
73-105);  how  children  learn  a  second  language  (pp.  106-136);  how 
adults  learn  a  second  language  (pp  137-164);  bilingualism  in  society 
(pp.  165-192)  and  bilingualism  education  (pp.  193-230).  The  book 
also  contains  a  bibliography  (pp.  243-255)  and  an  index  (pp.  257- 
268). 

What  we  see,  then,  is  that  in  these  chapters,  the  "pieces  of  one 
puzzle"  (p.  x)  have  been  put  together.  However,  a  problem  remains: 
the  book  does  not  necessarily  help  one  to  make  a  decision  foi  or 
against  bilingualism.  For  most  of  the  book,  Hakuta  presents 
opposing  positions  of  an  issue  and  then  tries  to  "reconcile"  the 
differences  or  "  integrate"  them.  Although  there  is  a  general 
inclination  favorable  towards  bilingualism,  he  does  not  make  strong 
conclusions  on  the  exciting  ongoing  debate  on  topics  such  as 
bilingualism  and  intelligence,  the  bilingual  mind  and  second  language 
acquisition  for  children  and  adults.  Perhaps  this  is  why  he  chooses 
to  conclude  the  book  with  "reflection"  rather  than  "conclusion". 
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It  is  only  in  the  chapter  on  bilingual  education,  Hakuta 
suddenly  loses  his  objectivity  and  makes  a  case  for  bilingualism 
without  providing  much  evidence.  As  for  his  second  purpose,  it  is 
obvious  that  he  tries  to  show  that  the  issues  of  bilingualism  are  not 
limited  to  linguistics  or  any  single  field  of  discipline,  but  involve  a 
multitude  of  psychological,  societal,  and  political  realities  (p.  9). 
This  comes  as  a  constant  reminder  in  the  concluding  remarks  of 
most  of  the  chapters.  As  a  result,  one  sees  the  interaction  of 
various  disciplines  in  the  study  of  bilingualism. 

Hakuta's  balancing  act,  or  the  tendency  towards 
"reconciliation"  and  "integration",  reminds  one  of  the  East  Asian 
(Japanese  and  Chinese)  emphases  on  "harmony"  and  the  "middle  path"" 
as  opposed  to  an  absolute  stand.  It  is  no  wonder  that  Howard 
Gardner  (1986:  pp.  36-46)  in  his  critical,  but  at  the  same  time 
insightful  review,  attacks  Hakuta  for  being  "unable  to  reach  any 
strong  conclusions,"  (p.  44  )  and  for  standing  on  a  "bland  middle 
ground"  (p.  44  ).  Gardner's  intolerance  of  Hakuta's  approach  of  a 
"middle"  way  is  apparent  throughout  Gardner's  article. 

Hakuta  was  prompted  to  write  this  book  to  provide  information 
about  bilingualism  that  is  "at  one  scientifically  non-technical  and 
intellectually  sophisticated"  (p  ix).  This  book  is  written  more  as  a 
general  text  than  a  technical  scholarly  work.  The  book  assumes 
little  or  no  prior  knowledge  of  academic  bilingual  issues  on  the 
reader's  part.  In  many  instances,  Hakuta  provides  useful  information 
to  the  lay  reader:  there  are  introductions  to  well  known  linguists, 
e.g.  Benjamin  Lee  Whorf  (p.  74);  explanations  of  linguistic  terms 
like  "morpheme"  (p.  119);  background  information  on  linguistic 
research,  e.g.  studies  in  first  and  second  language  acquisition  (pp 
116-119).  There  is  also  a  good  synopsis  of  current  approaches  to 
the  subject  e.g.  the  neurological  approach  (pp.  85-89);  and  excellent 
summaries  of  Chomsky's  and  Piaget's  views  on  language  and  thought 
(pp.  80-83).  Hakuta's  mastery  of  the  relevant  literature  is  indeed 
impressive.  In  discussing  various  issues,  Hakuta  leads  the  reader 
step  by  step  into  the  discussions  of  bilingual  studies. 

Another  strength  of  this  book  is  the  way  Hakuta  tries  to  help 
readers  see  the  specific  issues  in  a  wider  context.  For  example,  he 
points  out  that  the  psycholinguistic  tests  carried  out  should  not  be 
seen  as  yielding  the  absolute  truth  because  different  ways  of 
testing  give  different  results  (p.  41).  He  also  mentions  that  the 
attitude    towards    bilingualism    is    different    in    America    as    in    a 
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multilingual  society  where  people  are  expected  to  be  universally 
bilingual  (p.  164).  Hakuta's  book  is  also  interesting  to  read  because 
one  could  almost  sense  a  personal  and  emotional  narrator  at  the  back 
of  all  the  discussion. 

Am  I,  then,  fully  satisfied  with  Hakuta's  book?  The  answer  is 
'no':  I  have  several  problems  with  this  book.  First,  Hakuta  in 
choosing  opposite  views  of  the  various  issues  does  not  seem  to  see 
the  necessity  of  explaining  the  reasons  for  his  choices.  For  example, 
in  choosing  to  contrast  the  approach  of  Werner  Leopold  as  a  linguist 
(pp.  45-58)  to  Madorah  smith  as  a  psychologist  (pp.  59-65)  in  his 
chapter  on  'childhood  bilingualism',  I  feel  that  he  owes  readers  an 
explanation  of  why  he  chose  to  compare  specifically  these  two 
instead  of  others,  e.g..  E.  Haugen,  G.  A.  Miller,  and  others.  In  other 
words,  a  broader  survey  of  the  particular  field  of  interest  should 
be  provided  before  he  delves  into  the  comparison  of  two  particular 
individual    researchers. 

Similarly,  in  his  discussion  of  'bilingualism  in  society'  (pp. 
175-185)  he  simply  selects  four  communities:  the  tribes  of  the 
Northwest  Amazon,  the  inhabitants  of  Oberwart  Austria,  the  Puerto 
Rican  community  in  Jersey  City  and  the  bilingual  situation  in  Canada 
to  represent  the  positive  examples  of  bilingual  communities  without 
much  justification  of  his  choices.  One  wonders  why  other  bilingual 
communities,  for  example,  that  of  Asia,  Africa  and  other  parts  of 
the  world  are  not  cited.  Even  if  it  is  due  to  the  limitation  of 
concentration  in  the  subject,  more  explanation  should  have  been 
given.  It  is  important  in  this  kind  of  discussion  to  provide  a  basis  of 
comparison  rather  than  seemingly  citing  four  examples  at  random. 
This  will  not  enable  the  readers  to  make  generalizations  about 
bilingual    communities. 

Hakuta's  way  of  selecting  the  individual  researchers  for 
comparison  and  choosing  the  bilingual  communities  to  discuss 
without  qualifying  why  he  is  doing  so  reminds  one  of  the  way 
ancient  Chinese  scholars  like  Confucius  would  teach  their  followers 
taking  for  granted  the  respect  for  his  authority  of  wisdom  and 
knowledge  over  them.  Is  this  then,  again,  a  difference  in  culture 
causing  a  difference  in  assumption? 

Second,  there  seems  to  be  a  problem  of  proportion  in  the 
treatment  of  subject  matter  on  each  topic.  For  example,  Hakuta  goes 
into  great  detail  in  describing  the  work  of  Werner  Leopold  on  child 
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bilingualism  (pp.  45-58),  citing  many  examples  of  his  descriptions 
of  his  daughter's  acquisition  of  English  and  German.  One  wonders 
why  Hakuta  has  to  go  into  such  length  of  description  when  the  point 
is  made  after  a  couple  of  examples. 

In  contrast,  in  cases  where  one  would  like  to  see  a  clearer 
picture  of  what  he  is  trying  to  drive  at  after  listing  the  opposite 
views,  he  simply  ends  the  chapter  without  any  indication  of 
direction.  For  example,  in  the  chapter  on  'bilingualism  and 
intelligence  (pp.  14-44),  he  leaves  a  dissatisfying  remark  in  the 
conclusion  that  "the  question  of  bilingualism  and  intelligence  of 
whether  they  are  linked  positively  or  negatively,  will  evaporate  in 
the  face  of  deeper  issues  surrounding  both  bilingualism  and 
intelligence"  (p.  43).  Surely,  after  leading  the  reader  through  the 
thick  forest  of  methodological  problems  involved  in  the  relevant 
topic,  he  owes  the  reader  a  guideline,  however  tentative  it  may  be, 
to  evaluate  the  results. 

Related  to  this,  is  the  way  he  discusses  the  chapter  on  'the 
bilingual  mind'  (pp.  73-105).  Hakuta  did  a  good  job  in  giving  a 
synopsis  of  how  this  issue  can  be  investigated  at  the  neurology  level 
(pp.  93-94).  All  these  are  very  fascinating,  however,  he  seems 
content  to  draw  the  conclusion  that  "there  is  no  simple,  one-to-one 
correspondence  between  psychological  variables  and  neurological 
organization."  (p.  102)  Later  in  the  same  section,  he  reminds  us 
again  that  "the  mind  is  a  complex  thing,  and  we  have  restricted  it  to 
an  account  of  knowledge,  omitting  other  domains  such  as  emotions 
and  feelings."  (p.  104)  Much  as  I  feel  that  this  kind  of  reminder  is 
important  so  that  we  will  not  become  "victims  of  our  own  tunnel 
vision,"  (p.  105),  I  also  feel  that  after  having  given  us  a  picture  of 
these  various  aspects  of  research  into  the  bilingual  mind,  Hakuta 
should  at  least  give  a  direction  of  how  these  different  aspects  can 
give  further  enlightenment  to  our  understanding  of  the  bilingual 
mind,  rather  than  simply  concluding  that  we  should  bear  in  mind 
other  factors  like  cultures,  emotions  and  feelings.  It  would  benefit 
readers  more  if  Hakuta  had  gone  beyond  this  familiar  reminder  and 
provided  a  plausible  path  into  understanding  the  mind. 

Third,  Hakuta's  uses  a  variety  of  styles  in  the  writing  of  the 
book.  One  gets  the  feeling  that  Hakuta  is  switching  between  his 
various  intended  audiences,  and,  therefore,  Hakuta  undertakes,  as  if 
were,  'style-shifts'.  At  times,  he  gets  very  descriptive  about  the 
methods  involved  in  a  particular  research  or  the  history  of  a  certain 
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Study  e.g.  the  bilingual  education  in  America  (pp.  139-140).  At  other 
times,  he  tends  to  sound  like  a  storyteller  when  introducing  a 
chapter,  for  example,  when  he  discusses  'how  children  learn  a  second 
language'  (pp.  106-136)  and  similarly  for  'how  children  learn  a 
second  language'  (pp.  137-164)  in  which  he  has  a  section  entitled  'A 
man  named  Alberto'  (p.  139);  in  his  discussion  on  bilingual  societies, 
he  has  the  subtitle:  'A  country  called  Canada'  (p.  183).  There  does 
not  seem  to  be  enough  critical  evaluation  especially  on  issues  where 
one  would  like  to  hear  more  clearly  what  an  expert  would  like  to  say 
e.g.  whether  adults  or  children  learn  a  second  language  better, 
whether  there  is  relation  between  language  and  thought.  It  seems  to 
me  that  Hakuta's  lack  of  commitment  to  a  particular  style  seems  to 
have  given  him  the  convenience  to  avoid  critical  discussions  when  he 
does  not  want  to  commit  himself. 

Although  in  Mirror  of  Language.  Hakuta  attempts  to  review  a 
broad  spectrum  of  debatable  issues  on  bilingualism,  he  seems  to 
have  only  a  particular  readership  in  mind  -  the  Western  reader  or 
even  the  American  reader.  This  is  obvious  throughout  the  book:  In 
'bilingualism  and  intelligence',  he  traces  the  origin  of  the  issue  to 
the  early  American  immigrants  (pp.  16-22);  even  in  broader  topics 
like  'bilingualism  in  society'  (pp.  165-192)  and  'bilingual  education* 
(pp.  193-230),  Hakuta  has  predominantly  discussed  the  United 
States.  Since,  bilingualism  is  not  a  phenomenon  exclusive  to 
America,  Hakuta  should  give  an  explanation  for  the  limited  scope  of 
his  discussion.  Moreover,  since  it  is  a  book  for  general  reading, 
Hakuta  could  do  well  in  providing  some  references  to  the  major 
bilingual  communities  in  the  world  that  have  been  widely  discussed. 
For  example,  the  works  of  Bokamba(1976),  Kachru(1982,  1983), 
Noss(1982),  Platt(1980)  and  others  (see  references).  It  is 
interesting  to  note  that  out  of  281  items  on  his  references,  only  one 
deals  with  Africa  and  none  with  Asian  countries. 

Despite  the  shortcomings  mentioned  above.  Mirror  of  Language 
is  still,  to  me,  a  valuable  source  of  information  for  several 
fascinating  topics  on  bilingualism,  for  example,  the  bilingual  mind, 
how  children  and  adults  learn  a  second  language.  Hakuta  does  a  good 
job  in  summarizing  and  synthesizing  the  important  issues  involved 
in  these  areas  of  research  and  presents  them  in  a  somewhat  personal 
manner  that  makes  it  easy  to  read.  It  is  an  interesting  book  to 
recommend  for  classroom  purposes  as  a  survey  of  current  research 
issues  on  bilingualism.  The  general  problem  of  this  book  seems  to 
be  the   ambiguity   of   his   intentions:   at  times,   he  does   not  provide 
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adequate  discussions,   at  other  times,   he  dwells  too   much  on   the 

descriptions.      However,   any   one   interested   in   bilingualism  will   do 

well  to  read  it.    Mirror  of   Language  may  be  a  good  example  of  how 

difficult  it  is  to  write  an   introduction    on    this    important  cross- 
cultural    topic. 
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THE  REPRESENTATION  OF  DIPHTHONGS  IN  SPANISH' 
Maria  Carreira 


On  the  surface,  Spanish  appears  to  have 
both  falling  and  rising  diphthongs.   This 
paper  will  argue  that  only  falling 
diphthongs  exist  as  primitives  while  rising 
diphthongs  are  derived  via  a  rule  of 
contraction.   The  analysis  presented  will  be 
defended  on  two  counts:  1)   it  will  be  shown 
to  account  for  certain  stress  properties  of 
word-final  diphthongs  that  have  up  to  now 
escaped  explanation,  and  2)  it  will  be 
argued  that  a  phonological  system  such  as 
that  of  Spanish  which  on  the  surface  appears 
to  have  both  types  of  diphthongs  poses  a 
serious  problem  for  the  acquisition  of 
diphthongs  unless  one  type  of  diphthong  is 
assumed  to  be  derived.  We  will  conclude  that 
a  constraint  that  rules  out  more  than  one 
diphthong  type  in  a  language  like  Spanish 
is  a  desirable  outcome  from  the  point  of  view 
of  learnability . 


I.   THE  DIPHTONGS  OF  SPANISH 

A.   On  the  surface,  Spanish  appears  to  have  both  rising  and 
falling  diphthongs, . 


vic.to.rya  "victory" 

pyo.ho  "flea" 

a.gwa  "water" 

cwo.ta  "quota" 


boy .na 
gay . ta 
dew. da 


'beret" 

'bagpipe" 

'debt" 


pa. ryen. te 
pwer . ta 
a.gwi .ta 


pey.ne 
aw. la 
naw. ti .CO 


"relative' 
"door" 
"water" 
(dim.) 

'comb" 

'classroom' 

'nautical" 


As  the  following  examples  suggest,  the  syllabicity  of 
both  prevocalic  and  postvocalic  [+high,  -cons]  segments  is 
not  predictable  from  context.   This  difference  must  be 


stipulated  somehow  and  memorized  as  such  by  the  language 
learner. 

2) 


Ma.ri.a  v.   Ma.ryo 
name        name 

o .  i' .  do 
"hearing" 

b. 

V. 

boy.na 
"beret" 

cri.o.llo  V.  pyo.ho 
"Creole"      "flea" 

a.un 
"even" 

V. 

aw.  la 
"classroom' 

Previous  analyses  (Harris  1983,  Morgan  1985)  have  argued 
that  both  types  of  diphthongs  contain  the  [+high,  -cons] 
segment  in  rhyme  position.  Rising  and  falling  diphthongs 
are  represented  as  in  (3a)  and  (3b),  respectively. 

3)        a.  b. 

(C)  G  V  (C)  V  G 

I    \l  IK 

OR  OR 

Harris  (1983)  cites  two  arguments  in  support  of  a 
representation  like  (3a)  for  prevocalic  glides.  The  lack 
of  co-occurrence  restrictions  between  prevocalic  glides  and 
onsets  (ie.  prye.to,  trwe.no,  plye.ge...)  suggests  that 
prevocalic  glides  do  not  appear  in  onset  position.  The  fact 
that  prevocalic  glides,  like  closed  syllables,  block 
antepenultimate  stress  when  in  penultimate  position  (ie. 
*en.fer.mo,  *tro.pje.zo)  argues  in  favor  of  a 
representation  that  includes  the  glide  in  rhyme  position. 

Post-vocalic  glides  are  assigned  a  structure  identical 
to  that  posited  for  syllable-final  consonants  because  they 
pattern  like  such  consonants  in  two  ways:  a)  they  do  not 
allow  syllabification  of  an  additional  consonant  in  the 
same  coda,  (ie.  *es . culp. tor,  *dewn.ta)  and  b)  like 
syllable-final  consonants,  they  block  antepenultimate 
stress  when  in  the  penultimate  syllable  of  a  word 
(*u.ni . for .me,  *a.plaw.so). 

Thus,  the  fact  that  both  types  of  diphthongs  block 
antepenultimate  stress  when  in  penultimate  position  is 
captured  in  this  analysis  by  the  presence  of  a  branching 
rhyme  in  (3a)  and  (3b) . 

However,  while  both  prevocalic  and  postvocalic  glides 
behave  alike  with  respect  to  stress  when  in  the  penultimate 
syllable  of  a  word,  they  exhibit  very  different  properties 
when  in  word-final  position.  Nouns  and  adjectives  with 
final  syllables  containing  a  rising  diphthong  always 
receive  penultimate  stress  (ie.  far.ma.cya,  pur .ga . to . ryo) . 
Falling  diphthongs  in  the  final  syllable  of  a  word  always 


4) 

a. 

mwer.te 

"death" 

byen 

"good" 

fyel 

"loyal" 

syes.ta 

"nap" 

attract  stress  (ie.  ma.mey,  com. boy) . 

The  representations  in  3a)  and  3b)  fail  to  account  for 
these  stress  properties  of  diphthongs.  If  rising 
diphthongs  have  a  branching  rhyme  why  don't  they  attract 
stress  when  in  word-final  position?  That  is,  why  don't 
they  bear  a  similar  stress  pattern  to  other  words  that  end 
in  syllables  with  branching  rhymes  (ie.  mamey,  carbon)? 

Another  difference  between  falling  and  rising 
diphthongs  concerns  the  number  of  tautosyllabic  segments 
that  they  can  be  followed  by.  Rising  diphthongs  allow 
postvocalic  consonants  (as  in  4a)  while  falling  diphthongs 
do  not  (4b) . 

b. 

*boyn.ta 
*pawr .ta 
*cewl .ta 
*cews.ta 

In  order  to  account  for  this,  Harris  (1983) 
differentiates  between  falling  and  rising  diphthongs  by 
means  of  the  representation  in  (5) .  Prevocalic  glides  are 
immediately  dominated  by  the  maximal  projection  of  the 
rhyme  (R").  Vowels  and  postvocalic  glides  form  a 
constituent  that  is  dominated  by  R'  . 

5) 

[-cons 


[-syll] 


The  impossibility  of  forms  like  (4b)  is  explained  by 
the  fact  postvocalic  glides  and  consonants  occupy  the  same 
position  in  the  rhyme.  It  follows  from  this  that  falling 
diphthongs  will  block  the  incorporation  of  any  additional 
segment  following  the  glide.  Prevocalic  glides,  on  the 
other  hand,  do  not  have  this  effect  because  they  do  not 
increase  the  size  of  the  constituent  that  contains  the 
vowel  and  the  postvocalic  segment. 

While  a  representation  such  as  (5)  accounts  for  the 
facts  in  (4),  it  presents  a  problem  for  the  stress 
properties  of  glides  in  word-final  position.  These 
properties  are  summarized  below. 

6)    a.  Prevocalic  glides  in  the  final  syllable  of  a  word 
always  cause  stress  to  fall  on  the  penultimate 


4 
syllable  (ie.   far.ma.cja,  pur .ga. to .ryo) . 

b.   Postvocalic  glides,   on  the  other  hand,   always 
receive  stress  (ie.  ma.mey,  con. boy,  A. brew) . 

The  stipulations  that  a  representation  such  as  (5) 
force  us  to  make  in  order  to  account  for  the  above  facts 
are  void  of  explanatory  power.  For  (6a)  we  are  forced  to 
say  that  word-final  vowels  preceded  by  glides  are  always 
extrametrical .  Otherwise,  stress  would  incorrectly  fall  on 
the  final  syllable  of  words  with  a  final  rising  diphthong. 
For  (6b)  we  must  stipulate  that  word-final  glides  can  never 
be  extrametrical.  This  will  assure  that  stress  will  always 
fall  on  the  final  syllable. 

The  generalization  made  for  postvocalic  glides  cannot 
be  extended  to  postvocalic  consonants  since  they  don't 
always  cause  stress  to  fall  in  the  final  syllable. 
6c) 

ar.bo(l)        "tree"         na.)ca(r)     "mother  of 
diag.no''. si  (s)   "diagnosis"  pearl" 

To  summarize,  the  accepted  representation  of 
diphthongs  shown  in  (5)  accounts  for  the  fact  that  both 
prevocalic  and  postvocalic  glides  block  antepenultimate 
stress  when  in  the  penultimate  syllable  of  a  word.  However, 
such  a  representation  fails  to  explain  why  the  two  types  of 
diphthongs  behave  so  differently  with  respect  to  stress 
when  in  word-final  position. 

A  second  difference  between  these  diphthongs  concerns 
the  impossibility  of  adjoining  a  final  consonant  to 
syllables  with  postvocalic  glides.  No  such  restriction 
holds  for  syllables  with  prevocalic  glides.  A 
representation  such  as  (5)  allows  us  to  account  for  these 
facts  while  maintaining  the  hypothesis  that  both  prevocalic 
and  postvocalic  glides  appear  in  rhyme  position  at  all 
times  in  the  derivation.  However,  (5)  leads  to  ad  hoc 
stipulations  with  regards  to  the  stress  pattern  of 
word-final  diphthongs. 

The  hypothesis  that  both  prevocalic  and  postvocalic 
glides  are  part  of  the  underlying  inventory  of  syllable 
types  in  Spanish  not  only  leads  to  costly  stipulations  for 
stress,  but  it  also  represents  a  highly  marked  option  in 
Universal  Grammar.  Generally,  languages  exhibit  one  type  of 
diphthong,  but  not  both.  For  example,  Slovak,  which  on  the 
surface  appears  to  have  both  falling  and  rising  diphthongs, 
has  been  convincingly  shown  by  Kenstowicz  and  Rubach  (1987) 
to  derive  its  rising  diphthongs  from  underlyingly  long 
vowels . 


In  the  next  section  we  will  argue  that  rising 
diphthongs  are  always  derived  in  Spanish.  Falling 
diphthongs,  on  the  other  hand,  must  be  specified  in  the 
lexicon.  With  this  we  eliminate  the  undesirable  situation 
of  having  two  types  of  diphthongs  in  the  same  language.  In 
addition,  such  an  analysis  will  allow  us  to  account  for  all 
the  stress  properties  of  glides  as  well  as  for  the 
conditions  on  syllable-final  consonants  in  syllables  with 
postvocalic  glides. 


II.   THE  STATUS  OF  PREVOCALIC  GLIDES 

Steriade  (1984)  posits  a  rule  of  syllabification  that 
accounts  for  the  distribution  of  high  vowels  and  glides  in 
Romanian.   Such  a  rule  is  stated  as  follows: 

(7)   The  CV  rule:  (C)V  >   (C)V 

I  I 
0  R 

Where  C=[-syll]  or  [^-syll],  V=[+syll]  or  [esyll], 
0=Onset,  R=Rime;  operates  left-to-right. 

An  assumption  of  this  analysis  is  that  in  Romanian 
some  [+high,  -cons]  segments  are  unspecified  for  the 
feature  syllabic  in  their  underlying  representation.  Being 
[Osyll],  the  CV  rule  will  syllabify  them  either  as  onsets 
or  syllble  heads,  depending  on  the  context.  [+high, -cons] 
segments  are  assigned  to  onset  position  under  two 
conditions:  a)  if  the  [+high]  segment  is  in  word-initial 
position  and  is  followed  by  a  [+syll]  segment,  or  b)  if  the 
[+high]  segment  appears  between  two  [+syll]  segments.  In 
all  other  cases  such  segments  are  organized  as  syllable 
heads . 

The  CV  rule,  in  conjunction  with  a  rule  of  contraction 
and  principles  of  extrametricality  account  for  the 
difference  between  the  Romanian  studyu  and  skatiw.  In  this 
analysis,  the  syllabification  contrast  between  these  two 
items  is  attributed  to  a  difference  in  stress  pattern. 

As  is  the  case  with  Spanish,  glide-final  words  in 
Romanian  always  receive  final  stress  while  those  with  a 
rising  diphthong  in  word-final  position  always  bear 
penultimate  stress.  Steriade  suggests  that  this  difference 
is  ^ due  to  the  fact  that  the  final  segment  in  words  like 
studyu  is  extrametrical  while  in  words  like  skatiw  the 
final  [+high,  -cons]  segment  counts  for  stress.  Given  this 
assumption,   the   derivation   of   each   word  proceeds   as 


follows.  The  contraction  rule  is  first  illustrated  in  (8) . 

(8)   Contraction 

0   R        R  OR 

•11  I      A 

Condition:                                      C   X        X  >      C   X  X 

'I  I  I 

i  is  unstressed            i   u  i  u 

s  t  u  d  i    (u)  s   k  a  t   i  u 

CV  rule  ORORR  ORORR 

Stress  studi(u)  skatiu 

Contraction  studyu  skatiu 

Two  additional  rules  derive  the  final  forms  (studyu, 
skatiw^) .  A  rule  of  complex  onset  formation  adjoins  the 
initial  I  si  in  both  words  to  the  first  syllable.  A  high 
vowel  desyllabification  (HVD)  rule  converts  a  [+high] 
segment  into  a  postvocalic  glide  when  it  appears  at  the  end 
of  a  word.   HVD  yields  skatiw^  from  skati .u. 

The  analysis  proposed  for  Romanian  can  be  altered  to 
account  for  the  difference  between  Maryo/Marfa.  Let  us 
suppose  that  the  contraction  rule  is  modified  so  as  to 
apply  between  an  unstressed  high  vowel  and  any  onsetless 
syllable  that  follows.  Whether  the  [+high,  -cons]  segment 
is  incorporated  into  onset  position  or  into  the  following 
rhyme  will  not  concern  us  now.  We  will  return  to  this 
question  in  section  IV. 

If  we  maintain  the  assumption  that  some  [+high, 
-cons]  segments  are  unspecified  for  the  feature  syllabic 
and  allow  the  final  segment  in  M^ryo  to  be  extrametrical 
we^  can  account  for  the  syllabification  difference  between 
Maryo  and  Marfa.  The  derivation  proceeds  as  follows. 


(9)   Contraction  (revised) 


0  R 

1  1 

Condition:  [+high]  is 
unstressed 

C    X 
1 

[+high] 

M  a  r  i 
CV  rule           0  R  0  R 

(o) 
R 

Stress            M  a'  r  i 
Contraction       Mary 

(o) 
o 

0  R 

<        /\ 

C      X        X 
I 
[+high] 


Maria 
ORORR 


Maria 


with  this  we  explain  the  stress  properties  of 
word-final  rising  diphthongs.  Maryo  and  Maria  exhibit 
the  only  two  possible  stress  patterns  for  word-final 
sequences  of  a  [+high,  -cons]  segrment  and  a  vowel.  The 
final  syllable  in  Ma.ri.o  (before  contraction)  does  not 
receive  stress  because  it  is  extrametrical .  Consequently, 
stress  falls  on  the  antepenultimate  syllable  which 
becomes  the  penultimate  syllable  after  contraction 
(Ma.ri.o  — >  Ma.ryo) .  In  Ma.ri.a  the  final  syllable  will 
not  be  stressed  because  it  is  light.  It  follows  that  the 
penultimate  syllable  will  receive  stress,  bleeding 
Contraction  (Ma.ri.a) . 

Thus,  final  stress  is  never  an  option  for  word-final 
rising  diphthongs  because  they  start  out  as  two  separate 
syllables,  the  last  of  which  is  light.  Since  word-final 
light  syllables  in  Spanish  do  not  normally  bear  stress  it 
follows  that  the  final  vowel  in  words  that  end  in  a 
rising  diphthong  will  not  receive  stress  either. 

Antepenultimate  stress  is  impossible  when  a  rising 
diphthong  appears  in  either  final  or  penultimate  position 
because  before  the  operation  of  contraction  the  [+high] 
segment  constitutes  a  separate  syllable.  What  appears  to 
be  the  antepenultimate  syllable  then,  is  really  the 
fourth-to-last  syllable  at  the  time  when  stress 
assignment  takes  place.  Since  pre-antepenultimate  stress 
is  not  possible  in  Spanish,  it  follows  that  rising 
diphthongs  in  penultimate  position  will  appear  to  block 
stress.  This  is  illustrated  below. 

/t   r  a  v  i  e   s  o/  /n  o  t   i   c  i    (a) / 

CV  rule  ORORROR  ORORORR 

Stress  travieso  notici(a) 

Contraction        travyeso  noticy      a 

With  this  analysis  we  eliminate  the  need  to 
distinguish  prevocalic  glides  from  high  vowels  that 
constitute  a  syllable  peak.  [+high]  segments  that  do  not 
receive  stress  are  turned  into  glides  if  they  immediately 
precede  an  onsetless  syllable. 

This  representation  of  diphthongs  finds  support  in 
certain  morphological  processes  that  alter  the  stress 
pattern  of  the  root.  Consider  the  following  words.  The 
underived  forms  in  lOA  bear  stress  on  the  initial 
syllable.  Suffixation  causes  stress  to  shift  one 
syllable  to  the  right. 


(10) 


lu.  na 

"moon" 

—  > 

lu.na'.ti  .CO 

rd.sa 

"rose" 

—  > 

ro.  sa'.  ryo 

ca'.sa 

"house" 

—  > 

ca.sc5'.na 

"lunatic" 
"rosary" 
"big  house' 


ma.ni.a  "obsession"  -> 
di.a  "day"  — > 
Ma.ri.a   name      — > 


ma.nya. ti . co 
dya'.ryo 
Ma . ryo.na 


"maniac" 

"diary" 
augmentative 

of  Maria 


The  importance  of  the  above  facts  for  our  analysis 

lies  in  the  fact  that  syllable  peaks  consisting  of  a 

[+high,  -cons]  segment  undergo  glide  formation  after 
suffixation  causes  the  stress  to  shift  away  from  them 

(lOB)  ,    Thus,   the  process   captured  by   our   rule   of 

contraction  finds  independent  motivation  in  the  phonology 
of  Spanish. 

Harris  (1983)  discusses  an  analysis  which  derives 
all  glides  from  unstressed  [+high]  vowels  but  rejects  it 
based  on  the  following  considerations: 

1)    While   such   an   analysis   accounts   for   the   stress 

pattern   of  words   with  rising  diphthongs   in  the   final 

syllable,   it  fails  to  make  the  right  predictions   for 

glide-final  words.   That  is,  if  word-final  glides  start 

out  in  a  separate  syllable  and  undergo  stress  assignment 

before  contraction  then  two  stress  patterns  should  be 

attested: 

a.    If   the   final   segment   does   not   count   as 

extrametrical,   then  the  penultimate  syllable   (which 

will  become  the   final   syllable   after   contraction) 

will  bear  stress  (ie.  con. bo. i  ->  con. boy). 

b.     Alternatively,    the    penultimate    syllable 

(penultimate   after   contraction,   antepenultimate 

underlyingly)   could  bear  stress  as  a  result   of 

extrametricality    of   the    final    segment    (ie. 

con.bo.(i)).    However,   as  we  have  seen,   this  last 

option   is   never   attested   since   all   glide-final 

words   bear   final   stress.   2)   A   second  argument 

mentioned   by   Harris   against   an   analysis   that 

derives  all  glides   from  a  rule  of  contraction  is 

based   on   the   stress   pattern   of   words   like 

na^^.  f  ra  .  go   and   far  .ma  .  cew.  ti  .  co  .   If   postvocalic 

glides   start   out   in   separate   syllables   then   the 

above  mentioned  words  would  be  receiving  stress  on 

the   fourth-to-   last   syllable,   an   option   that   we 

have   rejected   for   Spanish   (ie.   /na".  u  .  f  ra  .  go/ , 


/far .ma . ce . u . ti . co/)  .  In  order  to  account  for 
surface  antepenultimate  stress  on  syllables  with 
postvocalic  glides  we  must  assume  that  the  glides 
do  not  start  out  in  separate  syllables. 

While  Harris'  points  are  well  taken  with  regards  to 
postvocalic  glides,  they  have  nothing  to  say  about 
prevocalic  glides.  They  suggest  that  postvocalic  glides 
must  start  out  in  the  same  syllable  as  the  preceding 
vowel.  However,  none  the  above  arguments  apply  to 
prevocalic  glides.  As  we  have  already  seen,  the  stress 
pattern  of  prevocalic  glides  is  no  problem  for  an 
analysis  that  derives  all  such  segments  from  a 
contraction  rule  (see  the  derivation  in  (9)).  The 
existence  of  words  with  stress  on  antepenultimate 
syllables  with  prevocalic  glides  (ie.  cja^.ti  .ca)  does  not 
constitute  an  argument  against  such  an  analysis  since 
stress  falls  on  the  third-to-last  syllable  regardless  of 
the  application  of  the  rule  of  contraction. 

To  summarize,  the  analysis  presented  here  eliminates 
the  need  to  have  prevocalic  glides  in  the  inventory  of 
diphthong  types  in  Spanish.  Only  falling  diphthongs 
appear  to  need  specification.  The  underlying 
representation  of  the  glides  in  such  diphthongs  remains 
to  be  investigated.  While  it  is  unclear  whether  such 
segments  should  be  differentiated  from  vowels  via  a 
feature  specification  or  by  being  syllabified  in  the 
underlying  representation,  it  is  nonetheless  clear  that 
they  should  be  singled  out  from  other  [+high]  segments. 

The  facts  presented  for  Spanish  extend  to  Portuguese 
and  Italian  with  minor  modifications  that  do  not  affect 
the  spirit  of  the  analysis.  Both  of  these  languages, 
like  Spanish,  appear  to  have  rising  and  falling 
diphthongs.  Moreover,  these  diphthongs  exhibit  the  same 
asymmetry  with  respect  to  stress  when  in  the  final 
syllable  of  a  word. 

For  these  languages  too,  we  could  claim  that 
prevocalic  glides  are  derived  via  a  contraction  rule 
while  postvocalic  glides  are  primitives.  This  simplifies 
the  inventory  of  diphthongs  in  these  languages  and 
suggests  that  a  claim  that  restricts  language-  particular 
syllabification  rules  to  allow  only  one  type  of  diphthong 
might  be  tenable.  As  we  will  see  in  the  next  section, 
this  is  a  significant  step  for  purposes  of  acquisition. 

Before  addressing  the  area  of  learnability,  however, 
there  is  one  final  issue  regarding  the  syllabification 
properties  of  [+high,  -cons]  segments  that  needs  to  be 


12) 

a. 
/i  e  m  a/ 
I  1  1  1 

"egg  yoke" 

b. 

/o  i  a/ 

CV  rule 

1  1  1  1 

0  R  0  R 
[ye. ma] 

1  1  1 
R  0  R 
[o.ya] 
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addressed.  An  unstated  assumption  of  our  analysis  has 
been  that  the  CV  rule  (see  7)  correctly  syllabifies  all 
[+high,  -cons]  segments.  The  left-to-right  application  of 
the  rule  guarantees  that  [+high]  segments  will  surface  in 
onset  position  under  two  conditions:  a)  at  the  beginning 
of  a  word  if  followed  by  a  vowel  or  by  another  [+high] 
segment,  and  b)  when  the  [+high]  segment  is  in 
intervocalic  position.  These  cases  are  illustrated 
bellow: 


"pot" 


However,  there  are  two  potential  problems  that 
threaten  the  validity  of  the  CV  rule  as  a  sufficient 
principle  in  accounting  for  the  syllabification 
properties  of  [+high]  segments.  In  Carreira  (1988)  I 
argue  that  lexical  items  that  are  spelled  with  hie  (the 
-h  is  silent)  must  be  distinguished  from  those  spelled 
with  Y£  i^  3  number  of  phonological  processes.  At  the 
segmental  level,  they  both  consist  of  a  [+high,  -back, 
-cons]  segment  followed  by  the  vowel  [e].  The 
organization  of  these  segments  at  the  syllable  level 
however,  appears  to  differ  depending  on  the  orthography. 
The  [+high,  -cons]  segment  in  words  spelled  with  ^ 
behave  like  onset  segments  with  respect  to  a  number  of 
processes.  These  pose  no  problem  for  the  CV  rule  since 
the  rule  predicts  that  they  should  appear  in  onset 
position  (as  in  12a) . 

The  forms  spelled  with  hie  however,  appear  to  have 
the  glide  in  rhyme  position.  Given  our  hypothesis  that 
prevocalic  glides  that  are  not  in  onset  position  are 
always  derived  from  a  contraction  rule,  this  would  force 
us  to  conclude  that  these  forms  start  out  in  two  separate 
syllables  and  undergo  contraction  (ie.  i.er.ba  --> 
ier.ba)  .  However,  if  the  first  segment  is  [+high]  the 
CV  rule  will  adjoin  it  to  the  onset.  Crucially,  it  will 
not  syllabify  it  as  a  rhyme.  The  CV  rule  will  assign 
identical  structures  to  forms  spelled  with  hie  as  well  as 
to  those  that  start  with  ye^.  This  is  illustrated  in 
(13)  . 

(13)    [hierba]  *i  e  r  b  a 

CV  rule     OR   OR 

What  we  need  to  distinguish  between  words  spelled 
with  hi  and  those  spelled  with  y£  is  to  have  the  former 
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in  the  rhyme  of  the  syllable  and  the  latter  in  onset 
position.  However,  if  the  derivation  proceeds  as  in  (13) 
then  we  have  no  way  of  distinguishing  between  words  like 
hierba  and  yeso  in  terms  of  the  organization  of  the 
segmental  slots  at  the  syllable  level. 

A  promising  way  of  distinguishing  between  these 
words  has  been  proposed  by  Harris  (1985)  .  The  claim  of 
this  analysis  is  that  lexical  items  spelled  with  hie  have 
an  underlying  representation  that  consists  of  the  segment 
/e/  followed  by  a  s)celetal  slot  without  segmental 
content.  Such  words  are  assigned  the  representation  in 
14a: 

14)    a.    e  V  b.     e  e  b.   i  e 

XX  XX  XX 

The  steps  that  map  the  underlying  representation  in 
(14a)  into  a  representation  that  consists  of  a  prevocalic 
glide  and  a  vowel  will  analyzed  later  in  this  paper. 
Suffice  it  to  say  at  this  point  that  the  empty  V-slot 
will  receive  the  features  of  the  vowel  [e]  (14b)  ,  and 
subsequently  undergo  diphthongization  (14c) .  Adopting 
this  solution  into  the  proposed  analysis,  we  can  see  that 
a  representation  such  as  14a  will  be  syllabified  by  the 
CV  rule  as  two  separate  rhymes.  This  is  exactly  what  we 
want  since  the  only  way  to  get  the  initial  segment  in  a 
position  where  it  can  undergo  Contraction  is  to  have  in  a 
separate  syllable.  There  is  independent  evidence  that  in 
Spanish  the  sequence  "e  e"  is  simplified  to  "i  e".^  If 
this  applies  to  the  structure  in  (14b)  the  result  will  be 
a  representation  that  can  undergo  contraction.  The 
derivation  proceeds  as  follows: 
(15) 
e  e  — >  e  e  — >  i  e  — >  y  e 
XX   CV  rule   x  x        x  x  contraction  x  x 

R    R  R  R 

To  summarize,  the  threat  posed  to  the  CV  rule  by  the  need 
to  distinguish  between  word  initial  -ye  sequences  that 
include  the  glide  in  the  onset  from  those  that  assign  the 
glide  to  rhyme  position,  can  be  eliminated  if  we  adopt  an 
analysis  li)ce  Harris  (1985)  . 

A  more  serious  problem  for  the  CV  rule  concerns  the 
existence  of  a  small  number  of  words  that  have  intervocalic 
[+high]  syllable  heads  (ie.  [bo.i.o]  (spelled  bohio)  "hut", 
[ba.i.a]  (sp.  bahia)  "bay").  This  is  a  threat  to  the 
validity  of  the  CV  rule  because,  as  illustrated  in  12b, 
intervocalic  segments  that  bear  the  feature  [+high]  should 
appear  in  onset  position.   That  is,  the  CV  rule  incorrectly 
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predicts   that   words   like   bohio   and   bahia   will   be 
syllabified  as  *bo.yo  and  *ba.ya. 

One  possible  solution  to  this  problem  is  to  allow  some 
[+high]  segments  to  be  specified  as  [+syll]  in  the 
underlying  representation.  This  would  account  for  the 
possibility  of  intervocalic  [+high]  syllable  heads  since 
the  CV  rule  would  not  be  able  to  incorporate  these  segments 
into  onset  position.  However,  this  implies  that  a  three-way 
distinction  in  the  representation  of  [+high]  segments  must 
be  allowed: 

a)  [+high]  segments  specified  as  [-syll]  become  falling 
diphthongs  (ie.  naw.fra.go). 

b)  [+highj   segments   specified  as   [+syll]   are   always 
syllable  heads   (ie.  bo.i.o). 

c)  [+high]  segments  that  are  unspecified  for  the  feature 
syllabic  are  assigned  sylllable  structure  via  the  CV  rule. 

If  a  three-way  distinction  turns  out  to  be  necessary 
in  the  specification  of  [+high]  segments,  then  this  would 
constitute  an  argument  in  favor  of  Steriade's  (1987)  view 
of  underspecification.  Crucially,  it  would  suggest  that 
the  D-value  [+/-syll]  for  the  feature  [+high]  is  not  absent 
from  the  underlying  representation. 

A  second  and  perhaps  more  promising  solution  is  to 
posit  an  underlying  representation  for  words  like  bo (h) io 
and  ba  (h) ia  that  contains  an  empty  x-slot  before  the 
[+high]  segment.  The  existence  of  this  segment  is 
reflected  in  the  orthography  and  is  further  justified  by 
the  historical  fact  that  many  words  spelled  with  an  [h] 
were  once  pronounced  with  an  aspirated  consonant. 

Given  this  representation,  the  CV  rule  will  assign  the 
[+high]  segment  to  rhyme  position  and  the  unspecified  slot 
to  its  onset,  yielding  the  correct  surface  form. 

16)      baia  boio 

xxxxx  xxxxx 

ORORR  ORORO 

The  viability  of  this  solution  hinges  on  several 
things.  First  of  all,  the  existence  of  an  extra  segmental 
slot  that  receives  no  segmental  features  has  been  justified 
by  both  the  orthography  and  the  historical  facts.  We 
predict,  therefore,  that  intervocalic  [+high]  segments  will 
be  syllabified  as  syllable  onsets  (ie.  V.GV)  unless  there 
is  an  orthographic  -h.  Consequently,  if  we  find  a 
significant  number  of  words  that  bear  no  -h  and  are  not 
syllabified  as  V.GV  we  will  be  forced  to  abandon  this 
solution. 
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A  second  consideration  concerns  the  interaction  of 
this  solution  with  the  representation  proposed  in  Harris 
(1985)  for  words  that  show  vowel/glide  alternations.  As  we 
have  previously  mentioned,  the  structure  assigned  to  such 
words  consists  of  an  x-slot  that  bears  the  features  of  a 
vowel  followed  by  an  empty  slot.  A  rule  of 
diphthongization  associates  the  feature  [-consonantal]  to 
this  segment.  Subsequently,  a  default  rule  assigns  the 
features  of  the  epenthetic  vowel  (  [e] )  to  this  position. 
Finally,  the  leftmost  vowel  is  turned  into  a  glide,  forming 
a  rising  diphthong  with  the  following  segment.  The 
derivation  is  illustrated  below. 
(17) 

e       — >   e  V     — >   e   e     — >    i   e 

I  11  II  II 

X  X   diph.   X  X  default   x  x   gliding  x  x 

Crucially,  the  process  outlined  in  (17)  relies  on  the 
assumption  that  empty  x-slots  will  be  assigned  the  feature 
[-consonantal]  (abbreviated  as  "V"  above)  and  subsequently 
undergo  the  default  rule.  However,  such  an  assumption  is  in 
conflict  with  the  solution  we  have  proposed  for  words  like 
bohio  and  bahia.  The  empty  x-slot  in  such  words  must  not 
undergo  the  rule  of  default  (ie.  *boeio,  *baeia)  .  This 
position  then,  must  somehow  be  assigned  the  value  [+cons] . 
This  is  the  only  way  to  insure  that  the  default  rule  will 
not  apply. 

To  restate  problem  at  hand,  Harris'  account  of 
segments  that  show  vowel/glide  alternations  proves  to  be 
important  to  our  analysis  because  it  allows  us  to  explain 
the  difference  in  syllabification  between  words  li)ce  yema 
and  hierro.  However,  the  same  account  proves  to  be 
problematic  for  our  analysis  if  we  assume  that  words  like 
bohio  are  syllabified  as  three  syllables  because  of  an 
extra  segmental  slot  before  the  [+high]  segment.  Following 
the  logic  of  Harris'  account,  we  incorrectly  predict  that 
the  empty  x-slot  in  such  words  should  undergo 
diphthongization  and  subsequently  the  default  rule.  What 
we  need  then,  is  a  way  to  allow  the  default  rule  to  apply 
to  words  like  hierro  but  to  be  blocked  in  the  case  of 
bohio. 

The  CV  rule  provides  such  a  way.  Suppose  that  empty 
slots  are  parsed  by  the  CV  rule  as  segments  that  are 
unspecified  for  the  feature  syllabic.  In  other  words, 
suppose  that  empty  slots  are  treated  the  same  as  [+high, 
-cons]  segments.  When  in  intervocalic  position  such 
segments  will  be  assigned  to  the  onset.  This  is  what 
happens  with  bo.hi.o.   Otherwise,   they  will  appear  in 
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nuclear  position  as  in  hierro. 

(18)  a.  b. 

b  o   i  o  e   rro 

xxxxx  xxxx 

ORORO  RROR 

The  default  rule  will  fail  to  apply  to  the 
representation  in  (18a)  because  the  segment  [e]  cannot 
appear  in  onset  position.  Such  a  rule  will  be  applicable  in 
(18b)  since  the  empty  segmental  slot  will  be  in  nuclear 
position.  The  created  hiatus  in  (18b)  is  resolved  by  a 
process  which  raises  the  leftmost  member  of  a  two  vowel 
sequence.  Subsequently,  Contraction  applies  yielding  a 
rising  diphthong. 

(18b' )  e    rr  o 

XXX    X 

default     e  e  rr  o 

raising     i  e  rr  o 

contraction   y  e  rr  o 

The  CV  rule  eliminates  the  need  to  include  a  rule 
which  assigns  the  feature  [-consonantal]  to  an  empty 
skeletal  position.  The  syllabification  algorithm  correctly 
predicts  that  the  empty  skeletal  position  in  words  that 
show  the  vowel/glide  alternation  will  appear  in  nuclear 
position.  In  addition,  it  yields  the  right  results  for 
words  like  bohio  and  bahia. 

Thus,  the  potential  problems  raised  by  such  words  are 
eliminated  if  we  posit  an  empty  skeletal  position  before 
the  [+high]  segment  and  allow  the  CV  rule  to  treat  this 
slot  like  any  other  segment  that  is  unspecified  for  the 
feature  syllabic. 

Before  concluding  this  section  and  moving  on  to  the 
issue  of  learnability  let  us  summarize  the  findings  made 
here.  Steriade's  CV  rule  goes  a  long  way  in  accounting  for 
the  syllabic  properties  of  [+high,  -cons]  segments  in 
Spanish.  It  allows  us  to  derive  rising  diphthongs  via  a 
rule  of  contraction.  In  doing  so,  we  eliminate  the  need  to 
include  both  rising  and  falling  diphthongs  in  the 
underlying  inventory  of  syllable  types  in  Spanish.  Falling 
diphthongs,  as  we  have  seen,  cannot  be  derived  via  a 
contraction  rule.  The  fact  that  stress  is  possible  in 
antepenultimate  syllables  with  falling  diphthongs 
(naw.fra.go)  suggests  that  these  do  not  start  out  as 
separate  syllables. 

The  CV  rule  faces  a  potential  problem  from  words  with 
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intervocalic  [+high]  syllable  heads  since  it  predicts  that 
all  such  segments  should  surface  as  syllable  onsets.  The 
problems  posed  by  forms  like  bohio  and  bahia  however, 
appear  to  have  a  number  possible  viable  solutions. 
Finally,  we  have  sketched  how  the  analysis  presented  could 
extend  to  Portuguese  and  Italian. 


III.   THE  ACQUISITION  OF  [+HIGH,  -CONS]  SEGMENTS 

In  this  section  I  will  discuss  why  a  phonological 
system  such  as  that  of  Spanish  poses  a  serious  problem  for 
the  acquisition  of  prevocalic  glides  if  we  assume  that  such 
segments  are  represented  as  in  (3a) .  In  addition,  we  argue 
that  the  correct  representation  of  prevocalic  glides  cannot 
be  arrived  at  by  the  language  learner  unless  we  include  a 
constraint  in  the  grammar  that  rules  out  more  than  one 
diphthong-type  in  a  language  having  the  stress  properties 
of  Spanish. 

Consider  the  evidence  available  to  the  language 
learner  regarding  prevocalic  glides: 

(19) 

a)  Word-final  syllables  with  prevocalic  glides  do  not 
receive  stress  (ie.  far.ma.cja,  pur .ga. to. r jo. . . ) 

b)  Syllables  with  prevocalic  glides  allow  the 
incorporation  of  a  tautosyllabic  postvocalic  segment 
(tjes.to,  kwen.ta...) 

c)  There  are  no  apparent  co-occurrence  restrictions 
between  prevocalic  glides  and  elements  in  onset  position 

(trwe.no,  plje.ge) . 

Of  the  three  pieces  of  evidence  mentioned  above  only 
the  last  one  suggests  that  prevocalic  glides  appear  in 
rhyme  position.  However,  as  we  will  see  in  section  IV, 
there's  reason  to  think  that  some  co-occurrence 
restrictions  hold  between  prevocalic  glides  and  complex 
onsets.  If  this  turns  out  to  be  the  case,  then  none  of  the 
available  evidence  will  actually  point  to  a  representation 
of  prevocalic  glides  as  in  3a)  .  The  first  two  pieces  of 
evidence  (19a)  and  19b))  suggest  that  the  glide  might  not 
be  in  rhyme  position.  If  it  where,  it  would  tend  to 
attract  stress  in  word-final  position  and  it  would  not 
allow  the  incorporation  of  an  additional  segment  in  the 
rhyme.  Thus,  the  available  evidence  is,  at  best, 
inconclusive.  Crucially,  the  most  convincing  piece  of 
evidence  in  favor  of  a  representation  such  as  (3b) ,  namely, 
that  antepenultimate  stress  is  not  possible  when  there  is  a 
prevocalic  glide  in  the  penultimate  syllable  is  not 
available  to  the  learner  since  that  constitutes  negative 
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evidence.  That  is,  while  the  absence  of  forms  like 
ca. ryo.ca  or  a.sya.do  constitutes  an  important  piece  of 
evidence  for  the  linguist,  it  is  not  available  to  the 
learner  as  a  source  of  guidance  in  the  acquisition  process. 

In  contrast  to  the  difficulty  encountered  in 
determining  the  status  of  prevocalic  glides,  the  structure 
of  postvocalic  consonants  is  easily  determined.  The  fact 
that  word-final  falling  diphthongs  always  receive  stress 
suggests  that  they  are  part  of  a  branching  rhyme.  The 
existence  of  forms  like  naw. ti . co,  and  caws . ti . co  confirms 
that  falling  diphthongs  must  not  be  derived. 

If  the  syllabic  properties  of  diphthongs  in  Universal 
Grammar  are  such  that  the  existence  of  one  type  of 
diphthong  rules  out  the  possibility  of  the  other  one 
existing  as  a  primitive,  then  the  acquisition  of  falling 
diphthongs  in  Spanish  should  signal  that  rising  diphthongs 
must  be  derived.  This  suggests  that  the  choice  of  "glide 
type"  available  to  a  language  might  be  expressed  in  terms 
of  parameters.  Setting  the  parameter  one  way  blocks  the 
opposite  value  of  the  parameter.  With  regard  to  Spanish, 
it  seems  reasonable  to  assume  that  the  evidence  available 
for  falling  diphthongs  is  more  easily  analyzed  than  that 
available  for  rising  diphthongs.  Therefore,  setting  the 
parameter  to  the  value  that  indicates  the  existence  of 
falling  diphthongs  should  alert  the  learner  to  the 
existence  of  rules  that  derive  rising  diphthongs  from  a 
sequence  of  a  [+high,  -cons]  segment  followed  by  a  vowel. 

To  repeat,  if  we  accept  the  premise  that  negative 
evidence  is  not  available  to  the  learner,  then  we  must 
conclude  that  a  structure  like  (3a)  poses  problems  for 
acquisition.  As  an  alternative,  we  have  suggested  that  the 
existence  of  one  type  of  diphthong  in  a  language  might  rule 
out  the  possibility  of  the  other  one  existing  as  a 
primitive.  In  a  language  that  appears  to  have  both,  this 
means  that  once  the  existence  of  one  type  has  been 
ascertained  the  structure  of  the  other  must  be  derived  from 
contraction-type  rules. 

There  is  a  third  possibility  regarding  the  status  of 
prevocalic  glides  that  we  have  failed  to  consider.  There 
is  the  possibility  that  both  types  of  diphthongs  are 
possible  in  the  underlying  representation  of  Spanish  but 
that  prevocalic  glides  have  a  structure  that  is  different 
from  (3a)  —  a  structure  that  poses  no  problems  for 
acquisition.  A  priori,  there  is  no  reason  to  reject  an 
analysis  that  claims  that  a  language  can  have  both  types  of 
diphthongs  and  posits  structures  for  each  type  that 
facilitate  the  acquisition  of  each.  Such  an  account  would 
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not  be  inferior  to  one  that  claims  that  only  one  diphthong 
type  is  possible  in  a  language  and  derives  the  other  by 
rules  of  contraction. 

If  no  principle  of  grammar  rules  out  the  possibility 
of  more  than  one  diphthong-type  in  a  language  and  there  is 
a  structure  for  each  type  that  does  not  pose  the  type  of 
problems  shown  to  exist  for  (3a)  ,  then  all  the  language 
learner  has  to  do  upon  hearing  both  diphthongs  in  Spanish 
is  two  conclude  that  both  structures  exist.  If,  on  the 
other  hand,  there  is  a  principle  ruling  out  more  than  one 
diphthong  type,  and  both  types  of  diphthongs  appear  to 
exist  in  the  language,  then  the  language  learner  must 
figure  out  which  one  is  present  in  the  underlying 
representation  and  how  the  other  one  is  derived. 

Thus,  a  representation  of  prevocalic  glides  that  is 
consistent  with  the  facts  available  to  the  learner  could 
facilitate  the  acquisition  process  without  making  it 
necessary  to  posit  a  constraint  that  rules  out  more  than 
out  diphthong-type  in  a  language  that  appears  to  have  both. 

Schane  (1987)  mentions  that  cross-linguistically 
prevocalic  glides  generally  count  as  one  mora,  while 
postvocalic  glides  count  as  two.  In  view  of  this,  he 
proposes  the  following  representations: 

(20)        a)  b) 

V  V  V 

I   I  /\ 

e   i  i  e 

(20a) ,  "with  its  two  V  slots  is  a  proper  representation  for 
a  falling  diphthong  only;  a  rising  diphthong  should  be 
represented  as  in  (20b),  with  a  single  V  slot,  a 
representation  that  has  been  proposed  also  for  short 
diphthongs."  (Schane  (1987)  p.  X). 

Keeping  in  mind  the  representation  in  (20b)  let  us 
reconsider  the  evidence  available  to  the  learner  regarding 
the  structure  of  prevocalic  glides  (see  19)  .  All  of  the 
facts  suggest  that  prevocalic  glides  must  be  structured  as 
in  (20b).  Regarding  (19a),  since  rising  diphthongs  consist 
of  a  single  V  slot  they  will  not  attract  stress  in 
word-final  position.  A  representation  such  as  (20b)  also 
predicts  that  the  incorporation  of  an  additional  segment 
following  the  vowel  will  be  possible  if  CVC  syllables  are 
possible,  since  the  number  of  elements  in  the  skeletal  tier 
will  not  be  affected  by  the  presence  of  a  prevocalic  glide. 
Finally,  the  lack  of  co-occurrence  restrictions  between  the 
onset  and  prevocalic  glides  confirms  the  hypothesis  that 
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the  glides  must  be  in  the  rhyme  of  the  syllable  in  which 
they  appear.  Thus,  all  the  available  evidence  suggests 
that  prevocalic  glides  have  a  structure  like  (20b) . 
However,  the  fact  that  prevocalic  glides  in  the  penultimate 
syllable  of  a  word  block  antepenultimate  stress  argues 
against  the  hypothesis  that  a  structure  like  (20b)  might  be 
part  of  the  underlying  inventory  of  diphthong  types. 
Unfortunately,  this  last  piece  of  evidence  which  is  crucial 
for  arriving  at  the  correct  representation  of  prevocalic 
glides  is  not  available  to  the  learner  since  it  constitutes 
negative  evidence.  In  other  words,  all  the  available 
evidence  regarding  the  structure  of  prevocalic  glides 
should  lead  the  language  learner  to  make  the  incorrect 
hypothesis  that  prevocalic  glides  have  a  structure  like 
(20b) .  One  crucial  piece  of  evidence  suggests  however,  that 
this  is  not  right.  If  prevocalic  glides  had  a  structure 
like  (20b)  then  antepenultimate  stress  would  not  be  blocked 
in  words  having  a  rising  diphthong  in  the  penultimate 
syllable.  Unfortunately,  this  last  piece  of  evidence  is  not 
available  to  the  learner  because  it  relies  on  the  absence 
of  evidence. 

Consequently,  if  we  start  out  with  the  assumption  that 
both  types  of  diphthongs  are  possible  in  Spanish  and  try  to 
arrive  at  a  representation  of  prevocalic  glides,  the  facts 
will  incorrectly  point  in  the  direction  of  a  representation 
like  (20b) .  The  only  way  to  make  sure  that  a  structure  like 
(20b)  will  not  be  hypothesized  as  a  possible  diphthong-type 
of  Spanish  is  to  include  a  stipulation  in  the  grammar  that 
rules  out  the  possibility  of  having  both  rising  and  falling 
diphthongs  in  the  same  language.  This  way,  as  soon  as  the 
existence  of  falling  diphthongs  is  ascertained  rising 
diphthongs  are  necessarily  analyzed  as  derived. 

To  summarize,  we  have  argued  in  this  section  that  a 
theory  of  phonology  which  rules  out  more  than  one 
diphthong-type  in  a  language  is  a  necessesity  in  accounting 
for  acquisition.  A  structure  like  (3a)  poses  serious 
problems  for  acquisition  because  none  of  the  facts 
available  to  the  language  learner  lead  to  the  conclusion 
that  they  should  be  assigned  a  representation  like  (3a)  . 
The  evidence  presented  in  (19)  suggests  that  prevocalic 
glides  have  a  structure  like  (20b) .  However,  such  a 
representation  is  untenable  as  an  underlying  structure  as 
evidenced  by  the  fact  that  prevocalic  glides  in  the 
penultimate  syllable  of  a  word  block  antepenultimate 
stress.  The  only  way  to  make  sure  then,  that  (20b)  will 
not  be  construed  as  an  underlying  representation  of 
prevocalic  glides  is  to  rule  out  the  possibility  of  having 
more  than  one  type  of  diphthong  in  a  language  like  Spanish. 
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Thus,  the  representation  of  diphthongs  defended  in 
section  II  is  not  only  to  be  preferred  over  previous 
analyses  because  it  accounts  for  facts  that  such  analyses 
leave  unexplained,  but  also  because  it  allows  for 
acquisition  to  proceed  correctly. 

One  final  point  regarding  the  representation  in  (20a) 
must  be  made  before  concluding  this  section.  Nothing  in  the 
analysis  presented  in  section  II  rules  out  a  structure  like 
(17a)  from  being  derived  via  a  contraction  rule.  It  is 
possible  that  a  rule  of  contraction  could  derive  such  a 
structure  from  a  sequence  of  a  [+high]  segment  and  a  vowel. 
What  is  crucial  to  our  analysis  is  that  such  a  structure 
cannot  be  part  of  the  underlying  inventory  of  syllable 
types  in  Spanish. 

IV.   SUMMARY  OF  OTHER  ISSUES 

As  we  have  previously  mentioned,  one  important  issue 
that  remains  to  be  investigated  is  the  position  of 
prevocalic  glides  after  Contraction  (9) .  Several 
possibilities  suggest  themselves: 

A)  Contraction  could  place  the  [+high,  -cons]  segment  in 
onset  position  as  in: 

pi. a. no   >  pia.no 

XX    XXX        XXX    XX 

ORROR      0''r0R 

B)  Another  available  option  is  to  syllabify  the  glide 
into  the  following  rhyme  as  in 


p  i.  a.  n  o 

XX   X   XX 

ORROR 


p  i  a  . 


n  o 

X  X 

0  R 


C) 


A  third  possibility  is  to  eliminate  the  skeletal  slot 


associated  with  the  glide  and  to  allow  the  floating 
features  to  link  to  the  skeletal  slot  of  the  following 
vowel : 

pi. a. no  >        pia.no 

X     X         X         X     X  X       ^         X     X 


ORROR 


R      0   R 


The  first  possibility  is  suggested  by  the  existence  of 
syllables  with  complex  onsets  that  fail  to  undergo 
Contraction.  The  contrast  between  the  two-syllable 
sequences  in  21a)  and  the  rising  diphthongs  in  21b) 
suggests  that  the  complex  onset  might  be  blocking  the 
application  of  Contraction  in  the  first  set  of  examples. 
Crucially,  we  could  explain  why  Contraction  fails  to  apply 
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in  the  first  set  of  words  by  invoking  a  principle  that 
rules  out  more  than  two  segments  in  the  onset.  If 
Contraction  incorporates  the  [+high,  -cons]  segment  in 
onset  position,  it  follows  that  the  presence  of  two 
elements  in  the  onset  will  block  the  incorporation  of  an 
additional  one. 

(21)  a.  b. 

cri.d.llo  kyo's.ko 

cli.en.te  ca.lyen.te 

fri.al.dad  fye'l 

cri.an.za  pa.ryen.te 

However,  there  are  a  number  words  in  Spanish  that  have 
syllables  with  complex  onsets  and  rising  diphthongs: 

(22)  plye.ge,  trwe.no,  prye'.to,  frye'.go,  grye''.go, 
trwe .ke 

At  first  glance,  these  items  argue  against  the  claim 
that  certain  restrictions  hold  between  onsets  and 
prevocalic  glides.  Moreover,  the  examples  in  22)  raise  the 
question  of  why  Contraction  fails  to  apply  in  21a)  .  That 
is,  if  there  are  no  constraints  keeping  the  prevocalic 
glides  in  22)  from  being  incorporated  into  onsets  that 
already  have  two  elements,  there  is  no  reason  why 
Contraction  shouldn't  apply  to  the  forms  in  21a) . 

Closer  inspection  reveals  however,  that  the  set  of 
lexical  items  that  have  tautosyllabic  sequences  of  a  rising 
diphthong  and  a  complex  onset  is  largely  composed  of  words 
that  exhibit  the  V/GV  alternation  we  have  discussed  before: 

(23)  ,  ^ 

plye.ge    /  plegar         trwe.no    /   tronar 
"fold"  (N) /  "to  fold"      "thunder" (N) /  "to  thunder" 

prye.to  /apretar  frye'^go   /   frega'r 

"dark"  /  "to  tighten"      "I  scrub"  /  "to  scrub" 

grye'go  /  Gccya  trwe''.  ke    /  troca'r 

"Greek"  /  "Greece"        "change" (N) /"to  change 

While  we  will  not  pursue  here  a  study  of  the 
properties  of  lexical  items  with  complex  onsets  and 
prevocalic  glides,  the  facts  in  22)  suggest  that  diphthongs 
arising  from  Contraction  cannot  be  preceded  by  a  complex 
onset  while  those  that  alternate  are  subject  to  no 
co-occurrence  restrictions  with  the  onset..  ^  .      , 

these  considerations,  let  us  assume  that  contracte^^ForSs 
have  a  structure  such  as  that  proposed  under  (A)  .  With 
this  we  explain  why  Contraction  fails  to  apply  to  the  forms 
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in  21)  which  have  a  complex  onset.  Alternating  forms  have 
the  prevocalic  glide  in  nuclear  position  as  in  (B)  or  (C)  . 
Since  the  [+high,  -cons]  segment  in  the  alternating  forms 
is  not  in  onset  position,  it  follows  that  there  will  be  no 
co-occurrence  restrictions  between  the  onset  and  the  glide. 
The  details  of  the  structure  of  alternating  diphthongs  is 
an  issue  that  warrants  further  investigation. 

Another  important  issue  that  remains  to  be 
investigated  is  the  stress  properties  of  syllable-initial 
glides.  Onset  glides  in  the  final  syllable  of  a  word  always 
block  antepenultimate  stress  (*pa.pa.ya).  In  the 
penultimate  syllable  however,  they  allow  stress  to  appear 
in  the  preceding  syllable  (ie.  contyu.ge,  Sa.ya.go) .  The 
fact  that  antepenultimate  stress  is  possible  in  words  with 
a  glide-initial  syllable  in  penultimate  position  confirms 
that  the  glide  is  not  in  nuclear  position.  If  it  were,  it 
would  cause  the  penultimate  syllable  to  have  a  branching 
rhyme  and  block  stress.  However,  if  we  assume  that  the 
syllable-initial  [y]  is  strictly  in  onset  position,  we  have 
no  way  of  explaining  why  antepenultimate  stress  is 
impossible  for  words  like  papacy  a  since  no  other  onset 
consonant   in  the   same  position  blocks   antepenultimate 


stress  (ie.  nu.me.ro,  sa.ba.na.  Cor .do .ba. . . ) 
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The  answer  to  this  problem  requires  an  indepth  look  at 
certain  aspects  of  Spanish  phonology  that  are  beyond  the 
scope  of  this  paper.  One  possible  solution  that  merits 
further  study  consists  in  allowing  the  syllable-initial 
glide  to  be  an  ambisyllabic  consonant.  This  is  represented 
below: 

(21)  papaya 

X  X  X  X  X  X 

0  R  0  R  0  R 

According  to  this  analysis,  onset  glides  in  the  final 
syllable  of  a  word  block  antepenultimate  stress  because 
they  cause  the  penultimate  syllable  to  have  a  branching 
rhyme.  Onset  glides  in  the  penultimate  syllable  do  not 
block  antepenultimate  stress  because  they  do  not  affect  the 
size  of  the  penultimate  rhyme.  With  this  we  explain  the 
stress  properties  of  onset  glides.  The  consequences  of  this 
solution  remain  a  topic  of  investigation. 

Other  issues  that  merit  attention  include  the 
syllabification  differences  between  [y]  and  [w]  and  the 
representation  of  postvocalic  glides.  Steriade  (1984) 
argues  that  there  are  some  important  differences  in 
Romanian  between  [y]  and  [w] .  In  Spanish  there  also  appear 
to  be  some  differences  in  how  these  consonants  should  be 
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specified  underlyingly .  Finally,  certain  properties  of 
postvocalic  glides  suggest  that  at  least  some  falling 
dipthongs  can  be  derived  via  a  rule  of  contraction.  If 
this  turns  out  to  be  possible  it  will  represent  a  further 
step  in  simplfying  the  inventory  of  syllable-types  in 
Spanish.  This  is  a  desirable  outcome  from  the  point  of 
view  of  learnability . 


CONCLUSION 

We  have  presented  an  analysis  of  the  syllabification 
properties  of  [+high,  -cons]  segments  in  Spanish  that 
derives  all  rising  diphthongs  from  a  sequence  of  an 
unstressed  high  vowel  followed  by  an  onsetless  syllable. 
This  analysis  accounts  for  the  stress  properties  of  rising 
diphthongs  in  the  final  syllable  of  a  word.  In  this,  it 
represents  an  improvement  over  previous  analyses. 

The  solution  presented  here  is  also  desirable  from 
the  point  of  view  of  acquisition.  We  have  shown  that 
previous  representations  of  rising  diphthongs  are 
difficult,  if  not  impossible  to  learn  given  the  data 
available  to  the  language  learner.  By  eliminating  rising 
diphthongs  from  the  inventory  of  syllable-types  in  Spanish 
we  simplify  the  acquisition  process  and  eliminate  the 
problems  of  negative  evidence  in  previous  analyses.  In  all 
these  things  we  have  come  a  step  closer  to  a  theory  of 
phonology  that  solves  the  logical  problem  of  language 
acquisition. 


NOTES 

*I  am  indebted  to  Michael  Kenstowicz,  Dieter  Wanner,  and 
Zoann  Branstine  for  their  helpful  comments  and  discussions. 
All  errors,  of  course,  remain  strictly  my  own.  Este 
trabajo  va  dedicado  a  mi  abuelo. 


I  will  use  orthographic  form  throughout  this  paper, 
except  in  the  following  cases: 

a)  [+high,  -cons]  segments  that  are  unspecified  for  the 
feature  syllabic  or  that  are  assigned  syllabic  status 
in  the  course  of  a  derivation  will  be  represented  as 
[i]/[u].  [+high,  -cons]  segments  that  are  specified 
as  [-syll]  or  that  are  assigned  glide  status  during  the 
course  of  syllabification  will  be  denoted  by  [y]/[w]. 
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b)  Stress  will  be  marked  only  in  the  relevant  cases  and  not 
in  conformance  to  orthographic  conventions. 

c)  The  sound  /h/  (or  /x/  in  Castilin  dialects)  will  be 
represented  by  its  phonetic  symbol  ratheer  than  by  its 
orthographic  form  (-j)  . 

2 

There  are  a  few  lexical  items  in  Spanish  that  contain  a 
postvocalic  glide  followed  by  a  tautosyllabic  consonant 
(ie.  veyn.te  "twenty",  seys  "six",  caws .ti .co  "caustic"). 
However,  following  Harris  (1983),  I  have  decided  to 
consider  these  marginal  examples  that  do  not  represent 
productive  syllable  templates. 

See  Schane  (1987)  on  the  resolution  of  hiatus  in  Spanish. 


This  appears  to  be  the  case  for  the  large  majority  of 
words  that  have  a  complex  onset  followed  by  a  [+high, 
-cons]  segment.  However,  only  an  exhaustive  survey  of  the 
Spanish  lexicon  will  reveal  whether  this  observation  is 
true. 

This  is  not  entirely  correct,  palatal  consonants  and  the 
trilled  [r]  bloc)c  antepenultimate  stress  when  in  the  final 
syllable  of  a  word.  However,  there  is  reason  to  think  that 
these  consonants  are  complex  segments  (see  Carreira 
(1988)) . 
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TONOLOGY  OF  NOUN-MODIFIER  PHRASES  IN  JITA* 
Laura  J.  Downing 


In  Jit a,  some  noun-modifier  phrases  have  one 
more  high  tone  than  occurs  on  the  noun  and 
modifier  in  isolation.  In  this  paper,  I  first 
discuss  the  tone  patterns  of  nouns  and  show  how 
the  additional  high  tone  of  noun-modifier  phrases 
may  be  accounted  for  by  a  rule  inserting  a  high 
tone  on  the  final  syllable  of  a  noun  which 
precedes  a  modifier  phrase.  Secondly,  I  discuss 
problems  in  formulating  the  high  tone  insertion 
rule,  and  conclude  that  it  must  refer  to  "N' ' 
modifier  phrase"  as  the  trigger  for  high  tone 
insertion  to  correctly  account  for  all  the  data. 


0.   INTRODUCTION 

Jita  is  a  Lacustrine  Bantu  language  of  Tanzania 
(Guthrie's  Zone  E  Group  20),  and  my  paper  is  based  on  the 
speech  of  F.  T.  Magayane,  a  native  speaker  of  Jita  from  the 
island  of  Ukerewe.  Jita  has  received  almost  no  treatment, 
either  descriptive  or  analytical,  in  the  literature. 
However,  as  argued  in  Massamba  (1977)  ,  it  is  closely  related 
to  Ruri,  a  language  whose  tonal  phonology  has  been  the 
subject  of  some  study,  notably  Massamba' s  (1982)  Ph.D. 
dissertation  and  a  series  of  articles  by  Goldsmith  (1982, 
1984,  1986),  based  on  his  work  with  Massamba. 

Massamba  (1982)  noted  an  unusual  tone  pattern  in  noun- 
adjective  phrases  in  Ruri,  namely,  the  presence  of  a  high 
tone  on  the  first  syllable  (after  vowel  coalescence)  of  the 
adjective  in  contexts  where  the  source  of  the  high  tone 
could  not  be  the  preceding  noun.  For  example,  the  Ruri 
noun-adjective  phrases  below  (Massamba  (1982),  p  206)  have 
one  more  high  tone  than  is  found  on  the  words  in  isolation, 
so  that  the  tone  pattern  of  the  adjective  following  a  low 
noun  is  identical  to  that  following  a  high  noun: 

(la)  omu  g£si   ^wife'  omu  j6mu   ^good' 

omugSsu:muj6mu  'good  wife' 
(lb)  omu  sa:ni  *  friend'        omu  j6mu   'good' 

omusa:nu:muj6mu  'good  friend' 
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Thus,  an  underlying  high  tone  on  the  preceding  noun 
does  not  seem  to  contribute  the  additional  high  tone  of  the 
adjective.  Instead,  Massamba  accounted  for  it  by  a  rule 
inserting  a  phrasal  accent,  which  was  associated  with  the 
last  syllable  of  a  noun  followed  by  an  adjective. 

In  this  paper  I  will  show  that  in  Jita  noun-modifier 
phrases,  the  first  syllable  of  the  modifier  (after  vowel 
coalescence,  if  applicable)  likewise  has  a  high  tone  if  the 
preceding  noun  is  low-toned  (2a)  .  No  additional  high  tone 
appears,  however,  on  modifiers  following  nouns  with  a  high 
tone  on  the  penult  (penult-high  nouns,  (2b) ) ;  nor  when  the 
noun  has  a  high  tone  on  the  final  syllable  (final-high 
nouns;  (2c)  )  .  In  fact,  low  nouns  and  final-high  nouns 
neutralize  in  noun-modifier  phrases,  as  can  be  seen  by 
comparing  (2a)  with  (2c) : 


(2a)  Low  Noun 

ama  Bui   ^stones' 

amaBui  masito  'heavy  stones' 


ama  sito   'heavy' 


(2b)  Penult-Hiah  Noun 
omu  jigo  'load' 
omujig6  musito   'heavy  load' 


omu  sito   'heavy' 


(2c)  Final-High  Npyin 
li  tl   'log' 
liti  lisito   'heavy  log' 


li  sito 


*heavy' 


(The  falling  contour  which  occurs  in  final-high  nouns 
phrase-finally  is  derived  by  a  rule  which  will  be  discussed 
in  the  next  section.) 

Using  a  non-accentual  approach,  I  analyze  these  tone 
patterns  as  resulting  from  a  rule  of  Noun  Phrase  High 
Insertion,  which  inserts  a  high  tone  on  the  final  syllable 
of  a  noun  when  it  is  followed  by  a  modifier.  The  inserted 
high  tone  associated  with  the  final  syllable  of  the  noun  is 
then  either  deleted  or  it  spreads  rightward  one  syllable  and 
the  left  branch  delinks  by  rules  which  apply  quite  generally 
in  Jita. 


Although  this  analysis  resembles  in  some  respects  that 
proposed  by  Massamba  (1982)  for  the  analogous  facts  in  Ruri, 
I  will  show  that  the  inserted  high  tone  occurs  in  noun- 
modifier  phrases  not  discussed  by  Massamba  or  for  which  he 
gave  a  different  analysis.  Further,  I  will  argue  that  in 
Jita  the  inserted  high  tone  must  associate  with  the  final 
syllable  of  the  noun  and  not  the  first  syllable  of  the 
modifier,  while  Massamba  claims  this  choice  is  arbitrary  for 
Ruri.  Finally,  I  will  discuss  difficulties  encountered  in 
formalizing  the  domain  of  Noun  Phrase  High  Insertion. 
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1.   TONE  PATTERNS  OF  NOUNS  AND  BASIC  TONE  RULES  OF  JITA 

Before  presenting  an  analysis  of  the  tone  patterns  of 
Jita  noun-modifier  phrases,  I  will  briefly  discuss  the 
possible  underlying  tone  patterns  of  nouns  and  the  rules 
which  derive  their  surface  tone  patterns  phrase-finally. 
Like  many  other  Eastern  Bantu  languages  -  such  as  Digo 
(Kisseberth  (1984)),  Ganda  (Stevick  (1969)),  Haya  (Hyman  and 
Byarushengo  (1984))  and  Ruri  (Goldsmith  (1986)),  to  name  a 
few  -  Jita  only  has  high  tones  underlyingly,  with  most 
surface  lows  inserted  by  a  post-lexical  default  rule.  In 
the  lexical  representations  of  nouns,  it  is  necessary  to 
specify  if  there  is  a  high  tone  and  the  syllable  to  which  it 
is  linked.  Thus,  the  three  main  surface  tone  patterns  found 
in  noun  stems  of  two  syllables  may  be  characterized  as 
follows  (nouns  have  the  structure  (pre-pref ix+)  class  prefix 
+  stem) : 

(3a)  JS   tone:  o+mu+sa:ni   ^friend' 

(3b)  initial-  (penult-)  high:  o+mu+g£si  *woman' 

(3c)  final-high:  o+mu+tu:nga  'rich  person' 

Although  I  claim  that  the  noun  in  (3c)  has  a  high  tone 
linked  with  the  final  syllable  underlyingly,  it  surfaces  in 
phrase-final  position  with  a  falling  contour.  In  phrase- 
medial  position,  however,  the  final  syllable  of  a  final-high 
noun  is  low  and  the  next  syllable  rightward  is  high-toned, 
as  shown  by  the  data  below: 

(4a)  omu  tu:nga   'rich  person'     mumuji   'in  town' 
omutu:nga  mumuji   'rich  person  in  the  town' 

(4b)  eci  mina   'scorpion'      kumaBui   'by  the  rocks' 
ecimina  kumaBui   'scorpion  by  the  rocks' 

(4c)  i:ndara   'leopard'       ya:Bilima   'it  ran' 
i:ndara  y^:Bilima   'the  leopard  ran' 

The  high  tone  on  the  final  syllable  of  the  noun  seems 
to  have  shifted  one  syllable  rightward  in  these  phrases.  If 
the  high-low  (falling)  contour  on  the  final  syllable  were 
underlying,  however,  one  would  expect  the  low  tone  to  block 
rightward  shift  of  the  high  tone.  Since  this  does  not  occur 
nor  does  a  low  tone  manifest  itself  in  any  other  way  when 
final-high  words  are  in  phrase-medial  position,  I  conclude 
that  the  falling  contour  tone  is  derived  from  an  underlying 
level  high  tone  on  a  final  syllable.  Following  Kisseberth' s 
(1984)  analysis  of  a  similar  phenomenon  in  Digo,  I  account 
for  this  by  a  rule  which  inserts  a  low  tone  phrase-finally. 
This  low  tone  links  to  the  final  syllable  of  the  phrase, 
deriving  a  falling  contour  when  that  syllable  is  associated 
with  a  high  tone.  The  rules  of  Phrase-Final  Lowering  and 
Contour  Creation  are  given  below: 
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(5)  Phrase-Final   Lowering 

jar  — >   L   /  _]  phrase 

(6)  Contour  Creation 

H   L 
L-'' 
^  ] phrase 

Turning  to  three-syllable  steins,  the  following  surface 
patterns  occur: 

(7a)  0   tone:      o+mu+lamusi  ^ judge' 
(7b)  penult-high:  ji+ngok6la  ^knees' 

li+nanaji  'pineapple' 
(7c)  final-high:   li+darina  'tangerine' 

Theoretically,  three-syllable  noun  stems  would  be  expected 
to  have  at  least  four  tone  patterns  -  fi  tone,  initial-high, 
penult-high  and  final-high  -  but  the  initial-high  pattern 
does  not  surface.  Phrase-medially,  however,  three-syllable 
penult-high  nouns  show  two  different  tone  patterns.  To 
illustrate,  in  phrases  (8a)  and  (8b)  below,  the  penult  high 
tone  has  shifted  to  the  final  syllable,  but  in  phrases  (8c) 
and  (8d)  the  high  tone  remains  on  the  penult: 

(8a)  li  nanaji   'pineapple'    lya:mali:Bwa   'was  eaten' 
linanaji  lya:mali:Bwa  'the  pineapple  was  eaten' 

(8b)  i:  Bustani   'garden'  mumuji   'in  town' 

i:Bustani  mumuji   'the  garden  in  town' 

(8c)  i:  ngokdla   'knee'       ya:mau:ta:ra   'was  hurt' 
i:ngok6la  ya:mau:ta:ra   'the  knee  was  hurt' 

(8d)  eBi  tu:nguru   'onions'    mucikapo   'in  the  basket' 
eBitu:nguru  mucikapo   'the  onions  in  the  basket' 

The  phrases  below  show  that  when  two-syllable  penult- 
high  nouns  appear  in  phrase-medial  position,  their  high 
tones  also  shift  to  the  final  syllable: 

(9a)  omu  ftembe   'mango  tree'       gwa:temwa   'was  cut' 
omuftembe  gwa:temwa   'the  mango  tree  was  cut' 

(9b)  aBa  le:la   'tall  people'  kumulyango   'by  the  door' 
aBale:l^  kumulyango   'tall  people  by  the  door' 

(9c)  i:  n  j6ki   'bee'     kuBwa:syo   'on  the  flowers' 
i:njok£  kuBwa:syo   'the  bee  on  the  flowers' 
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Thus,  three-syllable  penult-high  nouns  like  those  in  (8a) 
and  (8b)  show  the  same  phrase-medial  tonal  behavior  - 
namely,  a  shift  of  their  high  tone  from  the  penult  to  the 
final  syllable  -  as  two-syllable  penult-high  nouns.  What 
remains  to  be  explained  is  why  the  penult  high  tone  of  the 
nouns  in  (8c)  and  (8d)  does  not  shift  in  phrase-medial 
position. 

A  similar  problem  arises  in  the  tone  pattern  of 
infinitives  in  Jita  (oku-  is  the  infinitive  prefix) .  Two- 
syllable  high  verb  stems  always  have  a  high  tone  on  the 
stem-initial  (or  penult)  syllable  in  phrase-final  position. 
This  penult  high  tone  shifts  to  the  final  syllable  when  the 
infinitive  is  in  phrase-medial  position,  just  as  it  does  in 
two-syllable  penult-high  nouns  (compare  the  data  below  with 
that  in  (9)  )  : 

(10a)  oku  i:ga   'to  look  for'  ji:  n  d6Bo   'buckets' 
okui:gl  ji:nd6Bo   'to  look  for  buckets' 

(10b)  oku  B6na   'to  see;  get'       i:  fioni   'bird' 
okuBon^  i:ftoni   'to  see/get  a  bird' 

Three-syllable  high  verb  stems  have  a  high  tone  on  the 
second  stem  syllable  (or  penult)  in  phrase-final  position, 
but  their  penult  high  tone  never  shifts,  just  like  that  of 
the  three-syllable  nouns  in  (8c)  and  (8d) : 

(11a)  oku  kar^rnga   'to  fry'       ji:  n  swi   'fish' 
okukar^rnga  ji:nswl   'to  fry  fish' 

(lib)  oku  Bag^la   'to  weed'        i:  Bustiini   'garden' 
okuBagala  i:Bust£ni   'to  weed  the  garden' 

Tone  shift  also  occurs  in  other  environments  in 
infinitives.  As  noted  above,  two-syllable  high  stems  have  a 
high  tone  on  the  initial,  or  penult,  syllable.  However, 
when  derivational  suffixes  are  added  to  these  stems,  the 
high  tone  shifts  to  the  second  stem  syllable  if  the  derived 
stem  has  three  syllables  or  more,  as  illustrated  below: 

(12a)  oku  B6n-a   'to  get;  see' 

(12b)  oku  Bon-^n-a   'get/see  each  other' 

(12c)  oku  Bon-er-a   'to  get  something  for  someone' 

(12d)  oku  Bon-lr-an-a  'to  get  something  for  each  other' 

(13a)  oku  i:g-a   'to  look  for' 

(13b)  oku  i:g-^n-a   'to  look  for  each  other' 

(13c)  oku  i:g-ir-a   'to  look  for  something  for  someone' 

(13d)  oku  i:g-ir-an-a  'to  look  for  s.t.  for  ea.  other' 

Toneless  stems  remain  toneless  when  the  suffixes  are  added 
to  them,  as  shown  below: 
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(14a)  oku  Bum-a   ^to  hit' 

(14b)  oku  Bum-an-a   ^to  hit  each  other' 

(14c)  oku  Bum-ir-a   'to  hit  for  someone' 

(14d)  oku  Bum-ir-an-a   'to  hit  for  each  other' 

(14e)  oku  gur-a   'to  buy  something' 

(14f)  oku  gur-ir-a   'to  buy  something  for  someone' 

(14g)  oku  gur-ir-an-a   'to  buy  s.t.  for  each  other' 

Thus,  these  suffixes  must  be  toneless  underlyingly,  and  the 
high  tone  on  the  second  syllable  of  the  derived  high  stems 
in  (12)  and  (13)  must  be  derived  by  the  stem  high  tone 
shifting  one  syllable  to  the  right. 

The  derived  three-syllable  verb  stems  in  (12b)  and 
(12c)  and  (13b)  and  (13c)  have  a  high  tone  on  the  penult 
after  tone  shift.  Unlike  the  underlying  penult  high  tone  on 
the  two-syllable  stems,  the  derived  penult  high  tone  will 
not  shift  phrase-medially,  as  shown  below  (compare  these 
examples  with  (10a)  and  (10b) ,  above)  : 

(15a)  oku  Bon-er-a   'to  get  something  for  someone' 
li  ge:mbe   'plow 
okuBonera  lige:mbe   'to  get  a  plow  for  someone' 

(15b)   oku  Bon-an-a   'to  see  each  other' 
kusuile   'at  school' 
okuBonana  kusu:le   'to  see  each  other  at  school' 

(15c)   oku  i:g-ir-a      'to  look  for  s.t.  for  someone' 
i:  n  sikiri   'donkey' 

okuirgira  i:nsikiri   'to  look  for  a  donkey  for 
someone' 

(15d)   oku  i:g-an-a   'to  look  for  each  other' 
kusu:le   'at  school' 

okui:gana  kusu:le   'to  look  for  each  other  at 
school' 

Another  context  in  which  tone  shift  occurs  in 
infinitives  is  in  low  infinitives  with  one  object  marker 
(OM)  .  All  OM' s  are  high-toned  in  Jita  (except  the  first 
person  singular)  and  immediately  precede  the  verb  stem. 
When  they  are  prefixed  to  low  verb  stems,  their  high  tone 
shifts  one  syllable  to  the  right  of  the  OM  (i.e.,  to  the 
first  stem  syllable),  except  when  the  OM  is  the  penult. 
This  is  illustrated  in  the  examples  below  (the  OM' s  are 
underlined) : 

(16a)  oku  sy-a  'to  grind' 

(16b)  oku  ij.-sy-a   'to  grind  it  (Class  9) 


(16c)  Oku  Bum-a   'to  hit' 

(16d)  oku  nm-Bum-a   *to  hit  him' 

(16e)  oku  iQii-Bum-ir-a   *to  hit  for  him' 

(16f)  oku  gur-a   *to  buy' 

(16g)  oku  ci-gur-a   'to  buy  it  (Class  7)' 

(16h)  oku  ci-gur-ir-a   'to  buy  it  (CI.  7)  for  someone' 

The  low  infinitives  in  (16d)  and  (16g)  come  to  have  a  high 
tone  on  the  penult  after  tone  shift.  Unlike  the  underlying 
penult  high  tone  on  the  OM  in  (16b) ,  the  derived  penult  high 
tone  will  not  shift  phrase-medially,  as  shown  below  (compare 
(17a)  with  (17b) ,  (17c) ) : 

(17a)  oku  ir-sy-a   'to  grind  it  (CI.  9)' 
kumugera  'by  the  river' 
okui:sy£  kumugera   'to  grind  it  by  the  river' 

(17b)  oku  mu-Bum-a   'to  hit  him'    muno   'alot' 
okumuBuma  muno   'to  hit  him  alot' 

(17c)  oku  ci-gur-a   'to  buy  it  (CI.  7)   le:16   'today' 
okucigura  le:16   'to  buy  it  today' 

The  data  in  (9),  (10)  and  (17a)  show  that  surface 
penult  high  tones  which  are  underlyingly  associated  with  the 
penult  will  shift  one  syllable  to  the  right  in  phrase-medial 
position.  Surface  penult  high  tones  which  are  underlyingly 
associated  with  the  antepenult,  as  in  (15),  (17b)  and  (17c), 
will  not  shift  in  phrase-medial  position,  however.  Thus,  I 
conclude  that  the  surface  penult  high  tones  of  the  nouns  in 

(8a)  and  (8b)  which  shift  to  the  final  syllable  in  phrase- 
medial  position  are  underlyingly  associated  with  the  penult. 
By  contrast,  the  surface  penult  high  tones  of  the  nouns  in 

(8c)  and  (8d)  and  the  verbs  in  (11)  which  do  not  shift 
rightwards  in  phrase-medial  position  are  underlyingly 
associated  with  the  antepenult,  or  stem-initial  syllable, 
and  have  shifted  rightward  onto  the  penult. 

Underlyingly,  then,  three-syllable  noun  stems  do  have 
four  possible  tone  patterns  -  namely,  all-low,  initial-high, 
penult-high  and  final-high.  The  initial-high  will  always  be 
neutralized  with  the  penult-high  in  phrase-final  position, 
however,  due  to  tone  shift.  Only  in  phrase-medial  position 
will  underlyingly  initial-high  and  penult-high  three- 
syllable  noun  stems  contrast,  since  only  underlying  penult 
high  tones  will  shift  to  the  final  syllable. 

This  process  of  tone  shift  may  be  accounted  for  by 
rules  which  spread  the  high  tone  one  syllable  rightward  and 
then  delink  the  left  branch  of  the  derived  doubly-linked 
high,  as  formulated  below: 
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(18)  Rightward  Spread  (RS) 


(19)  Leftward  Delinking  (Delink) 
H 


RS  and  Delink  are  phrase-level,  or  post-lexical,  rules, 
because  they  apply  not  only  across  both  morpheme  and  word 
boundaries  but  also  within  stems.  This  context-free, 
across-the-board  application  results  in  absolute 
neutralization  of  the  underlying  tone  pattern  in  stems  of 
three  syllables  or  more  with  a  pre-penult  high  tone. 
Further,  RS  and  Delink  must  be  ordered  before  the  post- 
lexical  default  rule  inserting  low  tones,  since  RS  does  not 
create  contour  tones  or  downstep,  nor  do  low  tones  interact 
in  any  other  way  with  RS  which  might  indicate  that  low  tones 
were  specified  when  RS  applies. 

The  derivations  of  (8b)  and  (8d)  below  illustrate  how 
the  correct  phrase-medial  tone  patterns  of  three-syllable 
penult-high  nouns  may  be  predicted,  if  we  postulate  that 
before  RS  and  Delink  apply  the  high  tone  is  underlyingly 
associated  with  the  penult  in  nouns  with  "shifting"  penult 
high  tones  like  i ;Bust^ni  ^garden'  (20) ,  but  with  the 
initial  syllable  (antepenult)  in  nouns  with  "non-shifting" 
penult  high  tones  like  eBiturnquru  ^onions'  (21) : 

(20)   Derivation  of  (8b) 

i;Bustani  mumuii   'garden  in  town' 

H        H 

i  .       I.. 

UR        i:Bustani  mumuji 

garden     in  the  town 


H       H 

RS  &  |\       I  ? 

Delink    i:Bustani  mumuji 

SR        i:Bustani  mumuji 
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(21)   Derivation  of  (8d) 

eBituinauru  mucik^po  ^onions  in  the  basket' 


UR        eBiturnguru  mucikapo 

onions       in  the  basket 

H  H 

RS  &  P^^.^  I  ? 

Delink    eBitu:nguru  mucikapo 

SR        eBiturnguru  mucikcipo 

I  have  not  yet  accounted  for  the  fact  that  a  high  tone 
underlyingly  associated  with  the  penult  does  not  spread  when 
a  word  is  in  phrase-final  position,  as  in  mumu  i  i  and 
mucikapo  in  (20)  and  (21),  respectively.  There  are  two 
possible  ways  of  analyzing  this  condition  on  RS.  The  first 
possibility  is  that  final  syllables  are  extratonal  at  the 
phrase  level,  but  not  extratonal  for  lexical  tonal 
association.  Since  RS  is  a  phrase-level  rule,  there  would 
thus  be  no  available  rightward  syllable  for  the  penult  high 
tone  to  spread  to  when  RS  applies. 

The  second  possibility  is  that  the  high  tone  does 
spread  to  the  final  syllable.  Then,  before  Delink  applies, 
Phrase-Final  Lowering  (5)  and  Contour  Creation  (6)  apply  to 
derive  the  following  configuration: 

(22)  H    L 

^     ^1  phrase 

A  rule  of  contour  simplification  would  then  have  to  apply  to 
delink  the  high  tone  of  a  falling  contour,  when  that  high 
tone  is  multiply-linked: 

(23)  Contour  Simplification 
H   L 

^   ^] phrase 

As  illustrated  in  the  derivations  below,  both  of  these 
solutions  correctly  allow  spread  from  the  penult  to  a  word- 
final  syllable  when  a  word  is  in  phrase-medial  position,  but 
block  or  undo  spread  from  the  penult  to  the  final  syllable 
when  a  word  is  in  phrase-final  position: 
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(24a)  Extratonality  analysis,  derivation  of  (8b) : 
i:Bustani  mumu-ji   'garden  in  town' 


UR 


i:Bustani 
garden 


mumu  J 1 

in  the  town 


RS  & 
Delink 


SR 


H        H 
i:Bustani  mumu(3i)gjj 
i:Bustani  mumuji 


(24b)  Contour  Simplification  analysis,  derivation  of 
(8b) :   i:Bustani  mumuii   'garden  in  town' 


UR 


i:Bustani 
garden 


H 

! 

mumu  j  i 

in  the  town 


RS 


1\ 
i:Bustani 


H 
mumu  j  i 


Final 
Low 


H 

i:Bustani 


H    L 
mumuji] phrase 


Contour 
Creation 


H 

|\ 
i :Bustani 


H   L 
mumuji] phrase 


H  H  L 

Contour         |\  [\| 

Simplif.   i:Bustani  mumuji 

H  H  L 

+\  I  i 

Delink    irBustani  mumuji 

SR        i:Bustani  mumuji 


In  the  Extratonality  analysis  (24a) ,  only  phrase-final 
syllables  are  extratonal.  Thus,  a  high  tone  may  spread  from 
the  penult  to  the  final  syllable  of  i;Bust€ni.  which  is  non- 
final  in  the  phrase,  but  the  high  tone  on  the  penult  of 
mumd  j  i  may  not  spread  since  the  following  syllable  is 
phrase-final  and  thus  extratonal.  In  the  Contour 
Simplification  analysis  (24b) ,  lows  are  only  introduced 
phrase-finally.  Thus,  while  the  high  tones  of  both 
i :Bustlni  and  mumuii  spread.   Final  Lowering  and  Contour 
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Creation  may  apply  only  to  mumuii.  Contour  Simplification 
will  then  apply  to  mumu i i ,  undoing  spread,  while  Delink  will 
apply  to  i;Bustdni,  deriving  tone  shift.  I  have  found  no 
evidence  so  far  in  favor  of  one  of  these  hypotheses  over  the 
other,  so  I  shall  arbitrarily  adopt  the  extratonality 
solution  in  this  paper. 

With  this  background  on  the  underlying  tone  patterns  of 
nouns  and  the  basic  tone  rules  of  Jita  in  mind,  I  will  now 
turn  to  the  problem  of  high  tone  insertion  in  noun-modifier 
phrases.  Before  presenting  my  analysis  of  the  tone  patterns 
in  Jita  noun-modifier  phrases,  however,  I  will  sketch 
Massamba's  (1982)  account  of  analogous  facts  in  Ruri,  a 
language  closely  related  to  Jita. 


2.    TONE  IN  RURI  NOUN-MODIFIER  PHRASES 

David  Massamba's  (1982)  Ph.D.  dissertation  on  Ruri 
tonology  concentrates  on  analyzing  tone  in  isolated  words, 
mainly  verb  forms,  using  an  accentual  approach.  In  his 
analysis,  the  basic  tone  melody  of  Ruri  is  LHL,  and  the 
accent  is  associated  with  the  first  low  tone.  Every  surface 
high  tone  is  the  result  of  an  underlying  accent.  The  LHL 
melody,  like  my  rule  of  RS,  accounts  for  the  fact  that  in 
Ruri,  as  in  Jita,  a  high  tone  usually  surfaces  one  syllable 
to  the  right  of  the  one  which  underlyingly  contributes  the 
high  tone.  Since  there  are  no  rules  of  tone  spread  in  this 
framework,  surface  tone  patterns  are  derived  by  rules 
referring  to  accent  placement. 

Massamba  (1982)  discussed  Ruri  noun-adjective  phrases 
in  some  detail  in  the  final  chapter  of  his  dissertation,  and 
noted  three  different  tonal  alternations.  The  first 
alternation  occurs  when  the  noun  has  a  high  tone  on  a  short 
penult  syllable  which  lengthens  phrase-medially: 

(25)  omw  aana   'child'       omu  j6mu   'good' 
omwa:n6:muj6mu  'good  child' 

The  second  alternation  occurs  when  the  noun  has  a  high  tone 
on  a  short  penult  syllable: 

(26)  omu  g^si   'wife'        omuj6mu   'good' 
omuglsu:muj6mu  'good  wife' 

The  third  alternation  occurs  when  the  noun  is  all  low: 

(27)  omu  sa:ni   'friend'  omu  j6mu   'good' 
omusa:nu:muj6mu  'good  friend' 

(Massamba  gives  no  examples  of  phrases  with  nouns  having  a 
high  tone  -  or  accent,  in  his  analysis  -  on  the  final 
syllable.) 
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In  (26)  and  (27)  above,  the  phrases  have  one  more  high 
tone  than  the  noun  and  adjective  do  in  isolation,  namely, 
the  first  syllable  (after  vowel  coalescence)  of  the 
adjective  has  become  high-toned.  Since  in  Massamba' s 
analysis  each  high  tone  is  the  result  of  an  accent  on  the 
preceding  syllable,  he  accounted  for  this  data  with  a  rule 
of  Noun-Adjective  Juxtaposition  inserting  an  accent  on  the 
final  vowel  of  the  noun.  He  claimed  that  associating  the 
accent  with  the  final  vowel  of  the  noun  rather  than  the 
first  vowel  (pre-prefix)  of  the  adjective  was  an  arbitrary 
decision,  however,  since  the  two  become  tautosyllabic  after 
vowel  coalescence.  To  account  for  the  absence  of  an 
additional  high  tone  in  phrases  like  (25) ,  Massamba  proposed 
a  rule  of  FV  Accent  Back  Hopping  to  fuse  the  accent 
introduced  by  Noun-Adjective  Juxtaposition  with  the  accent 
on  the  noun  stem: 

(28)   FV  Accent  Back  Hopping  (p.  207,  no.  14. iv.) 


V  V  +  V  # 

Massamba  (1982)  also  discussed  two  other  types  of  noun- 
modifier  phrases,  namely,  noun-demonstrative  phrases  and 
noun-possessive  phrases.  Demonstratives  all  have  a  low  tone 
on  their  first  syllable  in  isolation,  but  in  noun- 
demonstrative  phrases  the  first  syllable  of  demonstratives 
is  always  high-toned,  as  illustrated  below  (from  Table  7a, 
p.  219)  : 

(29a)  aBa  ana   '  children'    Banu   ^these' 
aBarna  Banu  *these  children' 

(29b)  omu  rume   *man'  unu   *this' 

omurume  unu   ^this  man' 

(29c)  eci  BuyQ   ^gourd'        cinu   ^this' 
eciBuyu  cinu  'this  gourd' 

(29d)  emi  simu   'arrows'       jinu   'these' 
emisimu  jinu  'these  arrows' 

Massamba  concluded  that  these  phrases  have  an  additional 
high  tone  due  to  the  same  process  involved  in  noun-adjective 
phrases,  that  is,  by  a  rule  which  inserts  an  accent  on  the 
final  vowel  of  the  noun.  Apparently  FV  Accent  Back  Hopping 
does  not  apply  to  noun-demonstrative  phrases,  however,  since 
there  is  an  additional  high  tone  on  the  demonstrative  after 
high  nouns  with  a  penult  vowel  which  lengthens  phrase- 
medially,  like  "aBarna, "  whereas  there  was  no  additional 
high  on  adjectives  following  these  nouns  (see  25) .  Massamba 
(1982)  does  not  comment  on  this,  however,  nor  does  he  give 
derivations. 
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Noun-possessive  phrases  do  not  seem  to  follow  the 
pattern  of  having  an  accent  inserted  on  the  final  vowel  of 
the  noun.  Ruri  possessive  pronouns  have  a  high  tone  on  the 
penult  in  isolation,  which  Massamba  analyzed  as  resulting 
from  an  accent  on  the  preceding  (initial)  syllable.  In 
noun-possessive  phrases,  however,  the  possessive  usually  has 
a  high  tone  on  the  initial  syllable,  as  illustrated  below 
(from  p.  212,  no.  20) : 

(30a)  omu  gasi   ^wife'        wa£ni   'my' 
omugisi  w4ani  'my  wife' 

(30b)  omu  sa:ni   'friend'  wa^ni   'my' 

omusarni  Wciani  'my  friend' 

(30c)  omw  a^na   'child'       wa^ni   'my' 
omwarnd  waani  'my  child' 

(30d)  eci  BuyO   'gourd'       caiini   'my' 
eciBuyu  ca^ni  'my  gourd' 

Massamba  concluded  that  the  accent  on  the  initial  syllable 
of  the  possessive  hops  backwards  one  syllable  to  the  final 
syllable  of  the  noun  in  these  phrases,  unless  the  penult  or 
final  syllable  of  the  noun  is  already  accented. 

Thus,  in  Massamba' s  (1982)  discussion  of  noun-modifier 
phrases  in  Ruri,  he  finds  no  unifying  principle  for 
analyzing  their  tone  patterns.  While  both  noun-adjective 
and  noun-demonstrative  phrases  acquire  an  additional  high 
tone  on  the  modifier  by  a  rule  of  accent  insertion,  only 
noun-adjective  phrases  require  the  rule  of  FV  Back  Hopping. 
Noun-possessive  phrases,  on  the  other  hand,  show  no  evidence 
of  an  additional  high  tone.  Rather,  in  certain  cases  the 
accent  on  the  first  vowel  of  the  possessive  will  hop  back  to 
the  final  vowel  of  the  noun. 

Although  Jita  and  Ruri  are  closely  related  languages, 
and  both  do  have  a  rule  inserting  a  high  tone  in  noun- 
modifier  phrases,  in  the  next  section  I  will  show  that  the 
inserted  high  tone  surfaces  in  different  tonal  contexts  in 
Jita  from  in  Ruri.  Further,  I  will  show  that  the  inserted 
high  tone  occurs  in  a  much  wider  range  of  noun-modifier 
contexts  in  Jita  than  Massamba  presents  for  Ruri.  In  fact, 
in  Jita  a  high  tone  is  inserted  before  any  nominal  modifier 
(the  definition  of  "modifier"  will  be  discussed  in  Section 
F,  below) .  Finally,  I  will  argue  that  in  Jita  the  inserted 
high  tone  necessarily  -  not  arbitrarily,  as  Massamba  claimed 
for  Ruri  -  associates  with  the  final  syllable  of  the  noun, 
not  the  initial  syllable  of  the  modifier. 
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3.    JITA  NOUN-MODIFIER  PHRASES 

3.1   NOUN-ADJECTIVE  PHRASES 

As  we  have  seen  in  the  preceding  section,  in  Ruri  noun- 
adjective  phrases  an  additional  high  tone  appears  after  both 
all-low  nouns  and  high  nouns  with  a  short  (phrase-medially) 
penult  syllable.  In  Jita,  an  additional  high  tone  also 
appears  in  noun-adjective  phrases,  but  in  different  tonal 
contexts  from  in  Ruri.  As  is  illustrated  in  (31),  in  Jita 
the  additional  high  tone  surfaces  only  after  nouns  which  are 
either  all  low  (31a)  or  which  have  a  high  tone  underlyingly 
on  a  pre-penultimate  syllable  ("non-shifting"  or  pre-penult 
high;  (31b) ) .  After  nouns  with  an  high  tone  underlyingly  on 
the  penult  (penult-high;  (31c)  )  or  final  syllable  (final- 
high;  (31d) ) ,  there  is  no  additional  high  tone,  so  that  the 
contrast  between  all-low  nouns  and  final-high  nouns  is  lost 
in  noun-adjective  phrases  (compare  (31a)  and  (31d) ) : 


Low  Nouns 

(31a)   ama  Bui   ^stones'       ama  sito 
amaBui  mcisito  'heavy  stones' 


'heavy' 


(31b)   eBi  Bira   'forests'         eBi  nene   'big' 
eBiBira  Binene   'big  forests' 


Pre-Penult  High  Nouns 

(31c)   eBi  tu:nguru   'onions'       eBi  sito 
eBiturngurw  eBisito   'heavy  onions' 


(31d)   i:  ngok:6la   'knee' 

i:ngok61a  i:nene   'fat  knee' 

Penult-High  Nouns 
(31e)   omu  jigo   'load' 

omujig6  musito   'heavy  load' 


i :  nene 


'heavy' 
fat;  big' 


omu  sito   'heavy' 


(31f)   olu  g6ye   'rope'  olu  le:la   'long' 

olugoye  lule:la   'long  rope' 


Final-High  Nouns 
(31g)   li  tl   'log' 
liti  lisito 


'heavy  log' 


(31h) 


omw  o:ng6   'nose' 
omwo:ngo  munene 


'big  nose' 


li  sito   'heavy' 
omu  nene   'big' 


To  account  for  the  additional  high  tone  in  noun- 
adjective  phrases  like  (31a),  (31b),  (31c)  and  (31d) ,  I 
propose  a  rule  of  Noun-Phrase  High  Insertion,  which  inserts 
a  high  tone  on  the  final  syllable  of  a  noun  when  it  is 
followed  by  a  modifier  within  N' ' ,  as  formulated  below: 
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(32)  Noun-Phrase  High  Insertion,  version  1  (NPHIl) 

H 

6  -->  6  /  [[... ]n  [6...]j,odif]N" 

(This  rule  is  essentially  a  non-accentual  version  of 
Massamba's  (1982)  Noun-Adjective  Juxtaposition  rule.)  Vowel 
Coalescence  may  then  apply  to  delete  the  first  (pre- 
prefixal)  vowel  of  the  adjective.  Finally,  Rightward  Spread 
(RS)  and  Leftward  Delinking  (Delink;  (18)  and  (19),  above) 
apply,  resulting  in  a  high  tone  on  the  prefix  of  the 
adjective,  as  illustrated  in  the  derivation  of  (31a) ,  below: 

(33)  Derivation  of  (31a) :  amaBui  m^sito  *heavy  stones' 


UR 


[ama  Bui]j^ 
stones 


:ama  sitoln^odif 
heavy 


NPHIl 


ama  Bui  ama  sito 


Vowel 
Coal 


ama  Bui  ma  sito 


RS  & 
Delink 


ama  Bui  ma  sito 


SR 


amaBui  mcisito   ^heavy  stones' 


Note  that  RS  and  Delink  are  crucially  ordered  after 
Vowel  Coalescence.  This  is  because  Vowel  Coalescence 
collapses  the  final  vowel  of  the  noun  and  the  initial  short 
vowel  of  the  following  modifier  into  a  single  syllable,  then 
deletes  the  second  vowel  or  glides  the  first  vowel  of  the 
collapsed  syllable.  It  is  from  this  collapsed  syllable  that 
RS  spreads  the  high  tone  one  syllable  rightward,  as  shown  in 
(33),  above.  If  Vowel  Coalescence  applied  after  RS  and 
Delink,  the  high  tone  on  the  final  vowel  of  the  noun  could 
only  spread  to  the  initial  vowel  of  the  modifier.  Then  when 
these  two  vowels  are  collapsed  into  a  single  syllable  by 
Vowel  Coalescence,  the  effect  of  tone  shift  would  be  lost, 
as  illustrated  below: 
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(34)    Derivation   of    (31a)  :    amaBui   meisito    'heavy   stones'  : 


Vowel  Coalescence  ordered  before  Rl 

UR 

[ama  Builjj   [ama  sitolmodif 
stones      heavy 

H 

NPHIl 

ama 

Bui   ama  sito 

RS  & 
Delink 

ama 

H 
Bui   ama  sito 

Vowel 
Coal 

ama 

H 

f 
Bui   ma  sito 

SR 

A  high  tone  is  inserted  in  all  noun-adjective  phrases, 
but  no  additional  high  tone  surfaces  in  final-high  noun 
phrases,  because  the  Twin  Sister  Principle  will  delete  one 
of  the  two  high  tones  associated  with  the  final  syllable  of 
the  noun  after  NPHIl.  This  is  illustrated  in  the  derivation 
of  (31g) ,  below: 

(35)  Derivation  of  (31g) :  liti  lisito   'heavy  log' 


H 
I 

[li  ti]N 
log 


[li  sitoln^odif 
heavy 


NPHIl 


H 

I 

li  ti 


H 


li  sito 


Twin 
Sister 


H   H 
li  ti 


li  sito 


Vowel 
Coal 


N/A 


RS  & 
Delink 


SR 


H 
li  ti    li  sito 
liti  lisito    'heavy  log' 


Thus,  all-low  and  final-high  nouns  are  neutralized  in  noun- 
adjective  phrases  because,  when  RS  applies,  both  have  one 
high  tone  on  their  final  syllable. 


A  different  principle,  however,  is  necessary  to  account 
for  the  absence  of  an  additional  high  tone  in  phrases  with 
penult-high  nouns  like  (31e) ,  above.  The  following 
derivation  of  (31e)  shows  that,  after  NPHIl  applies,  both 
the  penultimate  and  final  syllables  of  the  noun  are 
associated  with  a  high  tone: 

(36)   Derivation  of  (31e) :  omuiia6  musito   ^heavy  load' 


UR        [omu  jigoljj   [omu  sitolj^Q^if 
load         heavy 

H  H 

I  : 

NPHIl     omu  jigo     omu  sito 

High  tones  associated  with  adjacent  syllables  meet  the 
structural  description  of  a  common  rule  of  Bantu  tonology 
(Goldsmith  (1984)  known  as  Meeussen' s  Rule,  which  deletes  a 
high  tone  when  it  is  immediately  preceded  by  another  high 
tone  within  the  same  word: 

(37)  Meeussen' s  Rule  (MR) 

H  H  H 

I   I  I 

[  .  .  .6   6.  .  .]  — >  [  . ..  6   6. . .] 

It  should  be  pointed  out  that  Meeussen' s  Rule  also  applies 
in  other  contexts  in  Jita.  For  example,  when  high-toned 
object  markers  (OM's)  precede  a  high-toned  verb  stem,  the 
stem  high  tone  is  deleted,  neutralizing  the  contrast  between 
high-  and  low-toned  stems,  as  shown  below  (compare  these 
data  with  those  in  16,  above) : 

(38)  High  Infinitives  +  1  OM  (OM  underlined) 

(a)  Oku  ly-a   *to  eat' 

Oku  Bi-ly-a   'to  eat  it  (Class  8) 

(b)  Oku  B6n-a   'to  see' 

oku  mu-B6na   'to  see  him' 

(c)  oku  karci:ng-a   'to  fry' 

oku  ji-kcira:ng-a   'to  fry  them  (Class  10) 

In  noun  phrases,  Meeussen' s  Rule  applies  after  NPHIl  to 
delete  the  inserted  high  tone  associated  with  the  final 
syllable  of  the  noun  when  there  is  also  a  high  tone  on  the 
immediately  preceding  (penultimate)  syllable.  The 
derivation  of  (31e)  continues  as  follows: 
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(39)   Derivation  of  (31e) ,  continued; 


it 

omu 

•  1 

jigo 

omu  sito 

MR 

omu 

? 

jigo 

omu  sito 

Vowel 
Coal 

omu 

H 

1 

jigo 

musito 

RS  & 

Delink 

omu 

H 

jigo 

musito 

SR 


omujig6  musito   ^heavy  load' 


As  predicted  by  my  analysis,  noun-modifier  phrases  with 
three-syllable  penult-high  nouns  (those  with  a  "shifting" 
penult  high  tone)  have  the  same  tone  pattern  as  two-syllable 
penult-high  noun  phrases,  namely  the  inserted  high  tone  does 
not  surface: 

(40a)  li  nan^ji   'pineapple'       li  sito   'heavy' 
linanaji  lisito   'heavy  pineapple' 

(40b)  i:  Bustani   'garden'         i:  nene   'big' 
i:Bustani  i:nene   'big  garden' 

Meeussen's  Rule  applies  to  delete  the  inserted  high  tone  in 
these  phrases,  just  as  it  did  in  the  derivation  of  31c- 
omuiia6  musito  'heavy  load' .  This  is  illustrated  in  the 
derivation  of  (40a) ,  below: 

(41)  Derivation  of  (40a) : 

linanaii  lisito  'heavy  pineapple' 


UR 


[li   nanajilfj      [li    sito]jnodif 
pineapple  heavy 


NPHIl 


MR 


H  H 

I  i 
li  nanaji 

H 

,.      I.. 

li  nanaji 


li  sito 


li  sito 
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Vowel 
Coal 


N/A 


RS  & 
Delink 


SR 


H 

li  nanaji 


li  sito 


linanaji  lisito   ^heavy  pineapple' 


By  contrast,  noun  modifier  phrases  with  three-syllable 
initial-high  (i.e.,  pre-penult  high)  nouns  which  have  "non- 
shifting"  penult  high  tones  retain  the  inserted  high  tone, 
just  like  all-low  nouns  do.  This  is  predicted  in  my 
analysis,  since  I  claim  that  the  high  tone  of  these  nouns  is 
underlyingly  on  the  initial  syllable,  even  though  it  always 
surfaces  on  the  penult.  Thus,  Meeussen' s  Rule  will  not 
apply  to  delete  the  inserted  high  tone  in  pre-penult  high 
noun  phrases,  because  the  underlying  pre-penult  high  tone  is 
not  adjacent  to  the  inserted  high  tone  associated  with  the 
final  syllable  of  the  noun.  This  is  illustrated  by  the 
derivation  of  (31c) ,  below: 

(42)  Derivation  of  (31c) : 

eBitu:naurw  eBisito  ^heavy  onions' 


NPHIl 
MR 


[eBi  tu:nguru]N   [eBi  sito]modif 
onions  heavy 


H     H 
eBi  turnguru 
N/A 


eBi  sito 


H  H 

Vowel  I  I 

Coal.     eBi  turngurw  eBi  sito 

H  H 

RS  &         4^--.  f , 

Delink    eBi  tu:ngurw  eBi  sito 

SR        eBitu:ngurw  eBisito   ^heavy  onions' 


3.2.   OTHER  NOUN-MODIFIER  PHRASES 


So  far  I  have  only  given  examples  of  noun-adjective 
phrases,  but  other  types  of  noun-modifier  phrases  have  the 
same  tone  patterns.  An  additional  high  tone  is  found  on  the 
first  syllable  of  the  modifier  following  low  and  pre-penult 
high  nouns.   Penult-high  and  final-high  noun  phrases  have  no 
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additional  high  tone,  so  that  final-high  and  all-low  noun 
phrases  have  identical  tone  patterns.  This  is  illustrated 
in  the  noun-demonstrative,  noun-possessive,  noun-associative 
phrase  and  noun-relative  clause  phrases  with  each  of  the 
four  noun  types,  below  (the  underlying  forms  are  to  the  left 
of  the  arrow,  the  surface  forms  are  to  the  right  in  these 
and  all  following  data  in  this  paper) : 

(43)  Noun-Demonstrative  Phrases 

(a)  Low  Noun 

[li  siBa]^  [li  li:ya]modif  ~~>  lisiBa  lili:ya 
^that  well' 

(b)  Final-High  Noun 

[ji  n  swijjq  [ji  li:ya]n,odif  — >  jinswi  jilirya 
'those  fish' 

(c)  Penult-Hiah  Noun 

[otu  g6ye]N  [tu  li:ya]modif  "">  otugoye  tulirya 
'that  string' 

(d)  Pre-Penult-Hiah  Noun 

[emi   fi6roro]j^    [ji    li:ya]mQcjif   — >   emiftor6ro 
jilirya    'those  chains' 

(44)  Noun-Possessive  Phrases 

(a)  Low  Noun 

[omu  sa:ni]jj  [w  aolmodif  — >  omusarni  w^o   'your 
friend' 

(b)  Final-High  Noun 

[i  ngS]jj  [y  a6]jj,Q(^if  — >  inga  y^o   'your  cow' 

(c)  Penult-High  Noun 

[olw  imbolN  [Iw  aelijiodif  -">  olwimb6  Iwae   'his 
song' 

(d)  Pre-Penult  High  Noun 

[eci  k^rangiro]^  [ci  aoJn^o^if  — >  ecikar^ngiro  cao 
'your  frying  pan' 

(45)  Noun-Associative  Phrase  Phrases 

(a)  Low  Noun 

[ji   fiofiijj^    [ja     eci   Biral^Q^if   — >    jifiofti 
je   ciBira    'birds   of  the    forest' 

(b)  Final-High  Noun 

[ji  ngaljq  [j^  aBa  lefiln^o^if  ~~>  jinga  ja  Balefi 
'cows  of  the  shepherds' 

(c)  Penult-High  Noun 

[ji  m  begu]j^  [jci  ama  darinajj^Q^^jif  — >  jimbegu 
ja  madarina  'seeds  of  the  tangerines' 

(d)  Pre-Penult  High  Noun 

[omu  gurusi]^  [wa   eci  refu] j^q^^j^^  — >  omugurusi 
wecirefu   'old  man  with  a  beard' 
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(46)  Noun-Relative  Clause  Phrases 

(a)  Low  Noun 

[aBa  limi]iq  [aBo   caaB6na],^Q(jj^f  — >  aBalimy  aB6 
caaB6na   *the  farmers  whom  we  saw' 

(b)  Final-High  Noun 

[omu  Bu]j^  [ogwo  naaB6na]  j^jQ^^jif  — >  omuBw  ogw6 
naaB6na  'the  mosquito  that  I  saw' 

(c)  Penult-High  Noun 

[aka  g6ye]iq  [ako  naaB6na]modif  — >  akagoy  ^ko 
naaB6na   'the  strings  that  I  saw' 

(d)  Pre-Penult  High  Noun 

[emi  fi6roro]N  [ejo  naaB6na] jj,Q^if  — >  emifiordrw 
ej6  naaB6na   'the  chains  which  we  saw' 

The  derivation  of  the  tone  patterns  in  these  phrases 
would  be  analogous  to  that  of  the  noun-adjective  phrases 
discussed  in  the  preceding  section.  A  high  tone  is  inserted 
between  the  noun  and  its  modifier,  and  the  inserted  high 
tone  is  linked  to  the  final  syllable  of  the  noun.  In  all- 
low  and  pre-penult-high  noun  phrases,  the  inserted  high 
remains  and  spreads  one  syllable  rightward  onto  the  first 
syllable  (after  vowel  coalescence)  of  its  modifier.  The 
inserted  high  tone  is  deleted  from  final-high  and  penult- 
high  nouns  by  the  Twin  Sister  Principle  or  Meeussen's  Rule, 
respectively.  The  derivations  of  the  noun-modifier  phrases 
in  (43)  through  (46)  would  thus  proceed  exactly  like  those 
already  given  in  the  preceding  section  (3.1)  for  noun- 
adjective  phrases. 

Taking  a  closer  look  at  the  noun-demonstrative,  noun- 
possessive  and  noun-associative  phrases,  it  may  be  noted 
that  those  modifiers  all  have  consonant-initial  agreement 
prefixes,  unlike  (most)  adjectives  and  the  relative 
pronouns.  Vowel  coalescence  does  not  apply  to  those 
modifiers,  yet  the  inserted  high  tones  surface  one  syllable 
to  the  right  of  the  final  syllable  of  the  noun  just  as  they 
do  in  the  vowel-initial  modifiers.  In  the  next  section  I 
will  argue  that  the  tone  pattern  of  phrases  with  consonant- 
initial  modifiers  provides  strong  evidence  that  the  inserted 
high  tone  must  necessarily  be  associated  with  the  final 
syllable  of  the  noun  in  Jita,  and  not  arbitrarily  so,  as 
Massamba  (1982)  claimed  for  Ruri. 


4.   EVIDENCE  FOR  HIGH  ASSOCIATION  TO  THE  FINAL  SYLLABLE 
OF  THE  NOUN 

In  Massamba' s  (1982)  discussion  of  the  Ruri  noun  phrase 
data,  he  states  that  the  choice  of  whether  the  inserted 
accent  in  noun-modifier  phrases  is  associated  with  the  final 
syllable  of  the  noun  or  the  first  syllable  of  the  modifier 
is  a  purely  arbitrary  one,  since  they  collapse  into  a  single 
syllable  after  vowel  coalescence.  However,  I  will  argue  in 
this  section  that  the  inserted  high  tone  in  Jita  must  be 
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associated  with  the  final  syllable  of  the  noun:  associating 
it  with  the  first  syllable  of  the  modifier  would  predict 
incorrect  results.  In  fact,  noun-demonstrative  phrases  in 
Ruri  provide  evidence  that  high  tone  (or  accent)  association 
to  the  final  syllable  of  the  noun  is  to  be  preferred  in  that 
language  also. 

In  Jita,  an  analysis  associating  the  inserted  high  tone 
with  the  final  syllable  of  the  noun  has  two  main  arguments 
in  its  favor.  Firstly,  it  directly  captures  the 
generalization  that  the  tone  patterns  of  all-low  nouns  and 
final-high  nouns  are  neutralized  in  noun-modifier  phrases. 
If  the  inserted  high  is  associated  with  the  final  syllable 
of  the  noun,  then  the  representation  of  all-low  nouns  has 
become  identical  to  that  of  final-high  nouns  when  RS 
applies. 

Secondly,  not  all  modifiers  in  Ruri  and  Jita  take  an 
agreement  prefix  which  has  a  vowel  pre-prefix  to  coalesce 
with  the  final  vowel  of  the  noun.  Such  modifiers  are 
consonant-initial,  yet  the  inserted  high  tone  surfaces  one 
syllable  rightward  of  the  final  syllable  of  the  noun  just  as 
it  does  in  the  vowel-initial  modifiers.  Examples  of 
consonant-initial  modifiers  from  Ruri  which  trigger  accent 
insertion  are  found  in  noun-demonstrative  phrases  (see  (29) , 
above)  .  In  Jita,  not  only  demonstratives,  but  also  certain 
adjectives  and  quantifiers,  possessives  and  the  associative 
particle  take  consonant-initial  noun  class  agreement 
prefixes  (see  (43),  (44),  (45),  above).  If  the  inserted 
high  tone  were  associated  with  the  first  syllable  of  the 
modifier  in  these  phrases,  the  wrong  tone  patterns  would 
result  in  every  case.  This  is  illustrated  by  comparing  the 
correct  derivation  of  omukeka  gumutuku  'red  mat'  in  (47)  , 
below,  with  the  incorrect  derivation  in  (48)  which  results 
when  the  inserted  high  tone  is  associated  with  the  first 
syllable  of  the  modifier: 

(47)   High  Association  with  the  final  syllable  of  the 

noun: 
omu  keka  qu  mutuku  — >  omukeka  gumutuku   'red  mat' 

UR        [omu  keka  In    [gu  mutuku  l^Q^jif 
mat  red 

H 

NPHIl      omu  keka    gu  mutuku 
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Vowel 
Coal. 


N/A 


RS  & 

Delink 


SR 


H 
omu  keka    gu  mutuku 
omukeka  gumutuku 


(48)   High  Association  with  the  initial  syllable  of  the 

modifier: 
omu  keka  gu  mutuku  — >  omukeka  aumutuku   ^red  mat^ 


UR 


[omu  kekalN   [gu  mutuku] mo^if 
mat         red 


NPHIl 


Vowel 
Coal. 


omu  keka 


N/A 


gu  mutuku 


RS  & 
Delink 


SR 


H 
omu  keka    gu  mutuku 
* omukeka  gumutuku 


It  can  be  seen  in  (48)  that  when  the  inserted  high  tone  is 
linked  with  the  initial  syllable  of  the  modifier,  it 
surfaces  one  syllable  too  far  to  the  right. 

Phrases  of  penult-high  or  final-high  nouns  with 
consonant-initial  modifiers  do  not  have  an  inserted  high 
tone  on  the  surface,  just  like  those  with  vowel-initial 
modifiers  (compare  (43),  (44),  (45)  with  (46)).  Thus, 
Meeussen's  Rule  (MR)  and  the  Twin  Sister  Principle, 
respectively,  must  apply  in  these  phrases  to  delete  the 
inserted  high  tone.  However,  if  the  inserted  high  tone  were 
associated  with  the  first  syllable  of  the  modifier,  neither 
of  these  rules  could  apply  in  phrases  with  consonant-initial 
modifiers,  and  incorrect  derivations  would  result  as 
illustrated  in  (49)  and  (50),  below: 
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(49)   Final-High  Noun  Phrase,  High  Association  with  the 

initial  syllable  of  the  modifier: 
-ii  n  swi   ii  mutuku  — >  iinswi  iimutuku  'red  fish  (p1)  ' 


H 

1 

UR 

[ji  n  swi]N 
fish  (pi) 

[ji  mutuku] modi f 
red 

? 

H 

NPHIl 

ji 

n  swi 

ji  mutuku 

Twin 
Sister 

N/A 

MR 

N/A 

Vowel 
Coal. 

N/A 

RS  & 
Delink 

ji 

H 
n  swi 

H 
ji  mutuku 

*jinswi   jimutuku 


(50)   Penult-High  Noun  Phrase,  High  Association  with 

the  initial  syllable  of  the  modifier: 
Ii  kamba   Ii  mutuku  — >  likamb^  Iimutuku  'red  feather' 


UR 

[Ii  kambalN 
feather 

[Ii  mutuku Injodif 
red 

H 

1 

H 

NPHIl 

Ii 

1 
kamba 

li  mutuku 

Twin 
Sister 

N/A 

MR 

N/A 

Vowel 
Coal. 

N/A 

RS  & 
Delink 

Ii 

H 

kamba 

H 

li  mutuku 

SR 


*likamb^   Iimutuku 
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It  can  be  seen  in  both  (49)  and  (50)  that  linking  the 
inserted  high  tone  with  the  initial  syllable  of  the  modifier 
derives  one  more  high  tone  than  actually  surfaces. 

A  High  Insertion  rule  in  Jita  associating  the  inserted 
high  tone  with  the  first  syllable  of  the  modifier  would  thus 
have  to  apply  only  on  the  condition  that  the  modifier  is 
vowel-initial,  otherwise  association  is  with  the  final 
syllable  of  the  noun.  The  same  holds  true  for  Ruri. 
Demonstratives  are  consonant-initial  in  that  language,  yet 
Massamba  analyzed  noun-demonstrative  phrases  as  triggering 
insertion  of  an  accent  on  the  final  vowel  of  the  noun.  If 
the  accent  were  associated  with  the  first  vowel  of  the 
demonstrative  instead,  the  same  sort  of  incorrect  tone 
patterns  would  be  derived  as  for  Jita.  Since  a  rule  linking 
the  inserted  high  tone  or  accent  with  the  final  syllable  of 
the  noun  makes  correct  predictions  in  all  cases,  it  is 
clearly  to  be  preferred. 


5.   PROBLEMS  IN  DEFINING  THE  DOMAIN  OF  NOUN  PHRASE  HIGH 
INSERTION 

In  this  section  I  will  discuss  the  proper  formulation 
of  the  rule  of  Noun  Phrase  High  Insertion  (NPHIl)  ,  and 
especially  problems  encountered  in  specifying  its  domain. 
First,  I  will  justify  limiting  the  rule  to  within  the  noun 
phrase,  by  showing  that  no  high  tone  is  inserted  in  other 
phrasal  contexts.  Second,  I  will  show  that  "modifier"  must 
be  defined  in  such  a  way  as  to  distinguish  locative-NP 
modifiers,  which  do  not  trigger  NPHIl,  from  the  modifiers 
which  do.  Third,  I  will  show  that  while  a  high  tone  is 
always  inserted  between  a  noun  and  its  adjacent  (non- 
locative)  modifier,  it  may  also  occur  between  modifiers 
following  the  noun.  To  account  for  this,  two  possible 
revised  versions  of  NPHIl  are  proposed:  one  which  inserts  a 
high  tone  before  a  modifier  phrase  in  N' ' ,  and  another  which 
inserts  a  high  tone  between  any  two  daughters  of  N' '  . 
However,  since  only  the  first  of  these  revisions  of  NPHIl 
makes  correct  predictions  in  the  case  of  coordinate  NP's,  I 
conclude  that  it  is  necessary  to  refer  to  the  constituent, 
"N' '  modifier  phrase,"  in  order  to  formulate  the  rule  of 
NPHI  properly.  The  fact  that  noun  class  agreement  occurs  in 
the  same  domain  as  NPHI  supports  the  claim  that  N' '  modifier 
phrases  are  a  constituent  in  Jita. 
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5.1,    THE  NOUN  PHRASE  AS  THE  DOMAIN  FOR  HIGH  INSERTION 

The  rule  of  NPHIl  (32),  cited  again  below  for 
convenience: 

(32)   Noun  Phrase  High  Insertion,  version  1  (NPHIl) 

H 

6  — >  6  /  [[... ]n  [6...]nioclif]N" 

applies  in  a  very  restricted  domain,  namely,  when  a  noun  is 
followed  by  a  modifier  within  a  noun  phrase  (N' ' ) .  In 
section  3,  above,  I  showed  that  NPHIl  does  indeed  apply 
between  a  noun  and  any  modifier.  In  this  section  I  will 
justify  limiting  NPHIl  to  the  noun  phrase  by  showing  that  it 
does  not  occur  after  nouns  which  are  not  followed  by  a 
modifier  but  by  some  other  constituent,  nor  does  it  occur  in 
verb  phrases  between  verbs  and  their  arguments  or  modifiers. 

In  order  to  test  the  validity  of  limiting  NPHIl  to  the 
noun  phrase,  one  could  remove  that  condition  on  the  rule  and 
see  if  additional  high  tones  occur  in  other  possible 
domains.   I  tested  three  possibilities,  namely,  whether: 

(51a)  a  high  tone  is  inserted  after  every  noun,  not 
just  one  followed  by  a  modifier:  ^   — >  H  /  Ij^  

(51b)  a  high  tone  is  inserted  after  every  phrasal  head, 
not  just  nouns  (for  example,  after  verbs) :  0   — >  H  /  ]x  

(51c)  a  high  tone  is  inserted  before  every  modifier 
(for  example,  before  adverbial  or  prepositional  modifiers  of 
V):   ^  ->  H  /  [  Imodif- 

Additional  high  tones  are  not  found  in  any  of  these 
contexts,  however.  There  is  no  high  tone  after  a  nominal 
subject  or  between  the  indirect  object  and  direct  object  of 
a  verb,  as  (51a)  would  predict: 

(52a)  Subject ]n  Verb 

aBa  nu]jj  Baaftwa >  aBanu  Baaftwa  ^people  drank' 

people   drank 

(52b)  Indirect  ObjectJiq  Direct  Object 

cagusisya   omu  limilj^  omu  keka > 

we  sold    farmer     mat 

cagusisy  omulimi  omukeka  ^we  sold  a  farmer  a  mat' 

No  additional  high  tone  appears,   either,   after  a  verb 

followed  by  an  object,  as  (51b)  predicts  (but  (51c)  does 

not) ,  or  after  a  verb  followed  by  a  modifier  (as  both  (51b) 
and  (51c)  predict) : 


(53)  Verblx  Object 

a  lurBireJv  omu  sa:ni  wae  — >  alu:bire  omusa:ni  w^e 
he  followed   friend    his     'he  followed  his  friend' 

(Note  that  NPHIl  does  apply  between  omusa:ni  'friend'  and 
its  modifier,  wae  'his'.) 

(54)  Verb  [Modifier 

(a)  agenda  [Bwanguln^o^if 
he  ran   quickly 

— >  agenda  Bwangu  'he  ran  quickly' 

(b)  Bafula      [^^siJmodif  — ^   Bafula  kisi 
they  washed  well         'they  washed  well' 

(c)  afula     [kumugera]inodif  — -*■  afula  kumugera 

he  washed  at  river      'he  washed  at  the  river' 

(d)  caBcisakila     [muluto:ke][np(^j^f 
we-them  helped  in  banana  field 

— >  caBas^ila  muluto:ke  'we  helped  them  in  the 
banana  field' 

Thus,  limiting  high  insertion  to  within  the  noun  phrase 
is  necessary  to  predict  its  occurrence  correctly.  No  high 
tone  is  inserted  following  nouns  or  phrasal  heads,  or  before 
modifiers  in  other  phrasal  contexts. 


5.2.    THE  PROBLEM  OF  LOCATIVE-NP  MODIFIERS 

Although  I  have  claimed  until  now  that  NPHIl  applies 
between  a  noun  and  any  following  modifier,  there  is  actually 
one  exception:  locative  noun  phrases  which  functionally  are 
nominal  modifiers  do  not  trigger  NPHIl  as  shown  in  the 
examples  below: 

(55a)   [aBa  kokololjj   [kumuliro]  ^qc  ^P 
old  people     by  fire 
— >  aBakokolo  kumuliro    'old  people  by  the  fire' 

(55b)  [ec  e:yo]N  [[i:fiuma]N  [y^  omulyango)n,Qdif Uoc  NP 
broom       behind      of  door 
— >  ece:yo  i:ftuma  y  6mulyango  'broom  behind  door' 

(Note  that  NPHIl  applies  within  the  locative  noun  phrase, 
just  as  it  does  in  other  noun  phrases,  accounting  for  the 
additional  high  tone  in  (55b)  on  the  modifier  phrase 
modifying  i ; fluma  'behind'.) 

In  this  section  I  will  argue  that  the  locative  noun 
phrases  are  not  only  distinguished  from  the  other  modifiers 
by  the  fact  that  they  do  not  trigger  NPHIl.  They  are  also 
morphologically,      functionally,      syntactically     and 
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semantically  distinct  from  the  modifiers  which  trigger 
NPHIl,  justifying  that  they  should  be  structurally  distinct. 

The  first  difference  to  be  noted  between  locative 
modifiers  and  the  other  nominal  modifiers  is  that  locative 
modifiers  do  not  show  noun  class  agreement  with  the  nouns 
they  modify.  To  illustrate,  in  all  the  noun-modifier 
phrases  below  -  and  all  the  noun-modifier  phrases  cited  to 
now  -  which  do  trigger  NPHIl,  the  modifiers  have  class 
prefixes  showing  agreement  with  the  nouns  they  modify  (the 
agreement  prefixes  are  underlined) : 

(56)  Noun  +  non-locative  modifiers,  showing  agreement 
(a.l)  [aBa  kokolo]^  [aBa  jimalmodif 

old  people    good 
— >  aBakokolo  Bajima  'good  old  people' 

(a. 2)  [aBa  kokololjj  [Ba  li:ya]niodif 
old  people    those 
— >  aBakokolo  Bali:ya   'those  old  people' 

(a. 3)  [aBa  kokolo]jj  [Ba  ama  guru  ama  le: laJ^Q^ijj^f 
old  people    of  legs      long 
— >  aBakokolo  B^  maguru  m^lerla 
'old  people  with  long  legs' 

(b.l)  [ec  eeyo]N  [eci  yayalmodif 
broom       new 
— >  eceeyo  ciyciya   'new  broom' 

(b.2)  [ec  eeyolN  [caoJmodif  — >  eceeyo  c^o 

broom       your  'your  broom' 

(b.3)  [ec  eeyolN  [c  £     omugasiJnjodif 
broom       of  woman 
— >  eceeyo  c  6mugasi  'the  broom  of  the  woman' 

If  these  phrases  are  compared  with  those  in  (55) ,  above, 
containing  the  same  nouns,  it  can  be  seen  that  the  locative 
modifiers  do  not  agree  with  the  nouns.  Instead,  locative 
modifiers  are  invariable,  as  shown  below: 

(57)  Noun  +  Locative  NP 

(a)  'the  cloth  on  the  bed'  [inguBo]fj   [kucitaBo]  j^q,;,  jjp 

(b)  'the  book  on  the  bed'   [ecitaBo]j^  [kucitaBo]  j^q,-,  j^p 

(c)  'the  bag  on  the  bed'    fomufukoTt.;  [kucitaBo]  iqc  ^P 

Also  unlike  the  modifiers  which  do  agree  with  nouns, 
locative  noun  phrases  may  modify  verbs  as  well  as  nouns, 
with  no  change  in  form.  Only  the  position  of  the  locative 
will  generally  determine  which  element  it  modifies.  For 
example,  in  the  phrase  below,  the  locative  follows  and  is 
understood  to  modify  the  noun,  so  that  the  interpretation  of 
the  phrase  is,  roughly,  "The  friends  by  the  door  (now  are 
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the  ones)  whom  we  talked  to": 

(58)  aBasaini   kumulyango  aBo   caal6m^  n^B6 
friends  by  the  door  REL  we  tal)ced  to  them 
The  friends  by  the  door  whom  we  talked  to. 

By  contrast,  in  the  following  phrase,  the  locative  follows 
and  is  understood  to  modify  the  verb,  so  that  the 
interpretation  is,  "The  friends  whom  we  talked  to  (when  we 
were  all)  by  the  door": 

(59)  aBasa:ny  aB6  caaldm^  n^B6  kumulyango 
friends  REL  we  talked  to  them  by  the  door 
Friends  whom  we  talked  to  by  the  door. 

In  sum,  locative  modifiers  do  not  show  agreement  with  the 
nouns  they  modify,  and  -  probably  because  they  do  not  -  they 
may  modify  both  nouns  and  verbs.  These  qualities 
distinguish  them  morphologically  and  functionally  from  the 
nominal  modifiers  which  trigger  NPHIl . 

Syntactically,  too,  locative  phrases  may  be 
distinguished  from  the  other  nominal  modifiers.  While  order 
among  the  other  modifiers  is  relatively  free  (except  that 
relative  clauses  always  follow  the  others) ,  locative  noun 
phrases  may  not  be  freely  ordered.  Instead,  they  must 
follow  all  other  modifiers  except  relative  clauses  (as  in 
(58),  above).  The  examples  below  show  that  an  adjective, 
for  example,  may  precede  or  follow  possessives, 
demonstratives  and  associative  phrases  -  either  order  is 
acceptable.  (Note  that  NPHIl  has  applied  in  some  cases  not 
only  between  the  noun  and  its  adjacent  modifier,  but  also 
between  the  two  modifiers.  This  problem  will  be  discussed 
in  the  next  section.): 

(60)  Noun-Adiective-Possessive  (adjective  underlined) 
^your  heavy  frying  pan' 

(a)  eci  k£ra:ngiro  eci  sito  c  ao  > 

frying  pan    heavy     your 
ecikarcingirw  ecisito  c£6 

OR 

(b)  eci  k^ra:ngiro  c  a6  eci  sito  > 

frying  pan    your  heavy 
ecikar^ngiro  c^w  ecisito 
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(61)  Noun-Adiective-Demonstrative  (adj.  underlined) 
^those  fierce  leopards' 

(a)  ji  n  dara   ii  n  duru   ji  li:ya  > 

leopards   fierce     those 
jindara  iinduru  jilirya 

OR 

(b)  ji  n  dara   ji  li:ya   ii  n  duru  > 

leopards   those     fierce 
jindara  jilirya  iinduru 

(62)  Noun-Adiective-Associative  Phr.  (adj.  underlined) 
'the  old  suitcase  of  the  teacher' 

(a)  Ii  sandikwa   Ii  kokolo   ly  ^  omw  alimu  > 

suitcase       old       of   teacher 
lisandikwa  likokolo  ly  6mwJilimCl 

OR 

(b)  Ii  sandikwa   ly  a  omw  alimu   Ii  kokolo  > 

suitcase     of     teacher   old 
lisandikwa  ly  6mwalimu  likokolo 

However,  it  is  not  possible  to  freely  order  an  adjective 
with  respect  to  a  locative  noun  phrase.  The  adjective  must 
precede  the  locative  or  the  phrase  is  unacceptable,  as  shown 
below  (adjective  underlined) : 

(63)  *the  young  people  by  the  door' 

(a)  aBa  nu  aBa  lela  ku  mu  lyango  > 

people   young     by  door 

aBanw  aBalela  kumulyango 
BUT 

(b)  *  aBa  nu  ku  mu  lyango  aBa  lela 

(64)  'the  heavy  stones  by  the  river' 

(a)  ama  Bui   ama  sito  ku  mu  gera  > 

stones   heavy     by  river 
amaBuy  am^sito  kumugera 

BUT 

(b)  *  ama  Bui  ku  mu  gera  ama  sito 

Finally,  locative  modifiers  do  not  have  the  same 
specific,  contrastive  meaning  as  the  other  modifiers.  This 
can  be  shown  by  comparing  the  interpretation  of  a  noun 
phrase  containing  a  locative  modifier  with  that  of  a  near- 
identical  phrase  in  which  the  locative  is  contained  within 
an  associative  modifier  phrase: 

(65)  Noun  +  Locative  NP  Modifier 
'old  people  by  the  fire' 

[aBa  kokolo] j^   [ku  mu  lirol^Q^  jjp 
— >  aBakokolo  kumuliro 
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(66)  Noun  +  Associative  Modifier  Phrase 
*old  people  by  the  fire' 
[aBa  kokoloJM   [B  £     ku  mu  lirolj^odif 


aBakokolo  Bakumuliro 

Both  phrases  may  be  translated  as  ^old  people  by  the  fire,' 
but  the  phrase  with  the  locative  modifier  (65)  has  the  non- 
restrictive  interpretation,  "Old  people  who  happen  to  be  by 
the  fire"  (these  interpretations  are  in  no  way  to  be  taken 
as  postulated  deep  structures)  ,  while  the  phrase  with  the 
associative  modifier  (66)  has  the  restrictive 
interpretation,  "Among  the  old  people  here  and  there,  the 
ones  specifically  by  the  fire  as  opposed  to  somewhere  else." 

Thus,  the  fact  that  the  locative  modifier  does  not 
trigger  NPHIl  is  not  exceptional,  since  the  locative 
modifier  is  also  clearly  distinguishable  morphologically, 
functionally,  syntactically  and  semantically  from  the 
nominal  modifiers  which  do  trigger  NPHIl.  I  suggest,  then, 
that  NPHIl  is  still  correctly  characterized  as  applying 
within  NP  (or  N' ' ) ,  and  that  locative  modifiers  are  outside 
that  domain  -  for  example,  daughters  of  a  higher  structure 
such  as  N' ' ' : 

(67) 


Locative  NP 


Modifier 


5.3.   THE  PROBLEM  OF  HIGH  INSERTION  BETWEEN  MODIFIERS 

The  rule  of  NPHIl  is  formulated  so  as  to  insert  a  high 
tone  only  on  the  final  syllable  of  a  noun  when  it  is 
followed  by  a  modifier  within  N' '  .  As  has  already  been 
noted  (see  (60a),  (61a)  and  (62a),  above),  however,  in  a 
noun  phrase  containing  more  than  one  modifier,  a  high  tone 
may  be  inserted  not  only  between  the  noun  and  adjacent 
modifier,  but  also  between  the  modifiers.  Additional 
examples  of  this  are  given  below: 

(68)  Noun  Phrases  with  more  than  one  modifier: 

(a)  [eBi  taBolN  [eBi  sitol^jodif  t®^^  ^o^°lolmodif  ""> 

books  heavy  old 

eBitaBw  eBisito  Bfkokolo   'old  heavy  books' 

(b)  [oBu  jilfj  [Bu  mutukulinQdif  l^w  a6]jnodif  ~"> 

thread         red  your 

oBuji  Bumutuku  Bw^o       'your  red  thread' 
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(c)  [li  twangirolN  [li  sitolnjodif  Hi  li:ya]modif  ""> 

mortar  heavy ^         that 

litwangiro  lisito  lili:ya   ^that  heavy  mortar' 

(d)  [i  n  sikirilN  [i  ng  kokololmodif  tv^  °"^"  li^^ilmodif 

donkey  old  of   farmer 

— >  insikiri  ingkokolo  y  6mulimi 
'old  donkey  of  the  farmer' 

(e)  [eBi  taBol^  [eBi  sitolmodif  t^By  o  By^:gwa]modif 

books  heavy  REL   CI  8-fell 

— >  eBitaBw  eBisitw  eBy6  Byargwa 
'heavy  books  which  fell' 

It  should  be  mentioned  that,  while  a  high  tone  is  always 
inserted  between  the  noun  and  first  modifier,  it  is  not 
consistently  inserted  between  modifiers.  For  example,  it 
seems  to  be  more  likely  to  occur  in  questions  or  when  the 
second  modifier  is  emphasized  or  considered  more 
contrastive.  Also,  it  is  more  likely  to  be  inserted  between 
an  adjective  and  another  modifier  (possessive,  demonstrative 
or  associative)  when  the  adjective  precedes  the  other 
modifier.  When  the  adjective  follows,  a  high  tone  is  rarely 
inserted  (except  in  questions  or  for  emphasis,  as  already 
mentioned) . 

These  problems  aside,  it  is  clear  that  the  rule  of 
NPHIl,  which  only  inserts  a  high  tone  between  a  noun  and  a 
modifier,  does  not  account  for  those  cases  when  high  tones 
are  also  inserted  between  modifiers.  One  possible  revision 
of  NPHIl  would  be  to  delete  reference  to  the  noun  but  retain 
reference  to  a  modifier  phrase  in  the  structural  description 
of  the  rule.  This  version  of  NPHI  could  be  stated  as 
follows: 

(69)  Noun  Phrase  High  Insertion  (Pre-Modifier) 

Within  N' ' ,  associate  a  high  tone  with  the 
last  syllable  of  any  word  which  immediately 
precedes  a  N' '  modifier  phrase  (MP).  (Obligatory 
if  the  word  preceding  MP  is  a  noun;  optional  if 
the  word  is  a  member  of  another  MP.) 

Alternatively,  since  both  head  nouns  and  all  the  modifiers 
which  trigger  NPHIl  are  daughters  of  N' '  (locative-noun 
phrase  modifiers  are  daughters  of  N' ' ' ) ,  another  version  of 
the  rule  which  would  account  for  high  insertion  in  all  the 
cases  discussed  so  far  could  be  formulated  as  follows: 
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(70)   Noun  Phrase  High  Insertion  (Inter-Daughter) 

Associate  a  high  tone  with  the  final  syllable 
of  a  constituent  X  when  it  is  followed  by  another 
(non-null)  constituent  Y,  if  both  X  and  Y  are 
daughters  of  N' '  .  (Obligatory  if  X  is  a  noun; 
optional  if  X  is  a  modifier  phrase.) 

These  two  rules  do  not  ma)ce  the  same  predictions, 
however.  The  rule  of  Pre-Modifier  (69)  predicts  that  a  high 
tone  is  inserted  only  before  a  modifier  phrase  in  N' '  .  By 
contrast,  the  rule  of  Inter-Daughter  (70)  predicts  that  a 
high  tone  will  be  inserted  between  any  two  daughters  of  N"  , 
whether  or  not  either  is  a  modifier  phrase.  In  the  next 
section  I  will  argue  that  high  tone  insertion  in  coordinate 
noun  phrases  provides  evidence  that  NPHI  must  be  limited  to 
applying  before  a  modifier  phrase,  as  in  (69) .  Inserting  a 
high  tone  between  all  daughters  of  N' ' ,  as  in  (70),  gives 
incorrect  results. 


5.4.  THE  PROBLEM  OF  HIGH  INSERTION  IN  COORDINATE  NOUN 
PHRASES 

Since  coordinate  N' '  phrases  and  the  conjunction  which 
links  them  are  daughters  of  N' '  but  not  modifier  phrases, 
this  type  of  construction  will  allow  us  to  test  which 
version  of  the  rule  of  NPHI,  Pre-Modifier  (69)  or  Inter- 
Daughter  (70),  best  accounts  for  the  Jita  data.  Given  the 
coordinate  noun  phrase  structure  below  (n4  is  the 
conjunction,  *and' ) : 


(71) 


Modifier2 


Pre-Modifier  (69)  predicts  that  only  two  high  tones  will  be 
inserted:  one  before  Modifierj^  and  the  other  before 
Modifier2.  Inter-Daughter  (70),  on  the  other  hand,  predicts 
that  four  high  tones  will  be  inserted:  one  between  each  of 
the  terminal  elements  in  the  above  structure,  since  they  are 
all  daughters  of  N"  . 


The  phrases  below  show  that  only  two  high  tones  are, 
fact,  inserted  -  one  before  each  modifier  phrase  (MP) : 
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(72)  Coordinate  N"  Phrases 
(a) 

[[omukeka  [gumutukuJ^plN"  n^  [eBitaBo  [eBisitoli^lu' '  In" 
mat       red  and  books   heavy 

>  omukeka  gumutuku  n  eBitaBw  eBisito 

'red  mats  and  heavy  books' 

(b) 
[[jinjofu  [jilela]Mp]N"  n4  [jintwiga  [  jilfe:  lal^plu"  ]n" 
elephants  young       and   giraffes  tall 

>  jinjofu  jilela  na  jintwiga  jile:la 

'young  elephants  and  tall  giraffes' 

(c)  ^  ,    . 

[  [amadarisya  [mwendajj^jp]  jj/ /  na  [emilyango  [enajj^jp]  j^/ /  ]  jj/ » 
windows    nine  and  doors     four 

>  amadarisya  mwenda  n  emilyangw  ena 

'nine  windows  and  four  doors' 

(d)  ^ 

[  [amayarage    [gekisilj^] N' '    na    [jinumbu      [  jekisili^jplj^/ /  ]  jj/ / 
beans  tasty  and  sw.   potatoes     tasty 

>  amayarage  gekisi  na  jinumbu  jekisi 

'tasty  beans  and  tasty  sweet  potatoes' 

Thus,  the  prediction  made  by  rule  (69)  that  high  tones 
are  inserted  only  before  the  modifier  phrases  in  coordinate 
N' '  phrases  is  correct.  This  is  illustrated  in  the 
derivation  of  (72a) ,  below: 

(73)  Derivation  of  (72a)  with  Rule  (69),  Pre-Modifier 

H 

UR  [omukeka  [gumutuku] j^^]  jj/ /  na  [eBitaBo  [eBisitolf^JN"  In" 
mats      red  and   books   heavy 

H  H        H 


Rule 
(69) 

omukeka 

H 
1 

gumutuku 

na  eBitaBo 

H        H 
1 

eBisito 

na-HA 

1 
omukeka 

gumutuku 

na  eBitaBo 

eBisito 

Vowel 
Coal. 

H 

omukeka 

gumutuku 

H 

n  eBitaBw 

f 

eBisito 
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H 

H        H 

RS  & 

f^« 

eBitaBw  eBisito 

Delink 

omukeka  gumutuku 

n 

SR        omukeka  gumutuku  n  eBitaBw  eBisito 
^red  mats  and  heavy  books' 

(Note  that  the  conjunction,  n4,  contributes  a  high  tone,  but 
this  high  tone  surfaces  two  syllables  rightward  of  the 
conjunction  if  the  following  word  is  consonant-initial. 
This  is  because  the  high  tone  is  not  linked  to  the 
conjunction  underlyingly .  Rather,  it  links  to  the  next 
syllable  rightward,  before  vowel  coalescence,  by  a  special 
association  rule  "na-HA"  then  undergoes  RS  and  Delink,  as 
illustrated  in  derivation  (73).) 

The  rule  of  NPHI  is  therefore  best  formulated  as  in 
(69),  specifying  that  a  high  tone  is  only  inserted  before  a 
N' '  modifier  phrase  rather  than  between  any  two  daughters  of 
N' '  .  It  should  also  be  noted  that  the  coordinate  N' ' 
phrases  do  not  show  noun  class  agreement  with  each  other  nor 
does  the  conjunction  n4  agree  with  either  noun.  This 
confirms  the  observation  made  in  the  discussion  of  locative- 
noun  phrase  modifiers,  that  where  NPHI  does  not  occur, 
agreement  does  not  occur,  either. 


6.    CONCLUSION 

In  this  paper  I  have  argued  that  the  additional  high 
tones  found  in  Jita  noun-modifier  phrases  may  be  best 
accounted  for  by  a  rule  (NPHI)  which  inserts  a  high  tone  on 
the  final  syllable  of  any  word  preceding  a  modifier  phrase 
within  N' '  .  Alternative  formulations  of  the  rule  which 
expand  its  domain  outside  of  N' '  or  which  do  not  distinguish 
modifier  phrases  from  other  daughters  of  N' '  were  considered 
and  shown  to  predict  high  tones  would  be  inserted  in 
contexts  where  they  do  not,  in  fact,  occur. 

Independent  motivation  for  "N"  modifier  phrase"  as  a 
constituent  in  Jita  comes  from  the  fact  that  N' '  modifier 
phrases  are  also  the  domain  for  noun  class  agreement. 
Nominal  modifiers  outside  of  N' ' ,  such  as  locative-noun 
phrases,  neither  show  agreement  nor  trigger  NPHI.  Daughters 
of  N' '  which  are  not  modifier  phrases,  like  coordinate  N' ' 
phrases  and  the  conjunction  which  links  them,  likewise 
neither  show  agreement  (i.e.,  the  N' '  phrases  do  not  agree 
with  each  other  and  the  conjunction  does  not  agree  with 
either  N"  phrase,  though  there  is  agreement  within  each 
conjoined  N' '  phrase),  nor  do  they  trigger  NPHI.  I  suggest, 
then,  that  the  constituent  "N'  '  modifier  phrase"  should  be 
included  among  the  constituents  to  which  phrasal  rules  may 
be  sensitive,  since  only  a  rule  of  NPHI  which  may  refer  to 
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"N' '  modifier  phrase"  as  the  trigger  for  high  tone  insertion 
correctly  accounts  for  all  the  Jita  data. 
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EXPLORING  THE  DICTIONARY:  ON  TEACHING  FOREIGN  LEARNERS 
OF  ARABIC  TO  USE  THE  ARABIC-ENGLISH  DICTIONARY 

Dr.  Omar  Irshied  &  Dr.  Peter  Whelan 
Mu'tah  University 

This  paper  explores  some  of  the  special  problems  students  of 
the  Arabic  language  face  in  learning  to  use  the  Arabic-English 
dictionary  efficiently, 


Reading  is  an  activity  of  great  complexity,  and  never  more  so  than  for 
the  adult  learner  of  a  foreign  language  who  wishes  to  cope  with  texts  at,  or 
even  somewhat  beyond,  his  or  her  ability  to  understand  the  language.  In  the 
case  of  Arabic,  such  learners  might  be  undergraduates  following  a 
university  course  in  Arabic,  businessmen  concerned  with  commerce  with  the 
Arab  world,  scholars  interested  in  non-linguistic  aspects  of  the  (Middle  East, 
linguists  attracted  by  the  complexities  of  Arabic  morphology,  or  non-Arab 
Muslims  concerned  to  deepen  their  knowledge  of  the  Koran. 

The  primary  aid  for  this  diverse  group  of  learners  is  the  bilingual 
dictionary,  which  enables  the  learner  to  grasp,  more  or  less  approximately, 
the  meanings  of  those  words  not  yet  part  of  the  passive  vocabulary.  In 
dealing  with  a  difficult  text  the  learner  will  often  find  himself  juggling 
with  several  unknown  words  in  the  same  sentence  or  even  the  same  clause, 
searching  for  the  most  appropriate  combination  of  the  several  meanings  of 
each  one.  Dictionary  skills  are  therefore  at  a  premium  in  this  activity. 

Arabic  has  an  Immensely  rich  vocabulary,  and  for  the  foreign  learner 
of  Arabic  the  task  is  further  complicated  by  the  fact  that  the  graphological 
representation  of  the  language  in  a  normal  text  contains  no  Indication  of  the 
vowels,  but  only  of  the  consonants.  Thus  using  a  dictionary  In  Arabic  is  a 
more  demanding  activity  than  in,  say,  English,  where  all  that  is  required  to 
find  any  word  is  a  knowledge  of  the  alphabet. '  In  this  paper  we  Intend  to 
examine  briefly  the  graphological  representation  of  unvowelled  Arabic,  and 
suggest  a  method  by  which  students  can  be  prepared  most  efficiently  to 
read  it.  In  particular,  we  shall  concentrate  on  the  use  of  the  Arabic-English 
dictionary  and  offer  some  exercises  to  practise  the  students  in  the  skills 
required  to  use  it  effectively. 
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We  have  not  been  able  to  discover  much  in  the  way  of  previous  writing 
on  the  subject  of  teaching  the  use  of  the  Arabic  dictionary.  In  Arabic, 
Muhammad  Muhiy  al-Din  Abd  al-Hamld's  Al-Masaallk  al-9arabiuuah  (5th  ed. 
1966)  mentions,  in  its  first  volume,  the  teaching  of  the  alphabet  in  its 
isolated  and  combining  forms,  and  the  necessity  of  finding  words  according 
to  their  roots.  It  also  deals  briefly  with  the  extra  problems  posed  by  weak 
radicals,  but  his  book  is  written  mainly  with  native  speakers  of  Arabic  in 
mind.  His  lack  of  diagrammatic  systematization  and  the  fact  that  non- 
native  speakers  are  given  only  a  brief  mention  detracts  from  the  usefulness 
of  his  work  for  our  practical  purpose.  Ahmad  Abd  al  Ghafur  Attar's 
Muoaddimat  al-sahhah  (3rd  ed,  1984),  is  chiefly  a  history  of  lexicography, 
and  again,  the  problems  of  teaching  the  use  of  Arabic  dictionaries  to  foreign 
learners  of  Arabic  is  merely  glanced  at. 

The  text  currently  most  widely  used  for  teaching  Arabic  to  foreigners 
is  the  two-volume  Elementary  Modern  Standard  Arabic  (rev,  ed.  1983),  by 
Abboud  and  McCarus.  This  devotes  a  few  pages  to  the  use  of  the  dictionary, 
but  though  it  offers  a  very  brief  explanation  of  the  system  of  listing  words 
according  to  the  roots  and  a  small  amount  of  dictionary  practice,  the 
student  is  left  very  much  to  his  or  her  own  devices  in  getting  to  grips  with 
thecomplexities.2 

Rather  more  work  has  been  done  on  the  teaching  of  dictionary  skills 
to  foreign  and  native  learners  of  other  languages  than  Arabic.  Richard  C. 
Yorkey's  Study  Skills  for  Students  of  English  ( 1 982),  for  example,  has  a 
useful  chapter  (Chap.  2)  on  "Using  an  English  Dictionary,"  but  the  particular 
combination  of  Arabic  morphology  and  graphology  renders  the  acquisition  of 
dictionary  skills  in  Arabic  a  unique  task,  and  one  which  deserves  separate 
consideration.  In  this  paper  we  shall  not  be  attempting  to  give 
comprehensive  answers  to  the  problems  involved,  but  to  try  to  define  what 
these  problems  are  and  to  suggest  some  kinds  of  exercise  which  will  help 
the  student. 

A  complete  account  of  Arabic  morphology  Is  clearly  beyond  our  scope 
here,  but  a  brief  introduction  will  be  useful  for  readers  who  are  not 
specialists  in  Arabic.  Arabic  grammarians  have  traditionally  analyzed  the 
verbal  morphology  of  Arabic  in  terms  of  a  triliteral  or,  more  rarely, 
quadriliteral  root  and  its  "derived  forms,"  which  number,  for  modem 
practical  purposes,  twelve.  Each  derived  form  is  Itself  the  source  of  a 
number  of  other  nominal  and  adjectival  forms.  Thus,  the  root  k-t-b  Is,  In  Its 
simplest  lexical  form. 
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kataba  —  he  wrote. 

From  this  are  derived  the  following  words,  among  many  others: 

active  participle:  kaatib  —  a  writer,  writing, 
passive  participle:  maktuub  —  written,  a  letter. 

Note  that  according  to  context  these  may  have  a  verbal  or  nominal  meaning. 
These  three  words  derive  directly  from  the  root  in  its  first  form,  kataba. 
but  a  modem  standard  Arabic  tri literal  verb  root  may  have  up  to  ten  derived 
forms.  Not  all  of  these  exist  for  the  verb  kataba.  but  theoretically  they 
could.  The  following  is  a  list  of  the  ten  forms;  those  forms  which  are  not  in 
fact  used  are  marked  with  an  asterisk. 

I  kataba 

II  kattaba 

III  kaataba 

IV  aktaba 
*^V  takattaba 

VI  takaataba 

VII  inkataba 

VIII  iktataba 
^IX  iktabba 

X  istaktaba 

Each  of  these  forms  has  a  meaning  related  with  a  degree  of 
predictability  to  the  meaning  of  the  root.  For  example,  the  second  form 
generally  has  a  causative  meaning;  thus  kattaba  means  "to  make  someone 
write  something."  The  third  form  often  contains  the  notion  of  sharing;  thus 
kaataba  means  "to  correspond  with."  The  fifth  form  is  takaataba.  which  has 
a  reflexive  meaning  closely  related  to  that  of  the  third  form  —  to 
correspond  (mutually).  Each  form  also  has  its  derivative  active  and  passive 
participles:  for  example  the  active  and  passive  participles  of  the  third  form 
are  mukaatib  and  mukaatab.  respectively. 

Purely  alphabetical  dictionaries  of  Arabic  have  occasionally  been 
advocated— for  example,  Muhammad  al-Bukhari  al-Misri  proposed  such  a 
recension  of  Lisaan  al  Qarab.^  MunirBa^albaki  also  mentions  these  issues  in 
his  introduction  to  Al  Mawrid.^  Nevertheless,  modem  lexicographers  of 
Arabic  have  generally  based  their  dictionaries  on  the  extraordinarily 
systematic  morphological  structure.  Thus,  despite  the  fact  that  they  begin 
with  different  letters  of  the  alphabet,  katab.  takaatab.  and  mutakaatib  and 
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SO  on  are  all  to  be  found  in  the  same  place  In  the  dictionary  under  the 
heading  of  the  root,  k-t-b.  Thus  the  vital  piece  of  knowledge  for  the  would- 
be  user  of  an  Arabic  dictionary  is  the  root  of  the  word  he  or  she  wishes  to 
look  up. 

In  actual  fact,  a  dictionary  ordered  according  to  root-consonants  is 
not  such  an  unnatural  demand  on  the  L2  learner  as  it  is  on  the  native 
speaker,  for,  according  to  the  system  in  most  textbooks,  the  L2  learner 
learns  first  the  root  and  then  the  derived  forms,  whereas  the  native  speaker 
has  learned  the  derived  forms  before  becoming  conscious  of  the  system,  and 
does  not  naturally  associate  them  in  his  or  her  mind,  any  more  than  a  native 
speaker  of  English  is  naturally  aware  of  the  etymological  relationships 
among  the  words  in  daily  use.  Diane  Larsen-Freeman  has  shown  that  it  is 
actually  easier  for  the  second-language  learner  to  build  vocabulary  by 
learning  morphemes  than  by  learning  discrete  lexical  items.5  (Mastering  the 
derived  forms  remains,  however,  a  difficult  but  essential  step  for  any 
earner  of  Arabic  who  wishes  to  use  a  dictionary;  that  is,  any  learner  who 
wishes  to  go  beyond  more  "kitchen-Arabic." 

The  system  will  become  clearer  if  we  examine  an  actual  entry 
(considerably  abridged)  from  the  Wehr-Cowan  Dictionary  of  Modem  Written 
Arabic.  The  root  k-t-b  will  again  be  convenient.  Wehr does  not  make 
explicit  the  relationship  between  the  ten  forms  and  their  inflexional 
derivatives,  so  this  has  been  added  in  brackets  for  forms  other  than  the 
first. 

kataba  to  write  II  to  make  (s.o.)  write  (s.th.)  Ill  to  correspond 
(with  s.o.)  IV  to  dictate  (to  s.o.  s.th.),  to  make  (s.o.)  write  (s.th.) 
VI  to  write  to  each  other,  keep  up  a  correspondence  VII  (s.o.)  to 
write  (s.th.) 

kitabpl.      *-i — ^     kutub    piece  of  writing,  letter,  note,  book 

kutubTpl.  -ya    bookseller 

vL-^^    kuttab  Dl.    V— *=t5    k^tlil^  Koran  school  (lowest  elementary 

school) 


kutauyib  booklet 
'*<*-=^     kitaba  (act  or  practice  of)  writing,  script,  poster 
vr-*^-^    kitabT  written,  literary,  scriptural 
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'*t-^    katiba  pl.        >-s*-=^    katalb  document 

'■'■'■  ^'     maktabpl.    v— ='-^     makatib  office,  desk 

**= — ^    maktaba  dI.  -at       ^^-^   makatib    library,  bookstore,  desk 

-.1 — -S^  miktab   typewriter 

mukataba  (III)  correspondence 

iktitab  (VIII)  registration 

istiktab  (X)  dictation 

^" — ^   katibpl.  -On.        w' ^    kuttab  secretary,  typist,  author 

^J — =^    maktPb  written,  recorded,  fated;  message,  letter 

»^i L  mukatib  (III)  correspondent,  reporter 

^r^ — ^   muktatib(VIII)  subscriber 

We  see  her^  that,  though  k-t-b  has  a  majority  of  the  derived  verb  forms  as 
active  members  of  the  Arabic  lexicon,  not  all  possible  forms  are 
represented.  Other  verbs  exist  in  only  a  single  form.    (For  example,  the  root 
s-k-9  exists  lexically  only  in  the  fifth  derived  form,  tasakka9a— to  giepe 
about.)  Similarly,  the  nominal  and  adjectival  measures  deriving  frem  the 
veit)al  forms  by  no  means  exhaust  the  theoretically  possible  words.  But  we 
note  also  that  the  dictionary  presumes  a  knowledge  of  all  available 
measures,  as  the  forms  of  the  Arabic  verb  are  indicated  only  by  a  number- 
Ill,  VI,  etc.— followed  by  English  equivalents,  and  the  nominal  or  adjectival 
Inflexional  measures  are  only  given  separate  entries  in  the  dictionary  when 
they  have  some  specialized  meaning  other  than  that  Implied  by  Its 
morphological  relationship  with  the  verbal  form.  Istiktaab.  for  example,  the 
verbal  noun  (masdar)  of  the  tenth  form,  Is  the  normal  term  for  "dictation", 
and  thus  merits  an  entry  of  Its  own,  without,  however,  any  explicit 
Indication  that  It  related  to  the  then  derived  form.  Mukattib.  however, 
exists  only  as  a  derivative  of  the  second  form,  kattaba— to  make  someone 
write  something— and  thus  Is  not  represented  separately  in  the  dictionary. 

If  this  were  not  sufficiently  complex,  we  have  now  to  consider  those 
varieties  of  roots  which  differ  from  the  standard  trillteral  root  described 
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above.  When  one  or  more  of  the  root  letters  Is  w,  y,  or  ?  (so-called  weak, 
hollow,  and  hamzated  verbs),  or  when  the  second  and  third  letters  are  the 
same,  further  complications,  morphological  and  orthographical,  ensue.6  For 
example,  the  Imperative  (masculine  singular)  of  kataba  Is  uktub.  but  the 
imperative  of  waaafa  —  to  stop  —  is  gjL  as  the  weak  first  radical  Is 
dropped  In  this  form.  The  first  verfeal  form  of  the  root  k-w-n  is  kaana.  and 
the  imperative  is  kyn;  in  the  perfect  tense  the  weak  radical  causes  the 
lengthening  of  the  first  vowel,  and  in  the  imperative  the  weak  radical  is 
again  dropped.  In  order  to  be  able  to  look  the  word  up  in  a  dictionary, 
therefore,  the  learner  must  already  be  familiar  with  the  morphology  of  that 
kind  of  verb.  Even  then,  a  form  like  Istlfaadah.  while  obviously  a  verbal 
noun  of  the  tenth  form  with  a  weak  middle  radical,  provides  no  clue  as  to 
whether  the  middle  radical  is  w  or  y  (f-w-d  or  f-y-d),  so  the  student  may 
have  to  look  the  word  up  twice.  An  imperative  with  only  two  letters,  say, 
z-r,  might  theoretically  derive  from  the  roots  ?-z-r,  w-z-r,  y-z-r,  z-w-r, 
z-y-r,  z-r-w,  orz-r-y,  so  a  good  deal  of  patience  might  be  necessary  to 
track  it  down. 

How,  then,  do  we  teach  the  beginner,  or  pre-intermedlate  learner  of 
Arabic  to  use  the  dictionary?  Assuming,  in  the  first  place,  that  such  a 
learner  knows  the  alphabet  by  heart,  he  or  she  should  perhaps  begin  In  the 
rather  ad  hoc  fashion  adopted  by  most  textbooks.  But  as  the  vocabulary  Is 
introduced  It  should  be  given  in  lists  ordered  according  to  the  principles  of 
Arabic  lexicography,  rather  than  simply  alphabetically.  Thus  mudarrls 
(teacher)  should  be  placed  immediately  after  darasa  (to  study,  learn),  and 
before  iMll  (blackboard),  even  though,  as  in  the  Roman  alphabet,  Arabic  i 
comes  before  m.  As  weak  and  other  difficult  radicals  are  encountered  the 
student  should  be  taught  the  principles  of  looking  them  up  in  a  dictionary, 
and  frequent  exercises  given  in  sorting  a  jumbled  list  of  words  Into  order 
according  to  the  roots.  Any  ambiguities  (see  z-r  above)  should  be  pointed 
out  at  this  stage,  so  that  the  student  is  not  confused  either  as  to  the 
morphology  of  the  words  involved  or  as  to  his  or  her  own  competence  at 
using  a  dictionary. 

II 

In  the  second  section  of  our  paper  we  shall  suggest  a  number  of 
exercises  in  dictionary  skills  for  beginning  and  Intermediate  students.  But 
before  doing  so  we  must  deal  with  the  question,  why  teach  beginners  to  use 
the  dictionary  at  all?  Why  not  wait  until  the  student  has  a  grasp  of  the 
whole  of  the  morphology  before  requiring  him  or  her  to  do  more  than  learn 
word  lists?  In  fact,  we  believe  In  the  importance  of  dictionary  skills  from 
the  earliest  stages  of  learr^lng  Arabic.  This  must  be  the  case  always,  we 
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feel,  with  adult  learners.  Children  should  certainly  be  taught  a  language  In 
the  communicative  context,  with  the  additional  incentive  of  simplified 
readers,  and  thus  the  formalities  of  grammar  and  the  use  of  dictionaries 
may  perhaps  be  put  off  for  two  or  three  years.  But  adult  learners  can 
neither  afforxj  nor  do  they  wish  to  spend  so  much  time  before  coming  to 
grips  with  genuine  unadulterated  language  such  as  is  found  in  literature, 
newspapers,  or  in  broadcasts.  With  Arabic,  far  more  than  with  a  European 
language,  this  will  necessitate  a  lar^e  amount  of  earnest  work  with 
dictionaries  even  to  get  the  gist  of  what  is  written.  It  Is  well  worth  while, 
therefore,  to  spend  time  preparing  the  student  to  perform  these  frequently 
repeated  operations  as  efficiently  as  possible. 

There  Is  of  course,  a  caveat  to  be  uttered  here.  It  Is  not  desirable 
that  the  student  be  practiced  in  the  use  of  dictionaries  to  the  detriment  of 
other  reading  skills.  The  ability  to  deduce  the  meaning  of  the  word  frx)m  its 
context  is  essential  If  the  student  Is  not  to  become  unhealthily  dependent  on 
the  dictionary,  with  consequent  loss  of  speed  and  Increase  of  boredom  and 
frustration  when  reading.  Deduction  of  meaning  from  context,  speed- 
reading,  and  other  skills  are  vital  to  the  serious  reading  In  any  language,  but 
are  not  within  the  scope  of  this  paper. 

For  the  purposes  of  the  kind  of  exercise  we  shall  be  offering  here, 
however,  we  shall  assume  that  the  learner  has  had  one  semester  of  Arabic. 
He  or  she  will  know  the  alphabet  in  Its  order,  and  have  at  least  some 
familiarity  with  the  common  prefixes  and  suffixes  of  Arabic  morphology. 
The  student  will  therefore  have  little  difficulty  looking  up  easier  words, 
such  asQaaad  wazilr.  tiarfa.  and  can  be  expected  to  know,  at  least  in  theory, 
most  of  the  basic  measures  of  the  Arabic  vert)  and  Its  nominal  and 
adjectival  derivatives.  Thus,  for  the  word  yabhaBu  it  will  not  be  necessary 
to  explain  that  Its  root  is  b-h-e  and  that  the  y  Is  an  Inflexion  of  the 
Imperfect  tense.  The  student  will  also  know,  if  only  by  reference  to  a  sort 
of  laboriously  memorized  chart,  that  the  third  form  of  the  verb  (according 
to  the  system  used  by  Wehr)  is  baatiae  and  that  the  verbal  noun  (masdar)  of 
this  form  is  mubaafiaea. 

There  are  also,  of  course,  phonological  variations  which  produce 
differences  in  meaning  without  any  graphologlcal  distinction.  For  example, 
ktb  may  be  vowel  led  kataba  orkutlba.  according  to  whether  the  verb  Is 
active  or  passive.  The  distinction  between  active  and  passive  might  come 
within  the  scope  of  this  paper,  however,  only  In  the  case  of  verbs  with  weak 
middle  radicals.  The  vert)  lotaada  (VIII  form  of  oaada)  would  become  uatllda 
In  the  passive,  thus  complicating  the  finding  of  the  root.  The  active  and 
passive  participles  of  such  verbs  differ  similarly:  muatlidand  muotaad.  In 
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order  to  find  the  radicals  of  such  forms  a  weak  middle  radical  Is  realized  as 
y,  even  If  it  is  w  in  the  root  form.  Again,  we  emphasize  that  the  student  is 
not  merely  required  to  be  told  a  rule;  as  with  any  skill,  practice  is  required 
before  a  rule  can  be  operated. 

We  are  now  at  the  point  where  the  student  may  be  led  into  the  reading 
of  Arabic  in  its  simpler  written  forms,  such  as  straightforward  newspaper 
reporting,  and  accordingly,  our  first  exercise  consists  of  a  very  short 
newspaper  article,  simplified  only  in  that  guidance  Is  given  in  finding 
difficult  words  in  the  dictionary.^  Potentially  confusing  foreign  words  are 
simply  given  in  their  original  spellings. 


Exercise  I.  In  the  following  exercise  there  Is  a  list  of  words  and  four 
columns  to  be  filled:  a.  the  root  letters;  b.  the  number  of  the  verb  form  and 
the  meaning  of  the  verb;  c.  the  part  of  speech  of  the  word  in  question;  and  d. 
the  English  equivalent  of  the  Arabic  word  in  its  context  In  the  article.  Fill 
In  the  blanks  opposite  each  word.  (Transliterations  would  not,  of  course,  be 
given,  and  Arabic  words  in  the  answers  would  be  written  In  Arabic  script.) 

a.  radicals  b.  verb  form     normal  English  equivalent 

c.  part  of  speech  d.  translation  equivalent 
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Example:  ^" — ' 

a.  ^   c  ' 

b-h-e 
c.  verbal  noun,  pi. 


1.  ^,_^ 

a 

c.  passive  participle 

2.  -.J ^ 

a.  J   ^  ^ 

w-h-d 
c.  active  participle 


3.  •^- 
a 

c 

4.  I iiL 


mubaahaQaat 
b.  Ill  ^^^( to  discuss) 
d.discussions 
mu-  tamar 


b.VIII  -^- 
i?tamara 


muttahidah 
b 

musl<ilah 


JL-i\ 


(to  deliberate) 


d 


b.  IV       JL 


.1    (to  be  difficult) 


askala 

munaagasah 

a b 

c.  verbal  noun 

5.  ^A> — =        taral<l<azat 

a b 

c.  3p  fern  sing  of  perfect  tense  d 


istamarrat 
b 


a.  J  J  f 

m-r-r 
c.  3p  fern  sing  of  perfect  tense  d 


-1—1 


iml<aaniyyah 
b 
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m-k-n 
c.  verbal  noun  d 

8.  ;-.         taquum 

a.  f    3    J         b 

q-w-m 
c d 

Exercise  2.  For  this  exercise  you  are  given  information  about  these 
possibly  tricl<y  words  which  will  enable  you  to  lool<  up  the  meaning  easily. 
Give  the  translation  equivalents. 

1.  quwaat~*^'j — '  q-w-y -- noun     *J — '    quwah  (force) -- 

plural  has  special  meaning  ~ .... 

2.  lilumam  ~    „!)U   ?-m-m  —  noun    i Ti    ummah  —  plural       ^1 

umam  +  li  *  definite  article  ~  .... 

3.*ii;>>— t    tawaari?  —  t-i^?  ~  noun  taari?a  -  plural     a;'>-^ 

tawaar)?  functioning  as  adjective  ~ .... 

Exercise  3.  List  the  words  In  exercises  1  and  2  in  alphabetical  order 
according  to  the  radicals. 

It  is  worth  noting  that  our  hypothetical  student  would  often  have  been 
at  a  complete  loss  if  asl<ed  to  perform  exercise  3  without  help.  As  we  have 
indicated,  many  educated  native  Arabic  speal<ers  would  also  find  this 
difficult.  Exercises  of  the  above  l<ind  will  help  to  confirm  the  student's 
grasp  of  Arabic  morphology  and  at  the  same  time  provides  practice  in  the 
use  of  the  dictionary.  But  as  we  have  noted,  and  as  is  exasperatingly 
apparent  to  every  lower  intermediate  learner  of  Arabic  as  a  foreign 
language,  there  are  many  Arabic  words  which  are  either  morphologically  or 
graphologically  ambiguous.  Sufficient  examples  of  this  can  be  found  in  the 
passage  just  examined.  For  example,  j—= — -'    istamarra  could,  from 
such  a  learner's  point  of  view,  be  the  VIII  form  of  the  root  s-m-i —  to  spend 
an  evening  chatting.  This  is  of  course,  a  graohological  ambiguity.  An  Arabic 
speaker,  reading  the  word  in  context,  could  never  make  this  mistake,  but  for 
a  foreign  learner  the  mistake  is  very  natural. 


Until  the  learner's  vocabulary  has  expanded  there  Is  probably  no  way 
in  which  one  can  prevent  the  waste  of  time  involved  In  such  mistaken 
identification  of  roots,  but  it  would  be  helpful  for  the  learner  to  examine 
types  of  ambiguity  in  order  that  he  will  not  merely  give  up  when  his  first  or 
second  guess  at  the  word's  root  proves  unprofitable  for  making  sense  of  the 
passage.  We  therefore  suggest  the  following  type  of  exercise  to  help  the 
learner  explore  the  possibilities  thoroughly. 

Exercise  4  Note  that  a  word  written    -« — -^-^   (mtfid)  could 
theoretically  be  derived  from  either  of  the  following  roots,  either  of  which 
might  have  this  graphologlcal  exponent  of  one  of  their  variations  in  an 
unvoweled  text. 8 

*  J  ^M,  f  m-t-t»-d:  XI  form  (quadrillteral)  j — *^  matiiad  —  third  person 
masculine  singular 

J   ^3  w-t>-d:  VIII  formj— *^i  ittahada  ~  passive  or  active 

participle        muttatild/muttahad 

In  fact,  only  the  first  and  the  last  of  these  is  an  actual  Arabic  word,  but  in 
some  cases  of  ambiguity  there  may  be  several  real  possibilities.  For  the 
following  words,  write  down,  in  alphabetical  order,  the  possibilities  for 
their  derivation  in  the  form  given  above.  Then  look  in  the  dictionary  and 
check  each  form  that  is  a  real  possibility.  (Asterisks  to  indicate  forms  that 
are  merely  theoretical  would  not  be  given  to  the  students.) 

a.  j-^ -' 

*  s-t-q-n  XI  form  (quadrl literal)  third  person  masculine 
singular,  perfect  tense. 

*  s-q-n  VIII  form  third  person  masculine  singular,  perfect  tense, 
q-r-n  X  form  third  person  masculine  singular,  perfect  tense. 

*  t-r-m:  I  form  third  person  masc.  sing.,  perfect  tense— tarama 

*  r-m-m:  V  form  third  person  masc.  sing.,  perfect  tense— 

tarammama 
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r-m-y:  I  form  second  person  sing.,  masc,  jusslve—tamnl 
*        r-m-w:  I  form  second  person  sing.,  masc,  jussive— turmu 

c.  J •    (imperative)     j — •' 

Checl<  the  roots  which  would  theoretically  yield  this  form.  Note  also  which 
of  them  actually  exist. 

w-m-r— correct 
m-w-r— correct—exists 
y-m-r— correct 
?-m-r— correct— exists 

d.  Give  the  possible  rx)ot  or  fX)ots  of  this  form.  With  the  aid  of  the 
dictionary,  check  which  actually  exist. 


e.  Give  the  possible  roots  of  the  following  verbs,  using  the  dictionary  to 
make  sure  the  r^ots  you  choose  actually  exist  in  the  form  given  below. 

— •J  da9aa,    ^ — »j  r^maa,J — ^kil,   J — -  sal,  ^j — *  yarithu 

d-9-w,  r^m-y,  ?-k-l/k-y-l,    s-?-l/s-y-l,     w-r^e 

Conclusion 

A  single  brief  paper  cannot  do  justice  to  the  complexities  of  the 
Arabic  dictionaiTj,  even  in  its  relatively  simple  bilingual  form.  Arabic 
dictionaries,  as  we  have  noted,  are  structured  on  morphological  as  well  as 
alphabetical  principles,  thus  to  use  them  efficiently  requires  a  knowledge 
of  morTDhology  as  well  as  of  the  alphabet.  Naturally,  there  Is  no  short  cut  to 
Arabic  morphology,  which  is  such  a  lar^e  part  of  Arabic  grammar  as  a 
whole;  even  a  gifted  student  will  take  year^  to  be  fluent  in  its  use. 
Nevertheless,  it  is  the  experience  of  generations  of  learr^er^  that  the 
dictionary  is  a  usable  and  indeed  indispensable  tool  from  early  on  in  the 
process  of  learning  Arabic.  Unfortunately,  such  learr^ers  have  also 
experienced  the  frustration  and  boredom  of  groping  uncertainly  around  the 
dictionary  searching  for  a  root  they  are  unsure  of.  In  this  paper  we  have 
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tried  to  suggest  ways  In  which  their  efforts  can  be  minimized,  and 
efficiency  maximized  at  the  earliest  possible  stage. 

The  principle  behind  the  exercises  we  have  suggested  is  the 
following.  As  in  the  teaching  of  any  closed  system,  it  is  essential  for  the 
learner  to  be  aware  of  the  range  of  possibilities  available  as  the  root  form 
of  the  word  with  which  he  is  faced,  Thus  exercises  in  determining  the 
possible  roots  of  a  given  graphic  form  will  save  the  labor  of  looking  up 
roots  which  could  not  give  rise  to  the  form  in  question  as  well  as  making  1t 
less  likely  that  the  student  will  give  up  in  despair  because  the  correct  root 
has  not  been  considered.  As  most  of  the  problems  are  caused  by  roots  that 
are  doubled,  weak,  hollow,  hamzated,  etc.  the  exercises  seek  to  draw  the 
student's  attention  to  the  point  of  difference  among  them,  and  to  the 
ambiguities  that  inevitably  occur.  Our  exercises  are  only  samples,  but  a 
small  text  book  or  pamphlet  which  would  set  out  a  systematic  series  to 
practice  the  student  in  every  aspect  of  these  complexities  would  be  a  most 
useful  addition  to  the  armory  of  any  teacher  of  Arabic  to  foreigners.  Such  a 
book  would  not  simply  practice  dictionar\|  skills,  but,  as  we  have  Implied 
above,  it  would  be  a  useful  aid  to  memorizing  the  morphology,  one  of  the 
hardest  tasks  confronting  any  foreign  learner  of  Arabic. 

We  must  finally  acknowledge  that  in  the  Arab  World  in  general,  little 
has  been  done  in  elaborating  a  methodology  of  teaching  Arabic  to  non-native 
speakers.  Even  the  bilingual  dictionaries  in  general  use  in  the  Arab  world 
are  largely  the  work  of  Western  Orientalists.    Arabic's  long-overdue 
acceptance  as  one  of  the  official  languages  of  the  United  Nations  makes  it 
more  urgent  than  ever  that  such  a  methodology  be  formulated.  Even  in 
America  and  Britain  energetic  research  in  the  methodology  of  teaching 
English  as  a  foreign  language  has  co-existed  for  decades  with  the  most 
traditional  grammar-translation  methods  in  teaching  Arabic.  The  two  large 
volumes  of  McCams  and  Abboud's  Elementary  Modem  Arabic  only  nod  in  the 
direction  of  the  communicative  approach  with  rather  stilted  dialogues  for 
practice.  The  problems  involved  In  teaching  Arabic  to  foreigners  are  unique; 
not  only  the  grammar  but  the  difference  between  written  Arabic  and  the 
colloquial  dialects  presents  difficulties  not  faced  by  the  learner  of  any 
European  language.  But  surely,  it  is  not  impossible  to  devise  a  teaching 
method  which  is  realistically  directed  towards  the  needs  of  foreign 
learners  to  speak,  understand,  read,  and  write.  We  hope  that  this  paper  may 
contribute  to  the  efficiency  of  teaching  the  reading  of  Arabic. 
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NOTES 


'Even  the  Irregular  past  tenses  such  as  "went"  are  generally  listed 
separately.  Under  this  head  Webster's  has  "past  of  GO." 

2ln  volume  1 ,  explanations  are  given  on  pp.  1 68  229,  and  354,  and 
drills  on  pp.  233,  320,  and  354  Volume  2  contains  no  dictionary  work. 

3aIso  worth  recording  Is  Al-Khalll  Ibn  Ahmad's  suggestion  for  a 
dictionary  ordered  according  to  the  point  of  articulation  of  the  sound, 
beginning  with  the  pharyngeals.  Foreign  learners  may  be  thankful  this 
notion  did  not  find  universal  favor. 

^Al  liawrld.  20th  edn.  (Beirut:  Darel-Ilm  111  Malayen,  1986). 

SDIane  Larsen  Freeman,  "An  Explanation  for  the  Morpheme  Acquisition 
Order  of  Second  Language  Learners,"  Language  Learning  26  ( 1 976):  125-134 

^A  useful  concise  account  of  these  complexities  can  be  found  In  David 
Cowan,  An  Introduction  to  Modem  Literary  Arabic  (Cambridge:  Cambridge 
UP,  1 978),  lessons  15- 1 8. 

"^The  article  is  taken  from  the  Amman  newspaper  Swat  al-sa9b. 
Tuesday  31  March,  1987,  p.  1.  We  gratefully  acknowledge  permission  to 
quote. 

^The  term  "exponent"  is  used  by  J.  C.  Catford  In  A  Linguistic  Theory  of 
Translation  (London:  Oxford  UP,  1965),  p.  7  and  chap.  9). 
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In  this  paper  we  develop  an  analysis  for  the  tonal 
polarity  found  in  two  Gur  languages— Moore  and  Lama. 
We  claim  that  tonal  polarity  is  an  "epiphenomenon"— 
the  superficial  reflex  of  a  rule  that  dissimilates  the 
high  tone  of  an  affix  with  respect  to  the  high  tone  of 
a  root.  Crucial  to  our  analysis  is  the  postulation  of 
an  underlying  three-way  tonal  contrast  of  high,  low, 
and  toneless  stems.  Gur  paper  Is  organized  as 
follows.  In  the  first  section  the  polar  tones  of  Moore 
are  introduced.  We  then  encounter  a  problem  in 
accounting  for  the  tonal  patterns  found  in  the  assoc- 
iative construction.  We  postulate  an  abstract  low 
tone  affix  for  this  construction.  We  then  turn  to 
Lama.  After  the  effect  of  two  superficially  obscur- 
ing tonal  rules  are  removed,  we  see  that  Lama  also 
has  reflexes  of  tonal  polarity  in  several  categories. 
We  then  show  that  the  Lama  complexities  can  be 
turned  to  advantage  in  corroborating  the  postulation 
of  the  low-tone  affix  for  the  associative  construc- 
tion In  Moore. 

1.  Some  tonal  rules  of  Moore 

Moore  is  a  two-tone  language  of  Buridna  Faso.  In  this  paper  we  build  on 
some  of  the  results  in  the  excellent  earlier  tonal  descriptions  of  Peterson 
(1971)  and  Kinda  (1983).  In  our  transcriptions  acute  accent  marks  a  high- 
toned  syllable;  low-toned  syllables  are  unmarked.  When  a  syllable  has  a 
long  vowel,  the  tone  is  transcribed  on  the  first  half  of  the  geminate.  The 
underdot  denotes  I-ATR]  vowels  and  the  undertilde  marks  nasalization. 

Moore  has  an  extensive  system  of  nominal  class  suffixes  marking 
singular  and  plural.  In  (1)  we  cite  examples  from  six  of  the  most  productive 
classes. 
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(1)  sg.  pi. 

kor-g6  kor-d6  'sock' 

roo-go  ro-t6  'house' 

w6b-go  w6b-do  'elephant' 

16r)-go  16n-do  'hole' 

bid-go  bf-t6  'sorrel' 

mi'u-gu  mi'i-du  'red' 

tii-ga  tti-se  'tree' 

kge-gi  kgg-sfe  'green' 

sao-ga  sea-se  broom" 

war)-ga  wann-se  'hollow' 

baa-ga  baa-se  dog' 

nw6b-ga  nob-se  "tree  sp.' 

gob-r6  gwab-6  'left  hand' 

tgb-re  tyb-6  'ear' 

kijg-ri  kug-a  'stone' 

s6m-de  s6m-a  'debt' 

w6m-de  wam-a  'calabash' 

n6b-r6  nwab-6  'fruit  sp.' 

Three  tonal  patterns  are  to  be  observed  in  the  data  of  (1):  low  tone  on  the 
root  and  high  tone  on  the  suffix;  high  on  the  root  and  low  on  the  suffix;  and 
high  tone  on  both  the  root  and  the  suffix.  There  are  no  cases  of  low  tone  on 
both  the  root  and  the  suffix.  This  is  a  glaring  gap  that  any  analysis  should 
explain.  Since  the  description  of  the  high-high  class  is  more  complex,  we 
shall  set  these  nominals  aside  until  section  2.  Given  the  exclusion  of  the 
high-high  class,  two  possible  analyses  suggest  themselves  for  the 
remaining  low-high  and  high-low  patterns.  First,  we  might  propose  that 
stems  such  as  [kor]  'sack'  and  [saa]  'broom'  have  the  underlying 
representations  of  (2a)  with  two  tones. 

(2)  a.  kor  saa 

I  I 

L  H  H  L 

b.  kor-go         saa-ga 

I    /  I   / 

L  H  HL 
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Since  hoore  has  the  Itmttatton  of  one  tone  per  syllable,  the  second  of  the 
two  tones  will  always  be  realized  on  the  number  suffix  (2b).  In  this  way  we 
account  for  the  striking  fact  that  a  given  root  Is  always  associated  with  the 
sanr»e  tonal  pattern  (low-high  or  high-low)  regardless  of  the  particular 
number  suffixes  It  selects.  This  analysis  claims  that  the  suffixes  lack  an 
Inherent  tonal  specification.  They  acquire  one  from  the  second  tone  of  the 
root.  Moreover,  we  could  account  for  the  high-high  pattern  by  saying  that 
the  root  contains  a  single  high  tone;  the  underlylngly  toneless  suffix  would 
then  seek  out  the  tone  of  the  root  via  autosegmental  spreading,  as  in  (3). 

(3) 


baa 
1 

baa-ga 

1 
H 

1  /' 
H 

Toneless  affixes  are  quite  prevalent  and  so  this  analysis  Is  rather 
natural.  There  Is,  however,  one  worrisome  aspect.  It  falls  to  explain  the 
systematic  absence  of  the  low-low  tonal  pattern.   Phonological  Inventories 
are  typically  hierarchical  In  nature.  The  presence  of  more  complex  members 
generally  presupposes  the  existence  of  simpler  elements.  For  example, 
CVCC  syllables  presuppose  CVC  syllables;  mid  vowels  typically  emerge  only 
In  the  presence  of  high  and  low  vowels.  The  analysis  proposed  above  violates 
this  basic  thesis  of  phonological  organization.  It  posits  the  complex  [LH] 
and  [HL]  tonal  melodies  In  the  absence  of  a  basic  (L).  A  phonological  theory 
that  permits  complex  units  such  as  ILH]  and  [HL]  only  In  the  presence  of  the 
more  elementary  ID  and  IHI  would  preclude  such  an  Interpretation  of  the 
Moore  tonal  patterns. 

An  alternative  analysis  explains  the  low-low  gap.  It  claims  that  the 
nominal  roots  are  associated  with  a  single  tone  (low  or  high)  In  the 
underlying  representation  (4o). 

(4) 


kor 
1 

saa 
1 

b.  V  -» !•  hi)  /      V 

1-*  hi) 

L 

H 

It  also  claims  that  Moore  is  subject  to  a  rule  requiring  the  sufflxal  tone  to 
be  the  opposite  of  the  root  tone  (4b).  Any  given  root  will  always  Induce  the 
same  sufflxal  tone  since  the  sufflxal  tone  Is  entirely  predictable  from  the 
root  tone  by  this  tonal  rule.  We  thus  account  for  the  fact  that  each  root  has 
the  same  tonal  pattern  (low-high  or  high-low)  regardless  of  the  number 
suffix  It  selects.  Furthermore,  If  we  assume  that  every  root  has  a  tone  then 
by  the  polarity  principle  there  Is  no  way  In  which  the  low-low  pattern  can 


be  derived  since  the  sufflxal  tone  is  always  the  opposite  of  the  root.  (We 
show  how  to  derive  the  high-high  pattern  under  this  analysis  In  section  2). 

This  alternative  analysis  thus  not  only  accounts  for  the  low-low  gap  but 
also  posits  an  underlying  inventory  of  melodies  that  does  not  violate  the 
hierarchy  thesis.  An  additional  argument  In  its  favor  will  be  presented 
shortly.  But  first  we  must  briefly  mention  a  few  additional  features  of  the 
language  that  play  a  role  In  our  analysis.  First,  like  many  other  African  tonal 
languages,  Moore  has  downdrlft  (Kinda  1983).  In  the  phrase  zb  ]br)qdrb  of 
(5)  the  first  two  syllables  are  realized  on  the  same  high  pitch  while  a  high 
following  a  low  is  phonetically  realized  on  a  perceptibly  lower  pitch. 

(5)  ze  lengere  "bring  a  bowl"        I  "  "  _  -  ] 

HHLH 

Second,  Moore  has  the  ubiquitous  rightward  extension  of  high  tones.  In  Moore 
this  is  a  sandhi  rule  operating  between  words.  Since  Moore  is  subject  to  the 
limitation  of  one  tone  per  syllable,  a  spreading  high  dislodges  a  low  tone  to 
yield  a  downstep  (6d).  We  Illustrate  by  placing  saa-ga  'broom'  and  kor-go 
'sack'  after  the  imperative  verbs  Jm  give'  and  zt  "bring*. 

(6)  a.  kQ  saa-ga  "give  a  broom" 

b.  kg  kor-go  'give  a  sack' 

c.  za  saa-ge  'bring  a  broom' 

d.  za  kor-'go  'bring  a  sack' 

If  floating  low  tones  persist  to  the  phonetic  level  then  essentially  the  same 
procedure  which  produces  the  downdrift  observed  In  (5)  will  generate  the 
downstep  of  (6d). 

(7)  za  kor-go         za  kor-go 

III  1/       I 

H     L      H     •♦      H    L    H 

Another  general  feature  worthy  of  mention  is  the  extensive  vowel 
elision  operative  In  the  language.  The  final  vowel  of  a  suffix  Is 
systematically  deleted  except  before  a  major  syntactic  break  such  as  a 
clause  boundary  or  of  course  pause.  Thus,  while  a  noun  shows  its  final 
vowel  In  the  citation  form,  this  vowel  is  generally  absent  when  the  noun  is 
situated  In  a  sentence.  To  Illustrate,  consider  the  associative  constructions 
In  (8)  built  from  the  elements:  nfed-a  'man',  naa-b6  'chief,  s6a-qa  'broom', 
and  kor-g6  'sack'.  The  downstep  in  (88)  illustrates  the  phenomenon  of  tonal 
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stobtmy.  Even  though  the  final  syllable  of  qM  has  syncopated,  its  low 
tone  persists  to  the  phonetic  surface,  where  it  triggers  a  downstep  of  the 
following  high  tone  (8e). 

(6)      a.  n6d '  s6aga  'nrtan's  broom' 

b.  ned  korg6  'man's  sack* 

c.  naab  saaga  'chief's  broom' 

d.  naab  k6r'g6  'chief's  sack' 

e.  n^da  saaga  n^d  saaga 

I  I    I    I  III 

H  L  H    L      -»         H  L  H    L 


f.   1.  n^dakorgo  ii.   nedkorgo 

I   I    I    I  III 

H  L    L    H  HLL    H 

The  form  n6d  korg6  of  (8b)  raises  some  Important  questions  concerning 
the  notion  of  adjacency  in  the  formulation  of  phonological  rules  defined  on 
autosegmental  representations.  The  question  is  why  the  high  tone  of  the 
first  syllable  does  not  spread  to  the  initial  syllable  of  the  following  noun 
korgo  to  produce  the  ungrammatical  *ned  k6rlg6.  There  are  a  number  of 
possible  answers.  The  least  interesting  simply  stipulates  that  high-tone 
extension  applies  before  elision.  Given  this  ordering,  the  rule  is  defined  on 
the  representation  in  (8f  1);  spreading  of  the  high  Is  blocked  by  the 
autosegmental  ban  on  crossing  association  lines.  However,  simply  to 
stipulate  such  an  ordering  claims  that  we  have  no  more  reason  to  expect  the 
the  grammatical  ned  korq6  than  the  ungrammatical  *ned  k6rlQ6.  Yet  there  is 
good  reason  to  believe  that,  in  a  configuration  such  as  that  of  (9a)  below,  a 
rule  spreading  a  high  tone  to  the  right  will  be  blocked  universally  by  the 
intervening  low  tone,  despite  the  lack  of  an  association  line  to  Invoke  the 
line-crossing  prohibition. 

(9)      a.       V  V 

I  I 

H     L     L 


We  shall  present  an  example  later  from  Lama.  Additional  cases  of  this  form 
are  reported  by  Stewart  (1983)  for  Adioukrou  and  by  Armstrong  ( 1 968)  for 
Vala  Ikom.  In  all  these  languages  floating  low  tones  are  postulated  in  order 
to  trigger  a  downstep.  These  languages  also  have  rules  extending  a  high 
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tone  to  the  right.  But  tn  a  configuration  such  as  (9o)  the  Intervening 
floating  low  regularly  blocks  the  rightward  spread  of  the  high  tone. 

We  nr^ust  thus  seek  an  explanation  for  why  an  intervening  floating  tone 
characteristically  blocks  tonal  extension  across  its  path.  We  may 
understand  this  by  extending  the  notion  of  "adjacency"  in  autosegmental 
representations  developed  by  Myers  (1988:5);  "Two  elennents  are  adjacent 
iff  no  element  intervenes  on  the  same  tier  between  them  or  the  elements 
they  are  associated  with".  If  the  Moore  tone  sandhi  rule  is  understood  as 
one  which  establishes  a  link  between  the  first  syllable  of  a  word  and  a  left- 
adjacent  high  tone,  then  the  rule  will  fail  to  apply  in  representation  (8e  11) 
since  the  high  tone  will  not  be  adjacent  in  the  relevant  sense;  rather  the 
intervening  floating  low  tone  is  the  one  that  is  "adjacent"  to  the  second 
vowel.  With  this  understanding  of  adjacency  we  thus  need  not  stipulate  an 
ordering  between  the  high  tone  extension  rule  and  elision.  The  former  will 
be  blocked  in  the  derivation  of  ned  korgo  even  if  elision  applies  first. 

One  more  preliminary  feature  of  Moore  worthy  of  mention  here  Is  the 
noun+adjective  construction.  In  this  collocation  the  suffix  marking  number 
for  the  whole  NP  is  realized  on  the  adjective  and  not  on  the  head  noun. 
Moreover,  the  lexical  realization  of  the  number  element  is  determined  by  the 
adjective  and  not  the  noun.  To  illustrate  these  points,  consider  the  grammar 
of  phrases  composed  of  the  noun  kor-go  sg.,  kor-do  pi.  "sack"  and  the 
adjective  k^.^^ga  sg.,  k^^-se  pi.  'green'.  "A  green  sack'  is  expressed  as  kor 
tgizgi  In  Moore.  The  plural  'green  sacks'  iskork^^-sfe.  There  is  no 
agreement  between  the  noun  and  the  adjective  (*kor-g6  k^g-gq)  and  the 
number  element  is  realized  on  the  adjective,  not  the  noun  (*kor-g6  Kgg, 
*kor-g6  K^g-gg)   Finally,  If  two  adjectives  modify  the  head  noun,  then  the 
number  marking  appears  on  the  final  adjective:  kgr  kig^  bed-a  'big  green 
sacks'. 

The  noun+adjective  construction  Is  significant  because  the  head  noun 
lacks  the  number  suffix.  We  now  have  the  ingredients  for  an  Interesting 
test  of  the  two  alternative  tonal  analyses  with  which  we  began  the 
discussion.  The  first  analysis  claims  that  the  low-high  tones  of  kor-g6  and 
the  high-low  tones  of  86a-ga  are  properties  of  the  respective  roots.  The 
second  analysis  claims  that  the  roots  have  simple  low  and  high  tones, 
respectively;  the  second  tone  Is  a  lexical  feature  of  the  suffix  that  Is 
subject  to  the  polarity  principle.  When  these  nouns  are  followed  by  an 
adjective  the  two  analyses  make  different  predictions  about  the  number  of 
tones  that  should  be  found  in  the  representations.   The  paradigm  In  (10) 
follows  straightforwardly  from  the  second  analysis.  The  relevant 
underlying  representations  are  given  in  (11). 
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(10)  a.  korb^da  'big  sacks' 

b.  korkgggi  'green  sack' 

c.  sa  b^da  'big  brooms' 

d.  s6  k^g'g6  'green  broonfi' 

(11)  a.  korbed-a    b.  kork§g-ga     c.  sab^d-a       d.  sa  kgg-ga 

III  III  III  III 

LHL  LLH  HHL  HLH 

The  high  tone  extension  rule  will  spread  the  high  tone  of  [sh]  In  (1  Id)  to  the 
following  syllable,  creating  the  downstep  found  in  (lOd).  In  the  first 
analysis,  on  the  other  hand,  the  relevant  underlying  forms  will  be  those  of 
(12). 

(12)  a.  korb9d-a     b.  kork^^-ga     c.  sa  b^d-a        d.  sa  k$^-ga 

III  ill  III  III 

LH    H    L  LH  L     H  HLH  L  HLL      H 

Here  the  unassoclated  tones  must  be  prevented  from  having  any  material 
effect.  In  particular,  we  do  not  went  the  floating  low  In  (12c)  to  trigger  a 
downstep.  Furthermore,  we  cannot  assume  that  the  high  of  sfl  has  spread  to 
the  first  syllable  of  beda  and  In  the  process  'gobbled  up'  the  floating  L  (as 
In  (13)).  Application  of  the  rule  would  be  blocked  In  this  case,  given  the 
sense  of  adjacency  invoked  earlier. 

(13) 


a.  sa  b9d-a 

sa  bed-a 

1      1    1 

\y\    1 

HL  H    L     -» 

HL  H    L 

A  final  problem  with  the  Initial  analysis  Is  that  the  high-extension  rule 
would  have  to  be  restricted  to  linked  tones.  It  Is  crucial  that  the  putative 
floating  high  tone  in  (12b)  not  spread  to  the  following  adjective  lest  *kor 
kjgj'gfl  be  derived.  We  have  not  found  It  necessary  to  distinguish  between 
associated  and  unassoclated  tones  in  the  analysis  so  far;  the  fact  that  such 
a  distinction  would  be  required  \\ere  is  an  additional  reason  to  doubt  the 
original  premise  on  which  this  analysis  Is  based— that  the  tone  of  the  suffix 
originates  as  a  property  of  the  root. 

To  briefly  summarize  the  discussion  to  this  point,  we  have  examined  two 
analyses  for  the  high-low  and  the  low-high  nominal  tone  patterns  of  Moore. 
One  analysis  construes  these  patterns  as  HL  and  LH  melodies  of  the  root 
realized  on  the  disyllabic  'text'  composed  of  the  root  plus  the  suffix.  The 
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second  analysis  posits  simple  high  and  low-toned  roots  and  a  polar  tone  for 
the  suffix.  The  noun-^adjective  construction  argues  that  the  roots  have  just 
a  single  tone  and  thus  support  the  second  analysis.  But  Moore  also  has  a 
third  tonal  pattern  of  high  on  both  the  root  and  the  suffix.  How  can  this 
pattern  be  accommodated  by  our  polar  tone  analysis? 

2.  Alternating  stems 

Let  us  now  return  to  the  analysis  of  nominals  with  a  high  tone  on  both 
the  stem  and  the  suffix.  We  list  some  additional  examples  in  (14). 

(14) 


sg. 

pl- 

moo-go 

moo-do 

'straw' 

too-go 

too-do 

'expensive' 

buu-ga 

byu-s6 

'goat' 

rum-ga 

rum-sl 

'animal' 

kyd-r§ 

kOd-6 

'old' 

r6a-r6 

r6-ya 

'day' 

pag-6 

p6g-b6 

'woman' 

These  stems  are  anomalous  in  that  they  fail  to  exhibit  tonal  polarity. 
However,  simply  to  consider  them  lexical  exceptions  to  the  polarity 
principle  is  not  an  acceptable  analysis— for  at  least  two  reasons.  First,  this 
pattern  appears  to  be  as  robust  as  the  low-high  and  high-low  ones.  Second, 
if  the  stems  of  (14)  are  treated  as  having  high  tones  that  exceptionally  fail 
to  polarize  the  suffix,  it  would  remain  unclear  why  there  are  no  low-toned 
stems  that  behave  in  the  same  way.  Absence  of  the  low-low  pattern  is  a 
glaring  gap  that  should  be  explained.  While  it  is  clear  that  the  stems  of  (14) 
cannot  have  a  simple  high  or  low  tone  (under  the  polarity  analysis),  there  is 
a  third  option  available  in  the  autosegmental  frameworlc— toneless.  Just  as 
elements  of  the  tonal  tier  may  be  unassociated  with  the  segmental  tier,  so 
autosegmental  phonology  has  found  it  necessary  to  postulate  stems  and/or 
affixes  which  lack  an  inherent  specification  on  the  tonal  tier.  For  example, 
in  many  Bantu  languages  verbal  stems  fall  into  two  classes—high  and 
toneless.  We  might  thus  postulate  the  underlying  representations  in  (15)  to 
distinguish  among  the  three-way  tonal  contrast  found  in  tloore  stems  such 
as  kor-g6.  s8a-ga.  and  baa-ga. 

(15)    kor     saa     baa 
I  I 

L        H 
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We  must  now  exploln  how  the  toneless  stem  ocqulres  Its  high  tone.  While 
we  might  postulate  o  rule  to  assign  toneless  elements  a  high,  Pulleyblank 
(1986)  has  shown  that  the  default  tone  in  a  high-low  system  is  typically 
low.  There  is  consequently  only  one  other  possible  source  for  the  high  tone 
in  baa-ga— the  suffix.  We  might  thus  postulate  that  the  tone  of  baa-qa 
arises  in  two  steps,  as  depicted  in  (16e).    First,  the  tone  of  the  suffix  is 
mapped  to  the  left-most  open  position.  We  then  postulate  a  rule  which  links 
a  toneless  syllable  to  the  high  tone  of  the  preceding  syllable.  We  state  this 
rule  in  (16b).  i 


boo-go 
1 

boo-go 
1/ 

b.    V  ® 

H 

->    H 

H 

(16)     0.   boo-ga 


This  analysis  thus  claims  that  b6a-ga  has  a  single,  multiply-linked  high 
tone  and  that  this  tone  is  a  property  of  the  suffix.  The  noun-adjective 
construction  provides  a  significant  test  of  the  validity  of  this  analysis. 
Recall  that  the  head  noun  lacks  a  number  suffix  In  this  construction.  If  the 
high  tones  of  bao-g^  orlglnote  from  the  suffix  then  the  bore  stem  will  be 
exposed  In  the  noun-adjective  construction.  It  should  appear  with  a  default 
tone,  which  Is  low  in  a  two-tone  system.  This  line  of  reasoning  thus  leads 
to  the  prediction  that  the  stems  of  (14)  will  appear  low-toned  in  the  noun- 
adjective  construction.  The  data  in  (17)  Illustrate  the  generalization  of 
Moore  phonology  that  nomlnols  In  the  high-high  pottem  systemotically 
assume  a  low  tone  in  this  construction—a  striking  confirmation  of  the 
analysis.  (Recall  from  (10)  that  roots  with  a  simple  high  or  low  such  as  [s61 
"broom"  ond  Ikor]  'sock'  remoln  unmodified  In  this  context.) 

(17)    m6o-g6  strow'  mosor)g6  'goodstrow' 

nim-go  'onlmol'  rum  sor)g6  'good  onlmol" 

w6m-de  colobosh'  wom  kgggo  'green  colobosh' 

baa-se  'dogs'  ba  b^da  'big  dogs' 

miu-gu  red'  ba  mi  bedo  'big  red  dogs' 

One  implication  of  this  onolysis  of  the  oltemoting  (toneless)  roots  is 
thot  the  number  suffixes  must  oil  be  underlylngly  high-toned.  They  ore  the 
source  of  the  root  high  In  the  high-high  tonol  pottem,  ond  this  pottem 
populotes  every  noun  closs.2    But  If  we  accept  the  ossumption  thot  oil 
number  suffixes  hove  on  underlying  high  tone,  then  the  polority  phenomenon 
moy  be  reduced  to  the  presence  In  the  grommor  of  the  widely-ottested  ond 
better  understood  process  dlssimlloting  two  successive  high  tones  (cf. 
Meeussen's  rule  In  Bantu). 


(18)  H-»  L/  H_ 

Accordingly,  we  assume  the  derivations  In  (19)  for  the  three  Moore  tonal 
patterns  found  In  kor-g6.  saa-ga.  and  baa-ga.    In  the  first  step  the  UG 
association  convention  assigns  the  tones  from  left-to-right~1n  particular 
the  tone  of  the  suffix  is  assigned  to  the  toneless  root.  In  the  next  step  a 
toneless  suffixal  vowel  assimilates  to  the  tone  of  the  root.  Finally,  high- 
tone  dissimilation  ( 1 8)  applies  to  lower  the  suffixal  tone  in  saa-ga.  This 
rule  fails  to  apply  In  b6a-g6  because,  although  baa-g6  has  two  successive 
high-toned  syllables.  Its  representation  differs  from  s6a-ga  in  having  just 
one  high  tone  spread  over  both  syllables. 

(19)  (korlgo         [saa]  ga        [baa]  ga 

L      H  H     H  H 

korgo  saa  ga         baa  ga 

II  II  I 

L    H  H     H  H  UG  Association  Convention 


inappl.  inappl.  baa  ga 

I  / 


H  tone-spread 


inappl.  saa  ga  inappl 

I      I 
H     L  H-d1ss1m11at1on(18) 

The  Moore  data  thus  suggest  that  the  polarity  phenomenon  can  be  reduced 
to  the  better  understood  dissimilation  rule  of  (18).  The  elimination  of  tonal 
polarity  as  an  independent  phonological  process  implemented  by  an  alpha- 
switch  rule  such  as  (4b)  that  reverses  the  feature  values  for  I±hll  is  a 
welcome  result  In  any  case.  Such  processes  are  anomalous  In  the 
autosegmental  framework,  where  tonological  rules  characteristically 
spread,  delink,  or  delete  tones.  In  this  regard.  It  Is  Interesting  to  note  a 
striking  parallel  to  the  Moore  state  of  affairs  In  the  Chadic  language  Margi 
(Hoffmann  1963,  Pulleyblank  1986),  another  tonal  system  in  which  toneless 
roots  and  polar  affixes  cooccur.  Like  Moore,  Margi  has  a  number  of  affixes 
with  polar  tone.  More  importantly,  Margi,  again  like  Moore,  has  a  three-way 
contrast  among  high,  low,  and  toneless  stems.  The  latter  alternate  between 
high  and  low  depending  on  the  local  tonal  context.  For  our  purposes,  the 
most  Important  point  is  that  the  alternating,  toneless  stems  systematically 
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reflect  8  high  tone  when  they  combine  with  a  polar  affix.  As  In  hoore,  the 
nargi  data  suggest  that  the  polar  tones  are  basically  high  and  that  the 
polarity  phenomenon  arises  from  the  dissimilation  of  adjacent  high  tones. 

Although  more  cases  of  polarity  need  to  be  studied  from  this 
perspective,  the  Moore  data  and  the  Margi  parallel  lead  us  to  conjecture  that 
1)  there  is  no  tonal  polarity  as  such,  that  2)  polar  tones  are  underlying  high 
tones,  and  3)  the  effect  of  polarity  is  the  reflex  of  a  rule  dissimilating 
successive  high  tones.  The  claim  that  polar  tones  are  basic  high  tones 
precludes  the  existence  of  a  system  otherwise  like  Moore  but  where  a 
member  of  the  alternating  class  of  stems  would  appear  with  a  low  rather 
than  with  a  high  tone  when  it  combines  with  a  polar-toned  affix.  That  Is  to 
say,  we  predict  the  absence  of  a  language  Moore"  where  {baa-ga]  instead  of 
Ib6a-g61  would  be  found  beside  (s6a-ga]  and  [kor-gol.  Under  our 
interpretation  of  polarity,  Moore"  would  require  the  suffix  [ga]  to  have  an 
underlying  low  tone  that  would  map  to  the  toneless  root  by  the  initial 
association  conventions  and  dissimilate  to  high  after  a  low-toned  root. 

If  the  prediction  that  such  systems  as  Moore"  will  not  be  found  proves  to 
be  true,  then  the  explanation  probably  lies  in  the  relative  unnaturalness  of  a 
rule  dissimilating  successive  low  tones  in  a  two-tone  system.^  in  such 
systems  high  tone  is  typically  the  active  value,  the  one  in  terms  of  which 
rules  of  assimilation,  dissimilation,  and  deletion  tend  to  be  defined.  In  this 
respect  low  tone  parallels  the  second  member  of  the  aspirated-unaspirated 
opposition:  dissimilation  of  aspirated  consonants  (eg.  Grassmann"s  Law)  is 
natural.   But  dissimilation  of  unaspirated  ones  would  be  quite  unusual. 

3.  The  associative  construction 

To  briefly  summarize  the  results  of  the  previous  section,  we  have 
proposed  an  analysis  for  the  three  nominal  tonal  patterns  of  Moore  In  which 
toneless  stems  such  as  baa-ga  receive  their  high  tone  from  the  suffix  via 
the  UG  left-to-right  association  convention  and  a  spreading  of  the  tone  to 
the  suffix.  Given  this  analysis,  we  expect  such  nouns  to  show  a  constant 
high  tone,  other  things  being  equal.  In  general,  this  expectation  is 
confirmed.  In  (20)  we  cite  examples  of  nominals  as  subject  to  the  copular 
yflfl,  and  first  or  second  object  to  the  verbs  zh.  "bring"  and  ]^  "give".  In  each 
case  hht-qh  has  a  stable  high  tone. 

(20)    a.       baag  yaa       dog  Is' 
s6ag  yaa       "broom  is" 
korag  yia     "sack  is' 
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za  s6aga 
za  k6r*g6 


'bring  a  dog' 
'bring  a  broom' 
'bring  a  sack' 


kQ  b6ag6 
kQ  s6aga 
kg  korg6 


give  a  dog' 
'give  a  broom' 
'give  a  sack' 


kQ  ned  '  baaga 
kQ  ned  *  saaga 
kQ  n^d  korgo 

kQ  naab  baaga 
kQ  naab  saaga 
kQ  naab  kor'gb 

kg  p6g  bdagd 
kQ  p6g  s^aga 
kQ  pag  k6r'g6 


give  a  man  a  dog' 
give  a  man  a  broom' 
give  a  man  a  sack' 

give  a  chief  a  dog' 
give  a  chief  a  broom" 
give  a  chief  a  sack' 

give  a  woman  a  dog' 
give  a  woman  a  broom' 
give  a  woman  a  sack' 


Also  when  a  toneless  root  Is  the  first  element  of  an  associative 
construction  It  consistently  appears  with  a  high  tone.  The  paradigms  In  (21) 
with  the  root  eigii  'woman'  (cf.  pog  son-g6  'good  woman')  illustrate  this 
point. 


(21)    pdg  saaga 
p6g  k6r'g6 


'woman's  broom" 
'woman's  sack' 


All  of  the  data  In  (20)  and  (21)  can  be  accounted  for  simply  by  assuming  that 
the  phrase-level  rules  of  elision  and  high-tone  extension  apply  after  the 
Initial  association  and  spread  of  the  high  tone  from  the  number  suffix  to  the 
toneless  root—presumably  a  lexical  process.  For  example,  the  phrase  nig 
liflClgi  'woman's  sack'  receives  the  derivation  in  (22). 


(22) 


pag-a  kor-go 
1/  I  I 
H       L     H 


pag  kor-go 
1/  I 
H     L    H 


There  is,  however,  one  context  where  the  toneless  nominal  takes  a 
different  tonal  shape  from  what  we  otherwise  expect—when  the  noun  heads 
an  associative  construction.   Consider  the  associative  paradigms  In  (23) 
showing  all  nine  possible  combinations  of  the  three  basic  tonal  patterns. 

(23)    n^d-8  'man',  naab-6  'chief,  p6g-6  'woman*,  b6a-g6  'dog',  kor-g6  'sack', 
s6a-ga  broom' 
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0.       n^d  boQg6     'man's  dog' 
naob  b^Q'Q^  'chief's  dog' 
pig  bQQ'go    'woman's  dog' 

b.  n^d  korg6     'man's  sack' 
naab  k6r'g6  'chief's  sack' 
p6g  k6r!g6     'woman's  sack' 

c.  n^d  's6aga    'man's  dog' 
naab  s6aga    'chief's  broom' 
pog  s^aga     'woman's  broom' 

A  nominal  in  the  high-high  pattern  such  as  baa-ga  systematically  takes  a 
low  tone  when  preceded  by  a  high-low  noun  such  as  nfeda  and  an  Internal 
downstep  when  preceded  by  a  noun  that  terminates  In  a  high  tone.  The 
paradigm  In  (23b)  shows  that  a  low-toned  stem  such  as  Ikor)  has  the  same 
tonal  pattern  In  this  context.  We  can  account  for  the  behavior  of  the  high- 
high  noun  baaqa  by  assigning  the  root  a  low  tone  In  this  context.  The  problem 
Is  to  explain  the  origin  of  this  low  tone. 

One  analysis  simply  postulates  a  special  rule  that  delinks  the  first  of 
two  multiply-linked  syllables  in  the  head  of  an  associative  construction. 
The  result  Is  a  toneless  syllable  which  will  be  assigned  a  low  by  the  default 
rule. 

( 24)  baa-ga    -»    baa-ga    ■*    baa-ga 
U  III 

H  H  L     H 

While  quite  simple  to  state,  this  solution  lacks  any  motivation.  Why  should 
the  first  syllable  delink  rather  than  the  second?  Why  only  In  this  particular 
grammatical  construction? 

Since  the  low  tone  on  baag6  only  appears  In  the  associative 
construction,  and  since  associative  constructions  often  contain  a  tonal 
particle,  we  might  pursue  the  hypothesis  that  the  low  tone  Is  a  grammatical 
particle  marking  this  particular  construction.  Although  the  syntax  would 
suggest  that  the  particle  marks  the  preceding  complement  (like  the  -'j,  of 
the  man's  house).  It  must  group  phonologlcally  with  the  following  noun. 
Such  syntactic-phonological  disparities  are  not  unprecedented,  however, 
(ep.  the  man's  coming)  and  so  should  not  prevent  us  from  exploring  this 
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approach  to  the  problem.  Given  that  the  tonal  particle  Is  proclitic  to  the 
head  noun,  a  toneless  root  such  as  (baa)  will  have  the  representation  of  (25). 

(25)  0.       [  baa  ]   ga 

L  H 

So  long  as  the  particle  comes  up  on  the  same  cycle  as  (or  a  cycle  that 
precedes)  the  suffix,  simple  left-to-right  matching  of  tones  and  tone- 
bearing  unite  will  assign  the  low  of  the  particle  to  the  root  syllable, 
merging  the  contrast  between  toneless  roots  like  [baa]  and  low  roots  like 
Ikorl. 

While  postulation  of  the  tonal  particle  provides  a  satisfactory  answer 
for  the  toneless  roots.  It  is  problematic  for  the  low  and  high-toned  roots. 
We  expect  the  underlying  representations  of  (26)  for  the  associative 
phrases  eag  kor'go  "woman's  sack'  and  cAg  saaga  'woman's  broom'. 

(26)  a. 


I  pag-a  1 

I  kor-go  1 

b. 

I  pag-al 

I  saa-ga 

1/ 

1     1 

\/ 

1     1 

H 

L     L      H 

H 

L    H     L 

Given  the  definition  of  adjacency  invoked  earlier,  the  unassociated  low 
particle  should  block  spread  of  the  high  tone  from  o^gji  and  trigger  the 
downstep  of  a  following  high  tone.  We  thus  incorrectly  predict  *ft 
and  *Bflg  !  saaga.  The  postulated  low-tone  particle  thus  must  be  removed  by 
the  time  these  phrase-level  rules  apply— perhaps  by  an  application  of  stray 
erasure  at  the  end  of  the  lexical  phonology.  We  unfortunately  have  no 
Independent  evidence  for  this  step  and  thus  must  admit  that  this  aspect  of 
the  analysis  remains  problematic.  We  shall  see  that  the  associative 
construction  in  Lama  presents  a  similar  alternation.    But  In  Lama  the 
evidence  for  the  low-toned  particle  Is  more  substantial. 

4.  Some  rules  of  Lama  tonology 

Lama  Is  a  Gur  language  spoken  by  some  60,000  persons  in  Northern  Togo. 
Previous  work  on  the  language  has  Included  a  brief  but  valuable  grammatical 
sketch  by  Prost  (1964).  As  far  as  we  know,  the  tonology  of  the  language  has 
never  been  described  before.  In  this  section  we  discuss  some  of  the  major 
tonologlcal  processes  we  have  uncovered.  Later  sections  discuss  some 
reflexes  of  tonal  polarity  In  Lama  and  the  alternations  that  arise  In  the 
associative  construction."* 
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Lama  has  three  surface  tones  on  a  syllable  spoken  In  Isolation:  a  high 
(eg.  ni  'seel'),  a  low  (eg.  nfl  'with"),  and  a  fall  (eg.  Mfl  'cow').  As  In  hoore,  we 
transcribe  the  tone  of  the  syllable  on  the  first  half  of  a  geminate  vowel. 
Our  work  has  focused  primarily  on  the  nouns.  There  are  some  ten  noun 
classes  In  Lama.  The  first  Is  unafflxed  while  the  remaining  are  marked  by  a 
suffix  which  is  rather  transparently  related  to  the  corresponding  pronoun 
that  stands  for  the  noun.  Nouns  drawn  from  the  first  nominal  class  denote 
kin  and  other  persons  as  well  as  a  few  animals.  Class-2  forms  the 
corresponding  plurals.  The  suffix  Is  -Qfl,  with  the  -q  missing  after  many 
consonants.  Monosyllabic  stems  appear  In  the  four  phonologlcally  distinct 
tonal  classes  tabulated  In  (27). 

(27)  class  1  class-2 

a.  low] 


b.  high 


c.  fans 


d.  I0W2 


cl 

cma 

father* 

r1 

rina 

'mother* 

kp 

kQna 

'sister" 

n 

ran« 

'friend' 

w6al 

w6aia 

'husband' 

y(r 

y(r« 

■person' 

w6n 

w6n(n« 

'husband's  mother" 

w6l 

w6ld 

'daughter-in-law' 

Idel 

l^eia 

"widow" 

y^lem 

y6lemp$ 

'blind  person* 

y^dem 

y$damp$ 

'co-wife' 

lAo 

taan 

'antelope' 

nao 

ndan 

cow* 

8f1 

slln 

"sheep" 

Iti 

tlln 

elephant* 

yal 

yal6 

'wife' 

opal 

apald 

'man' 

raol 

raaln* 

'brother* 

afal 

afal6 

"son* 

reen 

reenn$ 

'cousin' 

nun 

nunnd 

aunt' 

Although  the  nouns  in  (27a)  and  (27d)  are  tonally  identical  In  isolation— 
both  are  realized  as  low—they  systematically  differ  in  their  phonological 
behavior.  Furthermore,  the  contrast  between  high  (27b)  and  fall  (27c)  Is 
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realized  directly  only  before  pause.  When  a  noun  from  the  latter  group  is 
placed  before  another  word  in  the  phrase  it  appears  with  a  simple  high  tone. 
The  four  tonal  classes  thus  reduce  to  a  simple  high  versus  low  opposition  in 
the  phrase.  However,  their  effect  on  the  tone  of  the  following  word  is  such 
that  a  four-way  phonological  contrast  is  maintained  before  a  word  that 
begins  with  a  simple  low  In  Isolation  (eg.  sewa  'ran',  te  'under')  and  is 
reduced  to  a  three-way  contrast  before  a  high  tone  (eg.  ti  'chez'),  since  low^ 
and  I0W2  nouns  do  not  contrast  in  this  environment. 


(28)    lowi 

ci                 'father' 

high 

waal            'husband,  man' 

fall 

naa              'cow' 

I0W2 

yal              "woman' 

sewa 

'ran'                      t? 

under' 

c1  sewa 

'father  ran"            ci  t? 

'under  father' 

waal  se'wa 

'man  ran'                waal  ti 

'under  man' 

naa  sewa 

'cow  ran'               n6a  t? 

'under  cow' 

yol  sh'ytb 

'woman  ran'           yal  t§ 

'under  woman' 

te 

'at,  Chez' 

clt^ 

Chez  father" 

waol  te 

'Chez  man' 

naa !  t| 

'Chez  cow' 

yalt^ 

'Chez  woman' 

Like  Moore,  Lama  has  the  rule  which  extends  the  domain  of  a  high-tone 
one  syllable  to  the  right.  When  the  following  syllable  contains  a  low,  a 
falling  tone  results.  This  high-tone  extension  operates  In  a  variety  of 
grammatical  contexts. 

(29)    a.  associative  construction 


waal    husband' 

ci  "father* 

w6a1  cl   'husband's  father' 


yir  "person* 

ra  "friend" 

y(rr6  "person"s  friend" 


b.  verb+object 


nh  "see!" 

r1  "mother" 

nh  n  "see  mother" 


cf.      SM  'take!' 

su  r1  'take  mother" 
sg  w6a1  "take  husband* 


93 


c.  noun-^postposltlon 

te  'under' 

waal  1$  'under  husband'   cf.      c1  le   under  father' 

We  shall  account  for  these  alternations  by  the  rule  In  (30). 

(30)  V     V 

H 

Unlike  In  Moore,  where  the  extension  of  high  tone  only  operates  between 
words,  this  rule  applies  within  the  word  as  well  In  Lama.  We  return  to  this 
point  shortly. 

The  falling  tone  that  arises  from  high-tone  extension  only  surfaces 
prepausally.  In  other  positions  a  fall  simplifies  through  the  delinking  of  its 
low-tone  component.  This  process  applies  not  only  to  falling  tones  derived 
from  high  tone  extension  but  also  to  underlying  falling  tones  (31c). 

(31)  a.  waal  'husband',  ci  'father' 

waal  Ci  'husband's  father' 

waal  CI  sewa  'husband's  father  ran' 

b.  yir 'person',  ra 'friend' 
y(rrd   'person's  friend' 

yfr  r6  '  w6l  'person's  friend's  daughter-in-law' 

c.  nda  'cow'  naa  te  'under  cow' 
n6a  8ew6  'cow  ran'     n6a  '  te  chez  cow' 

High  tones  are  realized  at  a  lower  pitch  value  after  both  overt  low  tones 
(downdrift)  as  well  as  after  simplified  falling  tones  (downstep).  We 
assume  that,  as  in  Moore,  a  floating  low  tone  initiates  a  downstep.  (31b) 
thus  has  the  derivation  of  (32).  In  the  first  step,  the  high  tone  of  yJr 
spreads  to  the  low  of  j^.  The  resulting  falling  tone  simplifies  through  the 
disassociation  of  the  low,  which  activates  a  phonetic  downstepping  of  the 
high  tone  on  atM 

(32)  yirrawol         ylrrawol  yirrawol 

III  i/l    I  I/-       I 

HLH      -»HLH      -»      HLH 
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Note  that  the  floating  low  tone  that  results  from  simplification  of  the  fall 
in  nan  'cow'  (31c)  not  only  triggers  a  downstep  In  oflfl  '  Ifi  "chez  cow"  but, 
under  the  sense  of  adjacency  invoiced  earlier  In  our  discussion  of  Moore, 
also  helps  to  block  extension  of  the  high  tone  in  naa  sewa  'cow  ran". 

In  contrast  to  high  tones,  the  low  tone  in  Lama  is  generally  inert.  After 
a  low,  high  tones  remain  unchanged  in  most  contexts. 


(33)    a.       preposition 


na  'with",  ra  'friend*,  w6al  'husband* 
na  ra  'with  friend* 
na  waal  'with  husband' 

b.  verb 

sg  take'  imper,  cl  "father*,  y(r  person' 
SM  cl  'take  father* 
sMyir 'take  person' 

c.  negative 

ya  'no',  r1  'mother',  wSal  'husband' 
ya  r1  'no  mother' 
ya  w6al  'no  husband' 

There  Is,  however,  one  notable  exception  to  this  statement.  When  it  occurs 
in  the  head  noun  of  an  associative  construction,  a  high  tone  appears  to  lower 
after  a  preceding  low.  We  return  to  this  phenomenon  later. 

(33)    d.  associative 

rl  waal  'mothers  husband'  (cf .  w6al  'husband' 

cl  wol  'father's  daughter-in-law'  w61  'daughter-in-law' 

ra  yqdgm  'friend's  co-wife'  y^dpm  'co-wife' 

Let  us  now  consider  the  nouns  In  (27d).  Although  they  are  low  in 
isolation  they  consistently  differ  from  the  elements  of  (27a)  both  in  their 
effect  on  a  following  tone  and  In  the  way  in  which  they  are  affected  by  the 
preceding  tonal  context.  As  we  see  in  (34),  these  nouns  have  the  same 
effect  08  the  high-toned  nouns  of  (27b)  In  producing  a  fall  or  a  high  plus 
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downstep  on  o  following  low-toned  syllable.  They  thus  differ  from 
low] nouns  such  qs  ea  'friend'  which  do  not  hove  this  effect. 

(34)  yol  "wife",  sew6  'ran'        cf.      w6al  'husband' 

yal  s§'w6  'wife  ran'  w6al  s6'w6  'husband  ran' 

ra  sew6  'friend  ran' 

raal  'brother',  ro  'friend'  w6l  'daughter-in-law' 

raal  ra  brother's  friend"  w6l  ra  "daughter-in-law's  friend' 

ra  raal  'friend's  brother' 

t9  'under'  w6al  t$  'under  husband' 

yal  t$  'under  wife'  ra  t?  'under  friend' 

Clearly,  I0W2  nouns  such  as  yal  'wife'  and  asl  'brother'  nriust  contain  a  high 
tone  In  their  underlying  representation.  Since  we  have  already  postulated 
low  (eg.  Ea  'friend'),  high  (eg.  w6al).  and  HL  (eg.  nfifl  'cow")  nouns,  a  LH 
sequence  seems  the  most  likely  solution.^  Other  things  being  equal,  we 
would  posit  the  representation  of  (35a)  over  that  of  (35b). 

(35)  a. 


yal 

b.  yal 

1 

|\ 

L  H 

L  H 

So  long  as  the  high-tone  extension  rule  Is  permitted  to  affect  both 
associated  and  unassoclated  tones,  it  would  appear  that  (35a)  leads  to  a 
simpler  grammar.  Why  postulate  attachment  of  the  high  In  (35b)  If  we  must 
only  remove  It  later?  The  problem  Is  that  with  (35a)  It  becomes  difficult  to 
explain  the  behavior  of  the  I0W2  nouns  when  they  are  preceded  by  a  high 
tone.  As  the  data  In  (36)  show,  they  regulariy  appear  with  a  simple  high 
tone  In  this  environment.  In  particular,  no  downstep  Is  produced. 

(36)  yal      wife' 

w6al    husband' 

w6al  y6l  'husband's  wife' 

w6al  y61  sh-yfb  husband's  wife  ran'  (  *  w6al  y6l  '  s6'w6 

n6  'seer,  yal  'wife"  cf.    ra  'friend' 

n6  y61  see  wife'  n6  rh  "see  friend' 

raal  y61  'brother's  wife'    cf.    yal  rfi  'wife's  friend' 
yal  rial    wife's  brother' 
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If  the  I0W2  nouns  ore  represented  with  a  rising  tone  as  In  (35b),  then  the 
simplification  we  see  In  (36)  Is  very  easy  to  state.  We  may  formulate  the 
rule  of  (37a)  to  delete  the  middle  tone  In  a  marked,  triply-linked  structure. 
With  the  help  of  this  rule,  we  have  the  derivation  of  (37b).  In  the  first  step 
high  tone  extension  applies,  followed  by  (37a)  eliminating  the  low 
sandwiched  between  the  two  highs.  Subsequent  simplification  of  the  falling 
tone  gives  waal  yM  sg'wfl 

(37)  a.  V 

/  I  \ 
H  L  H 

0 

b.   waal  yal  sewa  waal  yal  sewa         waal  yal  sewa 

I     l\   I    I  I   / 1 V I    I  I   /  I  / 1     I 

H     LHL  H-»  HLHLH-»       HHLH 

However,  if  the  I0W2  nouns  are  given  the  representation  of  (35a),  then  we 
require  the  rather  unnatural  rule  of  (38a)  below. 

(38)  a.       V   V  b.       waal  'husband',  ci  'father,  see  runs 

I    I  w6al  cf '  sde  'husband's  father  runs' 

H    L    ® 

0 

This  rule  must  crucially  apply  before  high-tone  extension.  Otherwise  the 
floating  high  would  become  associated  and  the  I0W2  nouns  could  not  be 
distinguished  from  a  normal  HLH  sequence,  which  does  yield  a  downstep. 
Compare  in  this  respect  (38b,c)  where  the  spreading  high  produces  a 
floating  low  on  £i  'father  that  does  downstep  the  following  HL  sequence  on 
Sg£.  Aside  from  lacking  any  particular  phonetic  motivation,  with  its 
reference  to  the  high  on  the  left  this  rule  repeats  part  of  the  high-tone 
extension  rule.  Furthermore,  it  is  the  only  rule  we  have  discovered  so  far 
which  would  have  to  crucially  refer  to  an  unassociated  tone  (indicated  by 
the  circle  on  the  rightmost  high).  To  this  point  in  the  analysis,  all  rules  have 
treated  associated  and  unassociated  tones  the  same.  For  these  reasons  we 
shall  analyze  the  I0W2  nouns  as  having  an  underlying  LH  rising  tone.  A  rule, 
perhaps  a  generalization  of  the  rule  simplifying  falling  tones,  delinks  the 
second  portion  before  the  phonetic  surface  is  reached. 
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To  summarize  the  analysis  to  this  point,  we  posit  four  underlying  tones 
on  monosyllabic  roots:  low,  high,  falling,  and  rising.  We  also  posit  a  general 
H-spread  rule  extending  the  domain  of  a  high-tone  to  the  right. 
Simplification  processes  eliminate  the  righthand  links  in  contour  tones.  A 
special  rule  deletes  the  L  portion  of  a  postulated  rising  tone  that  has 
undergone  H-spread.  Finally,  high  tones  are  phonetically  implemented  at  a 
lower  pitch  value  after  both  associated  and  unassoclated  lows. 

5.  Traces  of  tonal  polarity  in  Lama 

Unlike  in  Moore  where  essentially  every  noun-class  suffix  has  tonal 
polarity,  the  phenomenon  is  much  more  restricted  in  Lama.  Only  a  few 
suffixes  show  traces  of  polarity.  Most  do  not  alternate  in  tone.  Furthermore, 
this  polarity  can  only  be  detected  once  the  effect  of  several  phonological 
rules  is  removed. 

Below  we  list  nouns  drawn  from  the  paired  third  and  fourth  and  the  fifth 
and  sixth  noun  classes.  The  singular  class-3  and  5  suffixes  show  a  three- 
way  tonal  alternation  while  the  corresponding  class-4  and  6  plural  suffixes 
have  a  constant  falling  tone.  When  combined  with  a  monosyllabic  root  we 
find  three  suriace  tonal  patterns  in  classes  3  and  5:  a  low  on  both  the  root 
and  the  suffix  (eg.  [wur-ul  *nef,  (wun-ol  'river'),  a  high  on  the  root  and  a  fall 
on  the  suffix  (tao-u  "forest  tree',  wur-6  "chief"),  and  a  high  on  both  the  root 
and  the  suffix  (eg.  sam-u  "bat",  wur-6  'hilD.  The  existence  of  three  tonal 
patterns  immediately  reminds  us  of  Moore.  But  we  claimed  eariier  that  a 
low-low  pattern  would  be  systematically  absent  from  a  polar  system.  If 
this  is  correct  then  the  low-low  pattern  must  be  a  surface  effect  taht 
arises  from  some  other  tonal  representation.  This  reasoning  is  corroborated 
by  the  fact  that  nouns  in  the  low-low  pattern  consistently  induce  a  high 
tone  on  the  following  word  (like  yai  "woman"  discussed  above):  eg.  le  "under", 
but  wuru  11  'under  net';  sewa  'ran',  wun-o  si'wi  'river  ran'.    Consequently, 
we  assume  that  these  nouns  derive  from  a  low-rise  representation.  We  have 
indicated  this  rising  tone  in  our  transcriptions  but  warn  the  reader  that  this 
Is  a  phonological,  spelling.  Phonetically,  the  final  syllable  is  always  low  in 
tone  since  Lama  lacks  suriace  rising  tones. 


(39)  Class  3-4 

sing. 

Pl. 

a.   caru 
resu 

car$n 
resen 

'star 
'big  pot' 
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wuru 

wuren 

'net' 

kperu 

kperan 

•ladder- 

kpelu 

kpelen 

'hawk' 

kpodu 

kpadSn 

"walking  stick' 

kposy 

koos?n 

'flute' 

nambu 

nambsn 

'room' 

mandu 

mandin 

'bamboo  gun' 

nawu 

nawan 

'bull' 

nsQ 

nsSn 

sour 

1.       temu 

tam?n 

'gourd  plant' 

komu 

komen 

'kapok  tree' 

semu 

semen 

'nut  tree' 

telu 

telen 

'baobab  tree' 

t&PM 

tap  an 

'forest  tree' 

kpasu 

kpas|n 

'acajou  tree' 

kp6sO 

kp^san 

'cough' 

remO 

reman 

'pot  for  salt' 

haamu 

hiaman 

'arm' 

laaku 

laaken 

'jaw' 

cuku 

cuken 

'patio' 

celQ 

cen,  eel an 

'wing' 

kolO 

kon^  kolan 

'tree  sp' 

8910 

sen,  sal $n 

•tree  sp.' 

c.       cernu 

c§man 

'hen' 

laku 

I6k|n 

•big  stick' 

lurii 

luren^ 

millet  stick' 

kplru 

kperpn 

bridge' 

wuru 

wuran 

'vegetable  sp' 

spmy 

samen 

'bat' 

kaasu 

kaasan 

'crocodile' 

H^my 

l^^m^n 

'lemon' 

s6arv 

s6ar$n 

•sword^ 

nindy 

n6nd9n  ^ 

■gigof 

c6lO 

cfin,  c61  en 

'tree  sp.' 

y9lM 

y9n,yel9n 

•horn' 

class  5-6 

a.        wuno 

wunqse^ 

'river 

wonk$ 

wonk6sa 

donkey 
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hQndiJ 

hQnd6s$ 

moon 

S8l5 

ssl^sa 

couvercle" 

war$ 

wartsp 

money 

WMt$ 

wMt6s9 

wonj 

wur6 

wur^se 

•chief 

I6sd 

les^se 

'sour 

top 

t^QS? 

'cour' 

r»0 

h6s9 

•dog- 

wurt 

wurqse^ 

•hiir 

kp6cO 

kp6c6s$ 

'scorpion' 

tpmO 

t^mos? 

'loins' 

wyn6 

wOnisp 

•year' 

kp6sO 

kp6s6s9 

"mat" 

trp 

trass 

•star* 

For  classes  3  and  5  there  thus  are  just  three  tonal  patterns  for 
monosyllabic  roots:  low  on  the  root  and  a  rising  tone  on  the  suffix,  high  on 
the  root  and  a  fall  on  the  suffix,  and  finally  a  high  on  both  the  root  and  the 
suffix.  Only  a  little  imagination  is  required  to  relate  this  three-way 
contrast  to  the  one  we  saw  operating  in  Moore.  The  high-fall  pattern  can  be 
derived  from  a  high-low  one  via  a  generalization  of  the  high-tone  extension 
rule  to  apply  between  the  root  and  the  number  suffix.  (This  captures  a  wider 
generalization  In  Lama:  a  simple  low-toned  syllable  may  not  In  general 
follow  a  high.)  Simllariy,  we  may  derive  the  low-rising  pattern  of  wur-u 
and  wun-5  from  an  underiying  low-high  by  postulating  a  special  rule  that 
spreads  a  low  tone  across  the  boundary  between  the  root  and  the  suffix.  We 
state  this  rule  in  (40a).  The  proposed  derivations  of  wur-u  'net'  and  tio-u 
'tree'  are  sketched  in  (40b). 


(40)    a. 


VI 
I  .' 

L 


wur-u 
I  I 
L  H 


wur-u 
L    H 


wur-u 
L    H 


tap-M 
I  I 
H     L 


tap-y 
H    L 
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The  result  of  this  reconstruction  Is  a  tonal  system  Isomorphic  to  the 
Moore  one  discussed  earlier:  low-high,  high-low,  and  high-high.  The  Moore 
analysis  invites  us  to  derive  the  high-high  tonal  pattern  of  sem-u  "bat"  and 
wur-6  'hill'  from  an  underlying  toneless  root  plus  a  high-toned  suffix.  Some 
striking  support  for  this  Inference  Is  to  be  found  in  the  Lama  associative 
construction. 

5.  The  associative  construction  in  Lama 

As  In  Moore,  a  special  tonal  change  affects  a  noun  when  It  heads  an 
associative  construction.  When  viewed  purely  from  the  surface  the  tonal 
changes  appear  much  more  radical  in  Lama.  We  illustrate  with  class-3 
nouns  drawn  from  each  of  the  three  tonal  patterns.  In  (41a,b)  we  cite  these 
nouns  in  context  after  low  and  high  tones,  respectively.  In  this  case  the 
only  rule  of  note  is  high-tone  extension  which  spreads  the  tone  of  na  'see'  to 
the  initial  syllable  of  koelu  'hawk".  But  in  the  associatives  of  (41c)  both  high 
tones  of  samu  change  to  low  while  only  the  initial  high-toned  syllable  in 
taou  does.  (41  d)  shows  an  additional  application  of  high-tone  extension 
initiated  by  the  high  tone  on  waal. 

iA\) 

sam-u     'baf  kpel-u      'hawk'  tap-u  'tree* 

a.  nas^mg   with  bat"         nakpelu  'with  hawk'    na  tapO  "with  tree' 

b.  nas^mO  'see bat'  nakpelu  'see hawk'      n6  tapu 'see  tree' 

c.  ci  ssmu   'father's  bat'     ci  kpelu  'father's  hawk'  cl  tapO  'father's  tree' 

d.  weal  semu  'men's  bat'     waal  kp§lu  'man's  hawk'  waal  ti^pg  'man's  tree' 

Before  proceeding  further  with  the  analysis  we  must  note  that  the  low- 
low  shape  assumed  by  the  high-high  nouns  such  as  sgmQ  derives  from  an 
intermediate  low-rise  representation.  This  is  shown  by  the  paradigm  in 
(42);  sgmg  continues  to  induce  a  high  tone  on  a  following  word  such  as  sew6 
'ran'  when  It  heads  the  associative. 

(42)    ci  sgmg  se'wa     father's  bat  ran' 
w6a1  s^mg  s6'w6  'husband's  bat  ran' 

Suppose  that  we  borrow  two  features  of  the  analysis  of  the  associative 
proposed  for  Moore;  that  the  associative  is  mar1<ed  by  a  low-tone  particle 
and  that  the  nomlnals  in  the  high-high  tonal  pattern  derive  from  toneless 
roots.  This  implies  the  representations  In  (43a).  (We  assume  the  root  tone 
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has  already  associated).  Assoctatton  of  the  particle  to  the  toneless  root 
yields  the  representation  In  (43b).  Then  high  tone  dissimilation  polarizes 
the  sufflxal  high  to  low  in  Itap-yl  (43c).  So  far  everything  is  the  same  In  the 
two  systems.  But  one  respect  in  which  Lama  differs  from  hoore  is  the  low- 
spread  rule  (40a).  Suppose  that  this  rule  not  only  applies  between  the  root 
and  suffix  but  also  between  the  proclitic  particle  and  the  root.  The  result  is 
the  representations  of  (43d). 


(43)    a.      (S9m]  u 

L  H 

b.  sem  u 
I  I 
L     H 


(kpell  u 
1 

Itapl  M 
1 

L      L     H 

L    H     H 

(kpell  u 
1      1 

(tap)  u 
1     1 

Assoc.  Conv. 

1      1 
L       L     H 

1     1 
L    H     H 

Inappl 

tapy 

1    1 

L    H    L 

H-dlssIm 

kpel  u 

/I/  1 

L     L    H 

tap  u 
/I     1 
L  H    L 

L-spread 

inappl 

tap  u 
/   1/  1 
L    H    L 

H-extension 

kpelu 
1/ 

L 

lap  u 
I/I 
LHL 

rise  simpl 

Inappl 


S9m  u 
I  /! 
L    H 

inappl 


s^m  M 
1/ 

L 


In  the  next  step  high  extension  applies  to  derive  the  falling  tone  on  the 
suffix  in  Iflau  'tree'.  The  derivation  is  completed  In  (43f )  by  delinking  the 
high  tone  component  of  the  rising  tones. 

We  have  thus  been  able  to  explain  the  lowering  found  in  the  associative 
construction  by  postulation  of  a  low-tone  pariicle.  This  particle  surfaces 
in  a  wider  range  of  contexts  in  Lama  in  comparison  to  Moore  due  to  the 
former  language's  low-spreading  rule,  a  rule  that  Is  missing  in  hoore.  In 
other  respects,  the  two  tonal  systems  are  remarkably  similar.  Given  the 
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geographic  distance  separating  the  two  languages,  one  wonders  whether  the 
associative  lowering  was  not  a  feature  of  the  Proto-language.  Much  further 
study  of  the  tonology  of  the  Gur  languages  is  required  before  this  conjecture 
can  be  answered  one  way  or  the  other.  We  hope  that  this  paper  will 
stimulate  further  study  of  the  tonal  structure  of  Gur  languages. 

NOTES 

•The  first  three  sections  of  this  paper  were  written  In  collaboration 
with  Emmanuel  Nikiema  while  the  first  author  was  a  Visiting  Scholar  at  the 
Universite  du  Quebec  6  Montreal  in  the  spring  of  1986.  We  wish  to  thank 
Maria  Carreira  and  Laura  Downing  for  their  valuable  comments  on  an  earlier 
version  of  this  paper. 

'It  is  also  possible  that  the  high  tone  is  lexically  associated  to  the 
suffix  and  spreads  leftwards  to  the  toneless  stem.  For  our  purposes  the 
Important  point  Is  that  the  stem  is  toneless  and  acquires  its  tonal 
specification  from  the  suffix. 

2Tonal  systems  are  often  organized  in  such  a  way  that  all  affixes  in  a 
given  category  have  the  same  tonal  specification.  For  example,  in  many 
Bantu  languages  the  noun  prefixes  are  always  low-toned. 

^The  low-tone  raising  rule  of  Hausa  discovered  by  Leben  (1971)  is  a 
well-known  exception. 

''See  Ourso  (forthcoming)  for  a  fuller  discussion  of  Lama  phonology. 

5ln  the  plural  these  nouns  belong  to  class-4,  not  to  class-2.  See  Yu 
(1968)  for  discussion. 

^It  is  Interesting  to  note  that  loW)  and  I0W2  nouns  differ  in  segmental 
shape  as  well.  The  latter  contain  a  consonantal  coda  while  the  former  have  a 
simple  CV  shape.  Lama  syllable  structure  is  CV  or  CVC,  with  the  coda 
position  limited  to  sonorants.  It  is  possible  that  historically  the  sonorant 
coda  counted  as  a  tone-bearing  unit,  thus  permitting  a  noun  such  as  yfll  to 
hold  two  tones.  However,  synchronically  no  surface  rising  tones  occur  in 
Lama. 
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A  FUNCTIONAL  TYPOLOGY  OF  "ni"  IN  KIVUNJO  (CHAGA)» 

Lioba  Moshi 
Stanford  University 

Abstract 


This  paper  discusses  the  distribution  and  functions  of  a  morpheme  ni  in 
Kivunjo,  a  language  in  the  group  of  Chaga  languages  spoken  in  Tanzania. 
The  paper  argues  that  the  distribution  of  ni  in  Kivunjo  is  compatible  with 
its  major  function  in  Kivunjo  discourse,  namely  "focus  marking"  and  that 
it  is  not  always  morphologically  or  semantically  determined  as  suggested 
by  Dalgish  (1978,  1979).  Rather  ni  is  phonologically  an  integral  part  of 
nouns  especially  those  whose  grammatical  function  is  "subject".  The  paper 
also  discusses  the  relationship  between  the  distribution  of  this  morpheme 
with  the  tonal  patterns.  The  presence  or  absence  of  ni  in  a  clause  seem  to 
influence  the  way  tones  are  assigned  to  the  tone  bearing  units.  It  is  also 
argued  that  ni  primarily  indicates  focus  of  assertion  but  when  it  combines 
with  other  operations  (e.g.  position  in  the  sentence)  or  is  replaced  by  tone, 
it  marks  contrastive  focus. 

1.  Introduction 

In  his  analysis  of  ni  in  Kivunjo,^  Dalgish  (1978&1979)  notes  that  the  morpheme  ni 
occurs  optionally  before  verbs.  Nurse  and  Phillipson  (1977)  make  note  of  a  similar  mor- 
pheme in  Kimochi,  a  dialect  which  can  be  considered  mutually  intelligible  with  Kivunjo. 
While  Dalgish  designates  the  morpheme  a  pre-verbal  ni,  Nurse  and  Phillipson  (1977) 
name  it  a  stabilizer.  In  other  Bantu  languages^  in  which  a  similar  morpheme  has  been 
attested,  it  has  been  given  a  variety  of  names  including  "copulative  particle,"  "predica- 
tive morpheme,"  and  "stabilizing  element."  These  references  indicate  that  the  morpheme 
may  appear  segmentally  as  n  before  a  vowel-initial  verb  while  it  may  be  deleted  before  a 
consonant  (with  an  optional  length  on  the  consonant)  in  a  similar  position.  Marcel  van 
Spaandonck  (1973)  notes  that  in  many  Bantu  languages  (e.g.,  Zulu,  Shona,  Shambaa) 
the  morpheme  may  be  realized  on  the  surface  structure  (phonological)  or  on  a  deep- 
structure  (semantic)  level.  Dalgish  (1979:51)  further  notes  that  the  morphosyntactic 
aJternation  involving  the  presence  and  absence  of  the  pre-verbal  ni  (henceforth  pv-ni) 
can  be  explained  as  a  result  of  a  semantic  alternation  between  the  parameters  of: 

(a)  assertion  versus  non-assertion. 

(b)  positive  versus  negative  force  of  yes-no  questions. 

(c)  the  degree  of  certainty  concerning  the  truth  of  a  proposition  that  appears  in  a  sub- 
ordinate clause. 
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As  stated  in  the  abstract  of  this  paper,  I  intend  to  show  that  the  distribution  of  ni 
in  Kivunjo  is  dectated  by  its  focus  marking  function  and  that  its  presence  or  absence  is 
not  always  morphologically  or  semantically  determined. 

The  paper  begins  with  a  short  discussion  of  the  typology  of  this  morpheme  ni,  then 
proceeds  to  look  at  the  distributional  patterns.  Meanwhile  I  offer  a  hypothesis  which  may 
explain  the  restrictions  associated  with  the  distribution  of  the  particle  in  different  clause 
types.  I  argue  that  the  distributional  patterns  reflect  a  distinction  between  assertive 
focus,  contrastive  focus  and  the  lack  of  the  need  for  focus  marking  in  a  given  proposition. 

These  claims  are  consistent  with  generalizations  that  have  been  made  about  mark- 
ers that  are  said  to  behave  similarly  in  other  languages.  For  example,  Marchese  (1983) 
identifies  some  particles  in  West  Kru  languages^  that  resemble  the  Kivunjo  ni  in  their 
function  and  distributional  patterns.  Marchese  notes  that  these  particles  occur  in  affir- 
mative declarative  sentences  and  are  excluded  in  negatives,  imperatives  and  subordinate 
clauses.  The  particles  may  appear  as  an  independent  particle,  a  suffix  attached  to  the 
verb,  or  merely  cis  a  high  tone.  Marchese  further  notes  that  these  particles  can  be  used 
as  focus  markers  distinguishing  focus  of  assertion  from  contrastive  focus.  As  will  be 
shown,  these  characteristics  summarize  the  characteristics  of  the  Kivunjo  ni. 


2.  The  typology  of  ni 

Phonologically,  ni  may  be  realized  as  a  variety  of  assimilated  nasals-for  example,  a 
syllabic  n  before  vowels(except  the  front  high  vowel  [i]-t)  (velar)  and  n  (palatal)  before  a 
consonant,  and  the  front  vowel  [i].  Further,  it  may  be  totally  assimilated  when  it  occurs 
before  another  nasal.  Otherwise  it  appears  as  ni  especially  before  NPs  at  sentence  initial 
(S-initial)^  position.  The  following  examples  show  these  possibilities. 


(1) 


h-      ii-  le-     kapa  mana  ukou. 

foe-  sm-2sg-  pst-  beat    child    yesterday 
"You  beat  the  child  yesterday." 

g-ki'te       ki'-  le-     pfa. 

foe-dog-  sm-cl7-  pst-  die 
"The  dog  died." 

n-tsi'ndo     i-  le-     in-     lem-  i-         a. 

foe-hump  sm-cUO-  pst-  om-  overcome  appl.  fv 
"The  hump  was  heavy  on  him" 

mfo  -u-  le-     raa. 

foe-river  -sm-cl4-  pst-  smell  bad 
"The  river  smelled  bad." 
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e.     ni     fhfo-  u-  le-     raa. 

foe  river  -sm-cl4-  pst-  smell  bad 
"The  river  smelled  bad." 

This  example  shows  n  before  vowels  (l)a,  ij  before  a  palatal  (l)b,  n  before  a  velar 
(l)c,  total  assimilation  before  another  nasal  (l)d,  and  as  ni  before  an  NP  (l)e.  Thus, 
(l)d  and  (l)e  may  be  considered  to  be  in  free  variation  which  can  also  be  argued  for 
(l)b,  and  (l)c  but  not  (l)a  for  reasons  that  will  be  discussed  later. 

Although  Dalgish  distinguishes  pv-ni  from  the  ni  found  in  other  syntactic  postions, 
I  will  argue  that  the  two  are  functionally  similar.  I  will  claim  that  pv-ni  is  derived  from 
ni  which  tends  to  cliticize  onto  the  verb. 

In  his  analysis,  Dalgish  (1979:48)  posits  morphosyntactic  constraints  for  pv-ni.  He 
claims  that  pv-ni  cannot  occur  before  vowels  except  the  second  and  third  person  singular 
subjects,  (u)  and  (o)  respectively.  Thus,  ni  cannot  appear  where  agreement  on  the  verb 
indicates  an  association  with  nouns  from  certain  noun  classes-in  particular,  classes  whose 
concordial  prefixes  are  vowels  (e.g.  class  3  and  9).  Consider  the  following  examples: 


(2) 


a.  mfo    (n)-  u-  le-     raa. 
river  foe-  sm-cl3-  pst-  smell  bad 
"The  river  smelled  bad." 

b.  mfo    (n)-  1-  le-     r'aa. 
river  foe-  sm-cl4-  pst-  smell  bjtd 
"The  rivers  smelled  bad." 

c.  unginyi  (n)-   ii-  le-     m-     kooya  kacha. 
bigness    foe-  sm-cll4-  pst-  om-  find      time  before 
"Power  found  its  way  to  him  a  long  time  ago." 

d.  hjofu  (n)-  i-  le-  pfa. 
elephant  foe-  sm-cl9-  pst-  die 
"The  elephant  died." 


According  to  Dalgish,  (2)a-d  should  be  ungrammatical  because  the  pv-ni  is  associated 
with  the  vowel-subject  agreement  markers  for  nouns  from  class  3  (example  2a),  class  4 
(example  2b),  class  14  (example  2c),  and  class  9  (example  2d).  Notice,  however,  that 
the  0  indicates  that  the  morpheme  may  optionally  appear.  As  will  be  shown  later,  the 
presence  of  the  morpheme  wiU  indicate  that  the  clause  is  semantically  independent  while 
its  absence  will  indicate  that  it  is  semantically  dependent  on  another  clause  or  some 
previous  discourse.  Examples  (3)a  and  (3)b  below  further  illustrates  this  optionality 
where  ni  is  associated  with  subject  agreement  markers  for  second  and  third  person 
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singular  (animate). 

(3) 

a.  iyoo  (n)    -ii  -le     -soma  kitapu 

you    -foe  -sm-2sg  -pst  -read    book 
"You  read  a  book". 

b.  Ohanyi  (n)    -a  -le     -soma  kitapu 
you         -foe  -sm-3sg  -pst  -read    book 
"John  read  a  book". 

Dalgish  makes  no  mention  of  this  optional  appearance  of  the  pv-ni  when  associated 
with  the  prefixes  denoting  the  second  and  third  person  singular  (animate).  As  will  be 
shown  shortly,  the  alternation  possibilities  borne  out  in  examples  (2)  and  (3)  are  crucial 
in  the  analysis  of  the  distributional  patterns  of  this  morpheme  in  Kivunjo. 

3.  The  distribution  of  m 

As  I  mentioned  earlier,  the  position  of  ni  in  a  clause  can  be  associated  with  the 
focus  marking  of  an  element  in  that  clause.  There  are  two  possible  ways  in  which  the 
association  of  ni  with  focus  marking  can  be  discussed:  subject  and  non-subject  focusing 
marking.  In  general,  Kivunjo  favours  subject  focusing  over  non-subject  focusing  while 
non-subject  focus  marking  is  reserved  for  special  discourse  situations.  When  a  non- 
subject  is  focus  marked,  a  special  reading  indicates  the  speaker's  intention  to  provide  a 
focus  of  contrast  between  the  focal  element  and  another  in  previous  discourse.  When  a 
subject  is  the  focal  element,  both  marked  and  unmarked  readings  are  possible,  but  this 
depends  on  the  position  the  focus  marker  occupies  in  the  clause.  In  this  section  I  will 
discuss  both  subject  and  non-subject  focusing,  starting  with  subject  focusing. 


3.1.  Subject  focusing 

A  subject  may  be  focused  in  two  ways.  The  first  is  by  placing  the  focus  marker  ni 
before  the  NP  whose  grammatical  relation  is  "subject".  The  second  is  by  attaching  the 
focus  marker  to  the  verb  that  bears  a  grammatical  relation  to  the  subject.  In  the  latter 
case,  the  subject  NP  is  optional  while  the  subject  marker  (sm),  which  is  its  anaphor, 
obligatorily  appears  on  the  verb  immediately  following  the  focus  marker.  This  pattern 
is  what  Dalgish  (1978,  1979)  refers  to  as  pv-ni.  When  the  focus  marker  precedes  the 
NP,  we  get  a  contrastive  reading  in  which  the  NP  (here  the  subject)  is  the  contrasted 
element.  When  the  focus  marker  appears  as  pv-ni  we  get  a  focus  of  assertion  reading. 
The  following  examples  are  instructive. 


(4) 

a.     (Ni)     mfo-    i-  le-     raa. 

(foe)-  river-  sm-cl4-  pst-  smell  bad 

"The  rivers  (not  someting  else)  smelled  bad." 
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(Ni)    ungdnyi  u-  16-     m-     kooya  kacha. 

(foe)  bigness    -sm-cll4-  pst-  om-  find      time  before 
"Power  (not  status)  found  its  way  to  him  a  long  time  ago. 

(N)      msodo  a  -le     -somi  kitapii? 

(foe)-  you-      -sm-3sg  -pst  -read    book 
"Did  the  man  read  a  book?" 


(5) 

a.  Mfo     (n)-  i-  le-     raa. 
river-  foe-  sm-cl4-  pst-  smell  bad 
"The  rivers  smelled  bad." 

b.  Unganyi'  n-       ii-  le-      in-     kooya  kacha. 
bigness     -foe-  sm-ell4-  pst-  om-  find      time  before 
"Power  found  its  way  to  him  a  long  time  ago." 

c.  Msondo  n       -a  -le     -soma  kitapu? 
man-       foe-  -sm-3sg  -pst  -read    book 
"Did  the  man  read  a  book?" 

In  (4)a-c,  we  have  a  ease  in  whieh  the  foeus  marker  precedes  the  subject-NP  to  mark  it  as 
the  focal  element  and  to  provide  a  eontrastive  reading  of  the  subject.  In  (5)a-c  the  foeus 
marker  appears  as  pv-ni  before  the  verb.  The  subject  is  still  the  focused  element  except 
that  in  this  case  it  does  not  have  a  eontrastive  focus  reading.  Instead  the  proposition 
has  a  focus  of  assertion  reading. 

Now  compare  the  tone  assignment  in  the  two  sets  of  examples.  In  (4)a-c,  the  subject 
NPs  have  an  initial  (first  syllable)  high  tone  (henceforth  H-tone).  In  (5)a-c,  the  subjeet- 
NPs  have  an  initial  low  tone  (henceforth  L-tone).  To  explain  this  alternation,  it  is 
necessary  to  posit  an  abstract  S-initial  L-tone.  This  L-tone  will  spread  onto  any  element 
on  S-initial  position  (Inkelas  1986,  McHugh  1985a)^.  In  addition  nj  is  underlyingly  H- 
tone  and  will  spread  its  Il-tone  to  the  first  syllable  of  the  word  in  juxataposition^.  The 
L-tone  on  ni  has  its  source  from  the  posited  abstract  S-initial  (also  boundary)  L-tone. 
This  alternation  is  further  illustrated  in  example  (6)  below. 


(6) 


n-      ii-  le-     raa. 

foe-  sm-2sg-  pst-  smell  bad 
"(The  river)  It  smelled  bad." 

^u-  le-     raa. 

sm-2sg-  pst-  smell  bad 
"It  smelled  bad." 
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(6)a  but  not  (6)b  is  acceptable  because  of  the  requirement  that  ni  should  appear  in 
this  construction,  not  only  to  indicate  the  focal  element  but  also  to  block  the  spreading 
of  the  floating  L-tone  onto  the  focal  element,  the  subject  marker  which  refers  to  the 
deleted  subject-NP. 

The  claim  that  the  presence  and  absence  of  ni  corresponds  to  the  alternation  between 
H-tone  and  L-tone  is  demonstrated  by  example  (7)a  and  (7)b  below. 


(7) 

a.  (ni)  mfo  ii-  le-  raa. 
L  H 

"The  river  smelled  bad." 

b.  mfo  n-  u-  le-  raa. 
L  H 

"The  river  smelled  bad." 

While  (7)a  correlates  to  the  tone  shift  pattern  in  (4)a-c  and  (6)a,  (7)b  exhibits  the 
pattern  in  (5)a-c  and  (6)b. 

By  considering  ni  to  be  the  source  for  the  H-tone  on  focal  elements  and  by  using  tone 
shift  to  explain  the  appearance  of  this  H-tone  on  the  focal  element,  I  am  also  claiming 
that  phrase  initial  position  is  a  focus  position.  The  claim  is  supported  by  an  observation 
made  by  Harries  (1940:134)  about  Mawia  (a  Bantu  language  spoken  in  Tanzania  and 
Mozambique)  that  nouns  of  class  5  drop  their  prefix  i  and  raise  the  tone  on  the  second 
syllable.  He  then  suggests  that  the  H-tone  must  be  originating  from  a  floating  high, 
which  supports  the  hypothesis  of  an  optional  S-initial  ni  in  the  case  of  Kivunjo  as  shown 
in  (4)a-c  and  (6)a. 

The  relationship  between  ni,  the  H-tone,  and  the  S-initial  position  is  that  they 
all  have  a  focus  function.  While  ni  may  appear  optionally,  the  H-tone  is  obligatory. 
Further,  ni  is  optional  at  S-initial  position  except  where  there  is  a  subject-NP  deletion 
which  conditions  the  appearance  of  pv-ni.  In  other  words,  ni  cannot  be  replaced  by  just 
the  H-tone  if  the  subject-NP  has  been  deleted  in  that  sentence  structures. 

Additional  support  for  S-initialposition  ni  comes  from  the  fact  that  citations  always 
show  an  initial  H-tone.  Example  (8)  shows  a  case  of  nouns  in  citation  forms*. 


(8) 


►mfo 
river 
"A  river/It  is  a  river.' 

(Ni)  ihfb 
foe.  river 
"A  river/It  is  a  river.' 


Ill 


Notice  that  (8)a  is  unacceptable  because  of  the  L-tone,  which  is  disallowed  in  cita- 
tions. The  grammar  has  a  rule  that  requires  each  string  to  have  at  least  one  element 
in  focus.  Thus,  citations  that  involve  single  items  have  to  carry  a  H-tone  on  that  single 
item  irrespective  of  whether  the  focus  marker  is  segmentally  present  or  not. 

There  is  also  further  evidence  for  the  alternation  between  focus  of  assertion  and  con- 
trastive  focus.  Because  tone  is  lexicalized,  the  syntactic  tonal  profiles  depend  on  options 
concerning  the  function  of  tone  and  its  alternation  with  the  focus  marker  ni.  Speakers 
use  H-tone  for  emphasis,  to  provide  contrastive  reading,  and  in  asserting  a  strong  point 
of  view  (cf.  Clements  1984a:318  for  similar  views  on  the  relationship  between  ne  and 
tone  in  Kikuyu).  For  example: 


(9) 


a.  (Ni)  kitapu  kicha. 
(foe)  book  good 
"A  good  book." 

b.  Kitapu  (ni)  kicha. 
book  (foe.)  good 
"The  book  is  good." 


A  speaker  will  say  (9)a  when  he  wants  to  merely  assert  something  about  a  particular 
book  which  is  also  known  by  the  hearer.  This  will  be  appropriate  in  a  situation  where 
the  speaker  wants  to  offer  a  recommendation.  On  the  other  hand,  a  speaker  will  say 
(9)b  when  he  wants  to  contrast  the  book  with  something  else  which  is  known  by  both 
the  speaker  and  the  hearer.  Notice  that  (9)a  is  possible  with  or  without  ni  in  place 
segmentally  because  the  intended  reading  can  be  achieved  through  tone. 

To  summarize,  I  have  tried  to  show  that  ni  may  be  used  to  mark  the  subject  £is  the 
focal  element  in  a  clause.  It  usually  appears  prefixed  onto  the  verb  immediately  before 
the  subject  marker  in  constructions  where  the  subject-NP  is  not  overtly  expressed. 
Otherwise  ni  will  appear  optionally  as  an  unbound  morpheme  before  a  subject-NP,  and 
it  will  assume  a  S-initial  position.  Its  absence  may  be  compensated  for  by  a  floating 
H-tone  on  the  initial  syllable  of  the  subject-NP.  This  option  is  very  much  utilized  in  fast 
speech  and  unmarked  propositions.  The  appearance  of  the  H-tone  on  the  NP  can  be 
explained  by  a  tone  shift  phenomenon  which  is  common  in  many  Bantu  languages.  The 
H-tone  compensation  is  not  a  possible  alternative  for  pv-ni. 

Because  ni  can  be  replaced  by  a  H-tone  on  the  first  syllable  of  the  subject-NP,  its 
absence  does  not  seem  to  affect  the  coherence  of  the  construction.  However,  when  extra 
emphasis  is  required,  this  focus  marker  is  generally  required. 

It  was  noted  that  ni  will  appear  before  the  subject-NP  instead  of  its  pre-verbal 
position  when  a  contrast  of  focus  is  intended.  Thus  what  was  identified  as  pv-ni  by 
Dalgish  (1978,  1979)  is  in  fact  a  case  of  a  neutral  reading  of  the  proposition. 
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3.2.0.  Non-subject  focusing  by  clefting 

Clefting  is  used  here  to  mean  the  preposing  or  postposing  of  an  element.  An  element 
may  be  preposed  to  ensure  that  the  hearer  gives  it  maximum  attention.  The  data  used 
for  this  study  showed  the  preposed  materials  to  be  old  information  and  definite.  In 
this  section  I  will  discuss  the  distribution  of  ni  in  relation  to  preposed  objects,  and 
adverbs.  I  will  also  examine  the  behaviour  of  ni  in  re-ordered  clauses  where  one  clause 
is  subordinated  or  embedded  in  another. 

3.2.1.  Clefting  the  object 

Kichaga  is  an  SVO  language.  Thus  the  normal  position  for  the  object  is  after  its 
verb.  I  noted  earlier  that  the  subject  is  inherently  the  focus  of  a  proposition  by  virtue 
of  its  S-initial  position  in  a  clause  and  that  this  position  is  a  preferred  focus  position. 
Consequently,  if  an  element  other  than  the  subject  is  the  focus  of  a  proposition,  it  has 
to  occupy  the  S-initial  position.  Examples  show  that  when  an  object  of  a  verb  is  clefted, 
ni  or  a  floating  H-tone  is  obligatorily  present.  The  following  examples  show  a  case  of 
clefted  objects. 


(10) 

a.  (Ni)  mana  Ohanyi  a-  le-  rh  -kapa 
(foe)  obj-child  su-John  sm-3sg-  pst  -om-3sg  -beat 
"It  is  the  child  (that)  John  beat  (him)." 

b.  *mana        Ohanyi    a  -le     -in  -kapa 

obj-child  su-John  sm-3sg-  pst-  om-3sg  -beat 
"It  is  the  child  (that)  John  beat  (him)." 

c.  *(Ni)    mana        Ohanyi    n-      a-  le-      in-  kapa 

(foe)  obj-child  su-John  foe-  sm-3sg-  pst-  om-3sg  -beat 
"It  is  the  child  (that)  John  beat  (him)." 

(10)a  is  acceptable  because  the  focus  marker  has  been  appropriately  placed  before 
the  clefted  NP  mana  which  is  also  the  object  of  the  main  verb.  (10)b  is  unacceptable 
because  neither  the  focus  marker  nor  the  H-tone  has  been  used.  (10)c  is  also  bad  because 
more  than  one  element  in  the  same  clause  is  focus  marked.  The  focus  marker  appears 
at  S-initial  position  as  well  as  pv-ni 


3.2.2.  Pseudo-clefts. 

It  appears  that  a  pseudo-cleft  in  Kichaga  is  simply  a  sentence  whose  subject-NP  is  a 
free  relative  clause  without  a  main  verb.  In  such  a  consruction,  ni  appears  to  resemble 
the  copula  in  other  languages.  Although  Dalgish  (1978,  1979)  argues  for  a  ni  in  Kivunjo 
which  may  be  designated  the  copula  and  which  is  distinct  from  what  he  refers  to  as  the 
pv-ni,  this  study  did  not  find  ni  in  Kivunjo  to  have  a  strong  resemblance  to  the  copula 
in  other  languages.  Thus,  despit  translation  equivalence  the  Kivunjo  ni  does  not  fit  the 
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description  of  a  copula.  There  are  unique  differences  between  this  ni  and  the  copula 
in  other  languages  in  particular  English.  In  general,  this  ni  is  optional  even  where  it 
translates  to  a  copula  in  equational  sentences.  Further  it  assimilates  to  a  nasal  where 
the  environment  is  appropriate,  it  becomes  a  syllabic  n  before  vowels,  and  it  may  appear 
as  a  tonal  clitic  (H-tone)  on  the  first  syllable  of  the  tone  bearing  unit.  The  following 
examples  are  instructive. 


(11)  .  .     . 

a.  Ohanyi  ni     msungii. 

John       foe  european 
"John  is  a  European." 

b.  6hanyi  n       -anganyi. 
John       foe-  big. 
"John  is  big." 

c.  Ohanyi  (-)  rhsungu. 
John  (-)  european 
"John  is  a  European." 

In  (ll)a  we  have  a  typical  case  in  which  ni  may  be  construed  as  the  copula.  The 
same  case  expressed  in  (ll)b  shows  that  the  same  ni  may  appear  as  syllabic  n  which  is 
conditioned  by  the  vowel.  (ll)e  which  may  be  used  to  show  the  optional  appearance  of 
ni  as  well  as  an  instance  of  total  assimilation,  is  yet  another  proof  that  this  ni  shares 
the  same  characteristics  afforded  the  focus  marker  ni.  It  is  possible  also  to  associate  the 
function  of  "focus  of  assertion"  to  this  ni  since  it  indicates  the  speaJcer's  point  of  view. 

It  is  obvious,  therefore,  that  translation  equivalence  is  not  a  necessary  nor  a  sufficient 
criteria  for  the  consideration  of  ni  in  Kivunjo  to  be  similar  to  what  may  be  designated 
the  copula  across  languages. 

Now  consider  the  examples  of  ni  in  pseudo-clefts. 


(12) 


Kindb  Ohanyi  a-  le-  udla   (g)    kite. 

thing    John       sm-3sg-  pst-buy  (foe)  dog 
"What  John  bought  is  a  dog." 

b.  *(Ni)  ki'ndo  Ohanyi  a-  le-  udla  (r))    kite. 

foe     thing   John       sm-3sg-  pst-buy  (foe)  dog 
"What  John  bought  is  a  dog." 

e.   ♦Kindo  Ohanyi  n      -a-  le-  udla   (q)    kite, 

thing    John       foe  -sm-3sg-  pst-buy  (foe)  dog 
"What  John  bought  is  a  dog." 
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d.   *Kind6  Ohanyi'  n      -a-  le-     udla  kite, 

thing    John       foe  -sm-3sg-pst-buy  dog 
"What  John  bought  is  a  dog." 

The  use  of  ni  in  (12)a  is  very  much  similar  to  that  of  the  equational  sentences  in  (ll)a. 
Notice  that  both  (12)b  and  (12)c  are  unacceptable  because  of  the  multiple  appearance 
of  ni.  In  addition  pseudo-clefts  require  that  the  distribution  of  ni  be  compatible  with 
the  tonal  patterns  of  the  construction.  (12)d  is  unacceptable  because  of  the  absence 
of  the  necessary  focus  markers.  Thus,  although  ni  is  segmentally  optional  in  pseudo- 
clefts,  its  optionality  has  to  be  compensated  by  a  H-tone  on  the  initial  syllable  of  the 
pseudo-clefted  material. 

The  appearance  of  the  focus  marker  is  a  necessary  indication  that  the  clefted  material 
with  which  it  occurs  is  the  focal  element.  Thus  kite  'dog'  in  (12)a-d  represent  the  focal 
element  with  a  contrastive  focus  reading. 

On  the  basis  of  these  facts,  I  consider  the  focus  marker  ni  and  the  ni  found  in  clefts 
and  in  equational  sentences  to  be  the  same.  It  does  not  seem  necessary  to  posit  two 
types  of  ni  with  the  same  syntactic  and  functional  characteristics. 


3.2.3.0.  Wh-questions 

A  wh-question  involves  the  same  presuppositional  structure  as  stress-focus,  cleft,  or 
pseudo-cleft.  Thus  a  questioned  element  is  necessarily  a  focal  element  because  the 
performative  function  of  the  wh-word  is  to  request  new  information.  The  performative 
function  of  focus  is  to  supply  or  state  new  or  additional  information.  In  Kivunjo,  an 
element  can  be  questioned  in  two  ways:  first,  by  placing  the  wh-word  in  situ,  and 
second,  by  preposing  or  postposing  the  wh-word.  The  wh-word  may  be  preposed  when 
questioning  elements  other  than  the  subject.  The  wh-word  may  be  postposed  when 
questioning  the  subject. 


3.2.3.1  Questioning  the  subject 

When  the  subject  is  the  element  being  questioned,  the  focus  marker  is  required 
whether  the  wh-word  is  in  situ  or  elsewhere.  Consider  the  following  examples. 

a.  (ni)     wi     a-  le-     soma  kitapii? 
(foe)  wh-  sm-3sg-  pst-  read    book 
"(it  is)  who  read  the  book?" 

b.  a-  le-  soma  kitapu  ni  wi? 
sm-3sg-  pst-  read  book  foe  wh- 
"(it  is)  who  read  the  book?" 
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•■(n)-     a-  le-     soma  kitapu  wi? 

(foe)  -sm-3sg-  pst-  read    book     wh- 
"(it  is)  who  read  the  book?" 


(13)a  and(13)b  are  acceptable  but  (13)c  is  unacceptable  with  or  without  the  focus 
marker  in  S-initial  position.  This  can  be  explained  by  the  fact  that  the  preferred  po- 
sition for  the  wh-word  when  questioning  the  subject  is  in  situ  (i.e.  S-initial)  position. 
Otherwise  the  questioned  subject  will  be  out  of  focus.  This  explains  the  fact  that  the 
focus  marker  is  required  when  the  wh-word  for  the  subject  is  not  in  situ.  This  data  is 
comparable  to  Kikuyu  (cf.  Clements  1984b:38)  as  the  following  examples  show. 


(14) 


no  6-      tem-  ir-       e    mbtfe? 

foc-wh-  sm-  cut-    pres-  fv  tree 
"Who  cut  the  tree  ?" 

*b-      tem-  ir-      e      mote  no?^ 
sm-  cut-    pres  fv-  tree     foc-wh 
"Who  cut  the  tree?" 

6-     tem-  ir-       e    mote  6? 
sm-  cut-    pres-  fv  tree    foc-wh 
"Who  cut  the  tree?" 


Notice  that  (14)b  is  unacceptable  although  its  equivalent  in  Kivunjo,  (13)c,  is  ac- 
ceptable. On  the  other  hand,  (14)c  is  also  acceptable  although  its  counterpart  (13)b 
in  Kivunjo  is  unacceptable.  This  provides  an  interesting  comparison.  It  seems  that  in 
Kikuyu  the  focus  marker  is  generally  not  required  when  questioning  the  subject,  but 
it  cannot  appear  if  the  wh-word  is  a  pseudo-cleft  as  in  (14)b.  In  Kivunjo,  however, 
the  focus  marker  or  its  equivalent  must  appear  whether  the  wh-word  is  in  situ  or  is  a 
pseudo-cleft.  It  seems  that  in  Kikuyu,  questions  have  an  intrinsic  focus  and  will  have  the 
focus  marker  ne  only  at  S-initial  position  because  that  position  is  used  for  contrastive 
focus.  In  Kivunjo,  on  the  other  hand,  questions  do  not  have  intrinsic  focus  and  will 
require  the  focus  marker  to  provide  assertive  focus.  Thus  the  S-initial  position  plus  the 
focus  marker  is  used  to  provide  contrastive  focus. 


3.2.3.2.  Questioning  the  object 

Questioning  the  object  differs  from  questioning  the  subject  by  the  fact  that  the  focus 
marker  may  not  appear  when  the  wh-word  is  in  situ  and  must  appear  when  the  wh-word 
is  clefted.  Consider  these  examples. 
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(15) 

a.  *u-  le-     soma  (ni)     kiki? 

sm-2sg-  pst-  read    (foe)  wh- 
"(It  is)  what  did  you  read  ?" 

b.  (n)-  ii-  le-  soma  kiki? 
foe-  sm-2sg-  pst-  read  wh- 
"(It  is)  what  did  you  read  ?" 

c.  (ni)     kiki  u-  I'e-  soma? 
(foe)  wh-  sm-2sg-  pst  read 
"(It  is)  what  did  you  read?" 

d.  *kiki  n-      u-  le-    soma? 

wh-  foe-  sm-2sg-  pst   read 
"(It  is)  what  did  you  read?" 

(15)a  is  unacceptable  because  the  focus  marker  appears  with  the  Q-word  in  situ. 
(15)d  is  also  unacceptable  because  the  wh-word  has  been  clefted  but  neither  the  focus 
marker  nor  its  equivalent  are  present. 

This  requirement  for  the  focus  marker  to  be  present  implies  that  information-seeking 
questions  for  subjects  will  always  have  a  focus  of  assertion  marking.  Other  questioned 
elements  can  only  be  focus  marked  for  a  contrastive  reading. 


3.2.3.3.  Yes/No  questions 

It  has  been  argued  (cf.  Dalgish  1978,  1979)  that  the  presence  or  absence  of  the  pv-ni 
in  questions  is  determined  by  the  type  of  response  expected  from  the  question.  The  main 
cltiim  is  that  the  presence  of  ni  solicits  a  positive  response,  while  its  absence  solicits  a 
negative  response.  Thus  a  positive  response  would  be  expected  from  (16)a  below,  while 
a  negative  response  would  be  expected  from  (16)b. 

(16) 

a.  Ohanyi  a-  le-     kapa  kite  a-  le-    oua? 
John       sm-3sg-  pst-  beat    dog  sm-3sg  pst  see 
"Did  John  hit  the  dog  that  he  saw?" 

b.  Ohanyi  n-      a-  le-     kapa  kite  a-  le-    ona? 
John       foe.  -sm-3sg-  pst-  beat    dog  sm-3sg  pst  see 
"Did  John  hit  the  dog  that  he  saw?" 

However,  when  speakers  of  the  same  dialect  investigated  by  Dalgish  were  asked  in 
a  separate  study  (conducted  by  the  author  of  tliis  paper)  to  interpret  (16)a  and  b, 
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the  distinction  was  not  between  affirmative  and  negative  responses.  In  fact  either  of 
the  two  can  solicit  a  yes/no  response.  Speakers  revealed  that  (16)a  would  be  used 
when  initiating  a  conversation,  while  (16)b  would  be  used  only  in  the  middle  of  an 
ongoing  conversation.  Furthermore,  in  (16)a,  the  respondent  is  expected  to  posit  new 
information  while  in  (16)b,  the  respondent  is  expected  to  confirm  information  already 
shared  by  the  speaker.  Thus  the  distinction  h'jre  is  between  "asserted  new  information" 
and  "asserted  old  information".  Consequently  ni  is  required  in  the  case  of  (16)b  but  not 
in  (16)a.  It  is  plausible  to  conclude  that  yes/no  questions  of  type  {16)a  do  not  require 
the  focus  of  assertion  because  they  have  an  intrinsic  focus  of  assertion.  In  general,  yes/no 
questions  question  the  assertion  and  do  not  necessarily  focus  on  a  particular  element 
(cf.  Marchese  for  a  similar  observation  about  West-Kru  languages).  Yes/no  questions 
of  the  type  exemplified  by  (16)b  require  the  focus  marker  because  they  are  not  intended 
to  question  the  assertion  but  rather  to  focus  on  a  particular  element,  in  this  case  the 
subject. 

Another  interesting  feature  noted  in  wh-questions  involves  the  question  tags  wai  and 
mcho.  Consider  the  following  examples. 


(17) 

a.  Ohanyi  n       -a  -le     -kapa  mana  wai? 
John       foe.  -sm-3sg  -pst  -beat   child    tag 
"John  beat  the  child,  didn't  he?" 

b.  *Ohanyi  a  -le     -kapa  mana  wai? 

John       -sm-3sg  -pst  -beat   child    tag 
"John  beat  the  child,  didn't  he?" 

c.  Ndesambudlo  a  -le  -i'zra  na  niimba  pfo  ee? 
NdesamboDo  sm-3sg  -pst  -pass  into  house  neg  tag 
"NdesambuDo  didn't  enter  the  house,  did  he?" 

d.  *Ndesambudlo  n-      a  -le     -izra    na     numba  pfo   ee? 

NdesambuDo  foe-  sm-3sg  -pst  -pass  into  house     neg  tag 
"NdesambuDo  didn't  enter  the  house,  did  he?" 

e.  Mana  n       -a  -endd  mcho? 
child    foe.  -sm-3sg  -go       tag 
"The  child  has  gone,  hasn't  he?" 

f.  *Mana  -a  -enda  rhcho? 

child    -sm-3sg  -go       tag 
"The  cliild  has  gone,  hasn't  he?" 

By  using  the  focus  marker  in  (17)a,  the  speaker  is  making  an  assertion  as  he/she 
attempts  to  solicit  a  confirmation  or  denial  from  the  respondent.  The  absence  of  the 
focus  marker  in  (17)b  defeats  the  speaker's  purpose,  which  explains  its  unacceptability. 
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In  (17)c,  the  speaker  is  trying  to  justify  what  he/she  presupposes.  There  is  no  evidence 
that  the  speaker  has  any  expectations  with  respect  to  the  respondent's  contribution  to  his 
assertion.  Instead  the  speaker  expects  the  hearer  to  confirm  that  the  action  indeed  did 
take  place.  Thus  (17)d  is  unacceptable  because  the  focus  marker  allows  the  proposition 
to  have  a  focus  of  assertion  which  is  not  compatible  with  the  speaker's  uncertainty  which 
is  signalled  by  pfo.  The  evidence  required  in  (17)c  is  in  (17)e  in  which  mcho  is  used 
and  which  implies  that  the  speaker  does  not  expect  his  proposition  to  be  challanged. 
Thus  (17)f  is  unacceptable  because  it  lacks  the  focus  of  assertion  which  will  provide  the 
necessary  condition  for  mcho  to  indicate  that  the  proposition  is  unchallengeable. 

Thus  the  tags  wax,  pfo-ee  and  mcho  reflect  the  speaker's  attitude  more  than  what 
he  anticipates  of  the  respondent.  This  in  turn  reflects  on  the  distribution  of  the  focus 
marker  in  such  constructions. 

There  is  an  additional  fact  about  the  use  of  these  tags  which  bears  on  the  distribution 
of  the  focus  marker.  The  adnominals  wax  and  mcho  will  be  used  where  the  speaker  does 
not  expect  that  the  hearer  will  challenge  the  proposition.  Contrary  to  Dalgish  (1979:50), 
the  adnominals  themselves  do  not  carry  a  presuppositional  interpretation  nor  show  the 
ability  to  control  the  type  of  response  that  will  be  provided  by  the  respondent. 

To  conclude  this  section,  it  is  clear  that  in  questioning  the  subject  or  the  object,  the 
focus  marker  is  preferred.  However,  there  are  different  consequences  for  the  subject  and 
for  the  object.  The  focus  marker  optionally  appears  when  the  wh-word  for  a  questioned 
subject  remains  in  situ  and  is  otherwise  obligatory.  On  the  other  hand,  the  focus  marker 
is  optional  when  the  wh-word  for  the  object  is  preposed  and  obligatory  when  it  remains 
in  situ.  This  is  expected  considering  the  distribution  of  ni  in  clefts.  Thus  these  two 
observations  confirm  the  hypothesis  that  S-initial  position  is  a  focus  position. 


3.2.4.0.  Ni  and  clause  types 

Other  restrictions  on  the  distribution  of  ni  depend  on  the  type  of  clause.  In  this  section 
I  will  discuss  the  distribution  of  ni  in  adverbial,  independent,  and  subordinate  clauses. 


3.2.4.1.  Adverbial  clauses 

Generally,  adverbs  appear  in  S-final  position,  which  is  not  a  focused  position.  When 
they  do  appear  at  S-initial  position,  which  is  a  focused  position,  the  focus  marker  or  its 
equivalent  is  attached.  Thus,  adverbs  behave  like  objects  with  respect  to  focusing. 

As  the  following  examples  (17)a-c  below  show,  the  focus  marker  and  the  adverb 
constitute  an  adverbial  clause  which  is  independent  of  the  main  clause.  As  such,  both 
the  adverbial  clause  and  the  main  clause  can  have  independent  focus  markers  allowing 
multiple  focus  markers  in  one  sentence  structure.  As  will  be  shown  later,  multiple 
focus  markers  are  allowed  in  some  sentence  structures  if  the  clauses  of  that  sentence  are 
semanticcally  independent. 
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(18) 

a.  Ni  wasi  n  -ii  -le  -wona  ihndu  cho. 
foe.  clear  foe.  -sm-2sg  -pst  -see  person  that 
"Clearly  (it  is  clear)  you  saw  that  person." 

b.  *Ni     wasi    -u  -le     -wona  mndu    cho. 

foe.  clear  -sm-2sg  -pst  -see       person  that 
"Clearly  (it  is  clear)  you  saw  that  person." 

c.  *wasi    n       -u  -le     -wona  mndu    cho. 

clear  foe.  -sm-2sg  -pst  -see       person  that 
"Clearly  (it  is  clear)  you  saw  that  person." 

The  unacceptability  of  (18)b  is  a  result  of  the  absence  of  the  focus  marker  in  the 
main  clause.  The  scope  of  ni  is  therefore  limited  to  the  clause  in  which  it  occurs. 
Likewise,  (18)c  is  unaceptable  since  it  is  required  that  the  adverb  wasi  form  an  adverbial 
clause  by  combining  with  nj  in  order  to  appear  semanticaJly  independent  of  the  main 
clause.  Further,  because  wasi  is  occupying  sentence  initial  and  there  is  no  other  potential 
subject-NP  which  precedes  the  verb,  the  absence  of  ni  will  make  the  advverb  appear  as 
if  it  were  the  subject  of  the  main  verb.  However,  its  association  with  the  verb  is  blocked 
by  the  agreement  marker  on  the  verb  which  distinguishes  wasi  from  the  real  subject  of 
the  verb. 

The  ni  co-occuring  with  the  adverb  has  a  distinct  function  from  that  in  the  main 
clause  in  (18)a.  The  ni  co-occuring  with  the  adverb  wasi  provides  contrastive  focus  on 
the  adverb  wasi  while  the  pv-ni  attached  to  the  main  verb  provides  assertive  focus  on 
the  subject  of  that  verb.  To  a  certain  degree,  this  stratergy  reflects  the  speakers  attitude 
with  respect  to  the  truth  of  his  assertion  (cf.  also  Dalgish  1979).  The  speaker  does  not 
expect  his/her  assertion  to  be  challenged.  In  addition,  the  presence  of  the  ni  in  the 
adverbial  clause  highlights  the  interpretation  'certainty'  which  is  part  of  the  intrinsic 
nature  of  the  adverb.  Examples  (19)a  and  (19)b  below  further  illustrates  this  point. 


(19) 


Ni    kupfa      n-     n  -le-    iwa      maduwii 

foe  because  foe  -sm-2sg  -pst  -steal  bananas    these 

gha        6       -chi         -kapo. 

sm-2sg  -fut  -beaten 

"Because  you  stole  these  bananas  you  will  be  beaten. 

*Kupfa     n       -u  -le     -iwa     maduwu  gha 

because  foe.  -sm-2sg  -pst  -steal  bananas   these 
o  -chi  -kapo. 

sm-2sg  -fut  -beaten 
"Because  you  stole  these  bananas  you  will  be  beaten. 
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(19)a  shows  the  focus  marker  in  both  the  adverbial  clause  and  the  main  clause. 
(19)b  is  unacceptable  on  the  same  grounds  as  (18)c  by  the  fact  that  the  adverb  kupfa 
appears  as  the  subject  of  the  main  verb  but  that  association  is  blocked  by  the  agreement 
marker  on  the  verb.  To  prevent  the  adverb  from  appearing  as  if  it  were  the  subject  of 
the  verb  Immediately  following  it,  the  ni  has  to  appear  to  provide  the  adverb  with  its 
own  focus  structure  as  in  (19)a.  There  are  semantic  consequences  too.  In  the  absence 
of  the  focus  marker,  the  adverb  kupfa  loses  its  contrastive  focus  reading  and  assumes  a 
'condition/consequence'  reading,  as  (20)  shows. 


(20) 

a.  Kupfa     -ii  -le     -iwa     maduwu  gha 
because  -sm-2sg  -pst  -steal  bananas    these 

n-       6  -chi  -kapo. 

foe-  sm-2sg  -fut  -beaten 

"If  you  stole  these  bananas  you  will  be  beaten." 

b.  *Kupfa      n      -u  -le     -iwa     maduwu  gha 

because  foe  -sm-2sg  -pst  -steal  bananas   these 

o  -chi  -kapo. 

sm-2sg  -fut  -beaten 

"If  you  stole  these  bananas  you  will  be  beaten." 

Notice  that  the  absence  of  the  focus  marker  reduces  the  adverbial  clause  to  a  mere 
adverb  whose  interpretation  depends  on  the  clause  to  which  it  is  attached.  This  has  some 
consequences  with  respect  to  the  distribution  of  the  focus  marker  in  the  remaining  clauses 
since  conditional  clauses  cannot  have  a  focus  marker,  which  explains  the  acceptability 
of  (20)a  and  the  unacceptable  (20)b.  This  will  be  discussed  further  in  the  next  section 
in  which  I  will  look  at  distributional  patterns  of  the  focus  marker  in  sentence  structures 
of  more  than  one  clause. 


3.2.4.2.  Coordinate  clauses 

We  have  just  seen  that  if  a  clause  does  not  depend  on  another  clause  for  its  semantic 
interpretation,  it  will  require  its  own  focus  structure.  This  is  clearly  exemplified  by 
coordinated  clauses  as  shown  in  the  following  examples. 


(21) 


Kite  h-ki       -le     -kap    -b 

dog    foc-sm  -pst  -beat  -pass. 

na     mana  n      -a      -le     -ki  -wona 

and  child    foe  -sm  -pst  -it   -see 

"The  dog  was  beaten  and  the  child  saw  it. 
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b.  ♦Kite  ki     -le     -kap-o 

dog    sm  -pst  -beat     -pass. 

na     mana  n      -a      -le     -ki  -wona 

and  child    foe  -sm  -pst  -it   -see 

"The  dog  waa  beaten  and  the  cliild  saw  it." 

c.  *Kite  n      -ki    -le     -kap    -o 

dog    foe  -sm  -pst  -beat  -pass. 

na     mana  a      -le     -ki  -wona 

and  child    sm  -pst  -it   -see 

"The  dog  was  beaten  and  the  child  saw  it." 

d.  Kite  h     -ki'    -le     -kap    -6 
dog    foe  -sm  -pst  -beat  -pass, 
na     mana  a     -le     -ki  -wona 
and  child    sm  -pst  -it   -see 

"The  dog  was  beaten  by  the  child  who  saw  it." 

The  unacceptability  of  (21)b  and  (21)c  follows  from  the  fact  that  the  clause  without 
the  focus  marker  is  forced  to  derive  its  semantic  interpretation  form  the  clause  with 
the  focus  marker.  However  such  a  derivation  is  blocked  by  the  syntactic  and  semantic 
structure  of  the  sentence.  The  clauses  are  coordinated  and  are  to  be  interpreted  inde- 
pendently of  each  other.  (21)d  is  acceptable  only  if  the  subject  of  the  lower  clause  (i.e 
mana)  is  interpreted  as  the  agent  of  the  passive  verb  of  the  upper  clause.  With  a  relative 
clause  reading,  the  lower  clause  will  be  interpreted  as  a  complement  clause  rather  than 
an  independent  clause. 

3.2.4.3.  Subordinate  clauses 

Subordinate  clauses  include  relative  clauses  and  sequential  clauses.  Characteristi- 
cally subordinate  clauses  contain  background  materials  while  the  main  clause  provides 
the  asserted  materials.  Sequential  clauses  are  different  in  this  respect  because  they  do 
not  necessarily  contain  background  information.  Sequential  clauses  are  used  for  the  most 
part  to  advance  the  mainline  events  of  a  narration.  The  following  examples  are  intended 
to  show  the  distribution  of  ni  subordinate  clauses. 


(22) 

a.  Msbdo  a  -icha     n      -a  -le     -koDa  kelya. 
man       sm-3sg  -come  foe  -sm-3sg  -pst  -cook    food 
"The  man  who  is  coming  cooked  the  food." 

b.  *Msodo  n      -a  -icha     n      -a  -le     -koDa  kelya. 

man       foe  -sm-3sg  -come  foe  -sm-3sg  -pst  -cook    food 
"The  man  who  is  coming  cooked  the  food." 
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c.  Kupfa  a  -le     -cha      ukou 
cond.     8m-3sg  -pst  -come  yesterday 
n      -a            -we     -kaa    na     -so. 
foe  -sni-3sg  -perf  -stay  and  -us 

"If  he  had  come  yesterday,  he  would  have  stayed  with  us." 

d.  *Kupfa  n      -a  -le     -cha      ukou 

cond.     foe  -sm-3sg  -pst  -come  yesterday 

n      -a  -we     -kaa    na     -so. 

foe  -sm-3sg  -perf  -stay  and  -us 

"If  he  had  come  yesterday,  he  would  have  stayed  with  us." 

e.  Mana  n-       a-      le-     cha,      k-         a-  koDa 
child    foe.-  sm-  pst-  come,  -seq.-  sm-3sg-  cook 

k-       a-  lya,  (k-      a-  enda  kanyi 

seq.-  sm-3sg-  eat,  seq.-  sm-3sg-  go       home 
"The  child  came,  (then)  he  cooked,  (then)  he  ate,  (then)  he  went  home." 

f.  *Mana  n-       a-  le-     cha,      (n)-k-  a-  koDa 

child    foe.-  sm-3sg-   pst-  come,  (foc)-seq.-  sm-3sg-  cook 

(n)-k-  a-  lya,   (n)-k-  a-  enda  kanyi 

(foc)-seq.-  sm-3sg-  eat,  (foc)-seq.-  sm-3sg-  go       home 

"The  child  came,  (then)  he  cooked,  (then)  he  ate,(then)  he  went  home." 

Examples  (22)a  and  (22)b  involve  modifying  clauses  (similar  to  relative  clauses)^", 
(22)e  and  (22)d  involve  adverbial  or  condition  clauses  while  (22)e  and  (22)f  involve 
clauses  expressing  events  in  a  sequence.  Conditional  clauses  were  briefly  discussed  earlier 
in  my  discussion  of  adverbial  clauses. 

The  unacceptability  of  (22)b,  (22)d  and  (22)e  indicate  that  ni  may  not  appear  in 
embedded  or  subordinate  clauses.  There  is  a  discourse  restriction  which  shows  that  the 
information  contained  in  subordinate  clauses  cannot  be  aissertive  because  subordinate 
clauses  like  those  described  in  this  paper  provide  background  information.  Although 
sequential  clauses  do  not  contain  background  information,  but  rather  signal  mainline 
events  in  narratives,  they  do  not  allow  the  appearance  of  ni  because  they  are  syntactically 
and  semantically  dependent  on  the  main  clause.  Syntactically  they  follow  the  main 
clause,  and  they  lack  subject  and  tense  marking.  Semantically,  they  depend  on  the 
main  clause  for  the  interpretation  of  the  actor/agent  and  the  time  of  the  action,  which 
is  generally  presented  Jis  past. 

The  restriction  on  the  distribution  of  ni  in  subordinate  clauses  also  dictates  the 
order  in  which  the  clauses  may  appear.  A  subordinate  clause  of  the  type  discussed  in 
this  paper  cannot  precede  the  main  clause.  This  is  further  illustrated  below. 
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(23)  ^ 

a.  N  -a  -le  -ona  kite  u  -le  -kapa. 
foe  -sm-3sg  -pst  -see  dog  sm-2sg  -pst  -beat 
"He  saw  the  dog  (which)  you  hit." 

b.  *a  -le     -ona  kite  n      -u  -le     -kapa. 

sm-3sg  -pst  -see    dog  foe  -sm-2sg  -pst  -beat 
"He  saw  the  dog  (whieh)  you  hit." 

c.  N     -ii  -le     -kapa  kite  a  -le    -ona. 
foe  -sm-2sg  -pst  -beat    dog  sm-3sg  pst  see 
"You  hit  the  dog  that  he  saw." 

d.  *u  -le     -kapa  kite  n      -a  -le    -ona. 

sm-2sg  -pst  -beat   dog  foe  -sm-3sg  pst  see 
"You  hit  the  dog  that  he  saw." 

e.  *n      -a  -le     -ona  kite  n      -u  -le     -kapa. 

foe  -sm-3sg  -pst  -see    dog  foe  -sm-2sg  -pst  -beat 
"He  saw  the  dog  (which)  you  hit." 

Examples  (23)a  through  (23)e,  represent  cases  in  which  the  focus  marker  may  be 
associated  with  asserted  information.  Not  only  does  the  asserted  information  need  to 
appear  in  the  higher  clause  but  it  also  needs  to  be  introduced  first  in  the  structure  of 
the  discourse.  Speakers  have  a  choice  as  to  what  information  to  present  first,  this  is 
reflected  in  the  order  in  which  the  clauses  are  presented  as  well  as  the  focus  structure. 

These  examples  also  show  that  the  focus  marker  ni  may  only  appear  in  the  clause  that 
is  ordered  first.  This  is  demonstrated  by  (23)b  which  is  the  reverse  order  of  (23)a  and 
which  shows  the  focus  marker  in  the  lower  clause.  It  is  plausible,  therefore,  to  conclude 
that  the  clause  with  the  focus  marker  must  always  be  the  higher  clause.  The  clause 
without  the  focus  marker  becomes  a  dependent  clause  which  disallows  the  focus  marker, 
as  demonstrated  by  the  unaceeptability  of  (23)b  and  (23)d.  The  unaeeeptability  of 
(23)e  is  prompted  by  the  presence  of  the  focus  marker  in  both  the  main  and  subordinate 
clause. 

To  summarize,  I  have  shown  by  examples  that  ni  is  conditioned  by  the  syntatie 
position  it  can  occupy  in  a  clause.  The  preferred  position  is  S-initial  position,  which 
is  also  the  preferred  focus  position.  I  showed  that  when  adverbs  and  objects  appear  in 
S-final  position,  they  require  the  presence  of  the  focus  marker.  While  the  focus  marker 
may  appear  in  more  than  one  coordinate  clause,  it  may  not  appear  in  subordinate  clauses 
of  the  type  described  in  this  paper.  I  also  showed  that  the  order  in  which  clauses  appear 
reflects  the  speaker's  choice  of  what  information  to  present  to  the  hearer  first.  This  in 
turn  reflects  on  the  distribution  of  the  focus  marker  in  such  clauses.  It  turns  out  that 
the  focus  marker  may  only  appear  in  the  clause  which  has  the  information  the  speaker 
considers  most  crucial.  Such  information  is  presented  to  the  hearer  first. 
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3.3.0.  Other  environments  without  ni 

Apart  from  environments  discussed  in  section  3.2.4,  ni  may  not  occur  in  negative  propo- 
sitions, nor  with  the  tense  markers  am  and  wem,  whose  interpretation  reflects  past  time. 
Negation  is  generally  accomplished  by  adding  the  adnominal  pfo  at  the  end  of  the  propo- 
sition. In  some  marked  cases  chi  or  ni  may  be  used.  The  following  section  is  devoted  to 
these  environments. 


3.3.1.  The  adnominals  pfo,  chi  and  ni 

Both  pfo  and  chi  are  negative  morphemes.   They  differ  in  that  pfo  and  ni  can  appear 
only  in  S-final  position,  while  chi  can  appear  only  in  S-initial  position. 
Consider  the  following  examples. 


(24) 


Ndesambudlo  a  -le     -okya   nyama  pfo. 

Ndesambudlo  sm-3sg  -pst  -roast  meat      neg. 
"Ndesambudlo  did  not  roast  meat." 

b.  *Ndesambudlo  n      -a  -le     -okya   nyama  pfo. 

Ndesambudlo  foe  -sm-3sg  -pst  -roast  meat      neg. 
"Ndesambudlo  did  not  roast  meat." 

c.  u  -le  -soma  kitapu  pfo. 
sm-2sg  -pst  -read  book  neg. 
"You  did  not  read  a  book." 

d.  *n      -u  -le     -soma  kitapu  pfo. 

foe  -sm-2sg  -pst  -read    book     neg. 
"You  did  not  read  a  book." 


These  examples  show  that  pfo  and  the  focus  marker  ni — which  has  been  designated 
the  focus  marker — are  in  complementary  distribution.  (24)b  and  (24d)  are  unacceptable, 
because  pfo  and  the  focus  marker  ni  have  been  allowed  to  co-occur. 

Now  consider  these  examples  with  chi. 


(25) 

a.  Chi  NdesambiiDo  a  -le  -okya  nyama  pfo. 
neg.  NdesambuDo  sm-3sg  -pst  -roast  meat  neg. 
"It  is  not  NdesambuDo  who  roasted  meat." 

b.  *Chi      NdesambuDo  n      -a  -le     -okya   nyama  pfo. 

neg.     NdesambuDo  foe  -sm-3sg  -pst  -roast  meat      neg. 
"It  is  not  NdesambuDo  who  roasted  meat." 
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c.  Ch-  ii  -le  -soma  kitapii  pfb. 
neg.  -sm-2sg  -pst  -read  book  neg. 
"It  is  not  that  you  read  a  book." 

d.  *Chi      n       -u  -le     -soma  kitapu  pfo. 

neg.     -foe  -sm-2sg  -pst  -read    book     neg. 
"It  is  not  that  you  read  a  book." 

e.  *Ch     -u  -le     -soma  kitapu. 

neg.  -sm-2sg  -pst  -read    book. 
"It  is  not  that  you  read  a  book." 

f.  *Chi    -n-    u  -le     -soma  kitapu. 

neg.  -foe  -8m-28g  -pst  -read    book. 
"It  is  not  that  you  read  a  book." 

These  examples  show  chi  at  S-initial  position  co-occurring  with  pfo.  No  restrictions 
have  been  found  that  prohibit  the  co-occurrence  of  both  negative  markers.  However,  the 
presence  of  chi  generally  requires  that  of  pfo,  as  shown  by  the  unacceptability  of  (25)e 
and  (25)f.  (25)d  is  unacceptable  on  account  of  the  co-occurrence  of  the  positive  focus 
marker  (occurring  as  pv-ni)  and  the  negative  focus  marker  chi. 

Although  chi  is  somehow  dependent  on  pfo,  the  two  have  different  discourse  and 
semantic  functions.  It  is  plausible  to  consider  chi,  but  not  pfo,  to  be  the  negative  focus 
marker^^.  As  a  negative  marker,  pfo  can  be  used  to  negate  the  entire  proposition. 
The  claim  that  the  two  adnominals  have  different  semantic  functions  can  be  further 
illustrated  by  the  following  examples  in  which  chi  appears  independently  of  pfo.  Notice, 
however,  that  this  is  possible  only  in  questions. 


(26) 


NdesambuDo  a  -le     -okya  nyama? 

(neg)  NdesambuDo  sm-3sg  -pst  -roast   meat 
"NdesambuDo  did  not  roast  meat?" 

NdesambuDo  a  -le     -oky'a  nyama  pfb? 

NdesambuDo  sm-3sg  -pst  -roast   meat      neg. 
"NdesambuDo  did  not  roast  the  meat?" 

Chi  NdesambuDo  -a  -le     -oky'a  nyama? 

neg  NdesambuDo  sm-3sg  -pst  -roast    meat. 
"Is  it  not  NdesambuDo  who  roasted  the  meat?" 

Chi  NdesambuDo  -a  -le     -oky'a  nyama  pfb? 

neg  NdesambuDo  sm-3sg  -pst  -roast   meat      neg. 
"It  is  not  NdesambuDo  who  roasted  the  meat?" 
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These  examples  may  be  interpreted  as  follows.  (26)a  is  a  simple  and  neutral  "fact 
finding"  question.  Although  there  is  no  overt  negative  marker,  negation  is  marked  by  a 
floating  tone.  (26)b  is  an  echo  question.  The  speaker  is  already  aware  that  the  action  was 
not  performed.  In  other  words,  NdesambuDo  was  expected  to  roast  the  meat.  However, 
it  appears  that  NdesambuDo  did  not  roast  the  meat.  Compared  to  (26)b,  (26)c  has  a 
shift  in  focus.  The  speaker  is  aware  that  the  action  was  performed  but  is  interested  in 
finding  out  whether  NdesambuDo  (and  not  anyone  else)  roasted  the  meat.  Thus  it  is  a 
case  of  contrastive  focus,  resulting  from  the  presence  of  the  negative  focus  marker  chi. 
Thus  chi  negates  the  subject- NdesambuDo-  but  not  the  rest  of  the  proposition.  (26)d  is 
also  an  echo  question.  The  speaker  is  aware  that  the  meat  was  roasted  but  that  someone 
other  than  NdesambuDo  roasted  it.  It  differs  from  (26)c,  because  it  has  a  contrastive 
reading  that  can  be  associated  with  the  focus  marker  chi,  while  pfo  provides  a  negative 
interpretation  of  the  entire  proposition.  The  speaker  presupposes  that  NdesambuDo  did 
not  roast  the  meat.  Further,  he  is  not  sure  if  the  meat  was  roasted  at  all.  Therefore, 
the  co-occurrence  of  chi  and  pfo  reinforces  the  negative  interpretation  of  both  the  actor 
and  the  action. 

Now  consider  the  following  examples  with  ni  as  a  negative  marker. 

(27) 

a.  Mana  a-  le-     okya  nyama  ni 
child    sm-3sg-   pst-  roast  meat      neg 
"The  child  did  not  roast  the  meat" 

b.  Chi         mana  a-  le-     okya   nyama  ni 
neg-foc  child    sm-3sg-  pst-  roast  meat      neg 
"The  child  did  not  roast  the  meat" 

c.  *Mana  a-      le-     okya   nyama  ni      pfo 

child    sm-  pst-  roast  meat     neg  neg 
"The  child  did  not  roast  the  meat" 

The  reading  of  (27)a  is  that  the  child  did  something  other  than  roast  meat.  In 
(27)b  the  reading  is  almost  a  double  emphasis  ("defmetely  the  child  did  not  roast  the 
meat").  It  may  be  used  to  indicate  that  the  child  roasted  something  else  while  someone 
else  roasted  the  meat.  It  is,  therefore,  plausible  to  consider  that  ni  may  interact  with 
chi  in  the  same  way  chi  interacts  with  pfo  to  reinforce  a  negatives  interpretation  of  a 
proposition.  On  the  other  hand,  ni  and  pfo  cannot  co-occur,  as  the  unacceptability  of 
(27)c  shows.  This  shows  that  the  two  are  in  free  variation  but  are  are  syntactically 
and  semantically  in  complementary  distribution  with  chi.  However,  it  is  important  to 
mention  that  the  data  shows  a  wide  distribution  of  pfo  but  not  ni  as  a  negative  marker. 

3.3.2.  The  aspect  markers  -am-  and  -wem-^^ 

Clauses  with  the  aspect  markers  am  and  ivem  also  seems  to  affect  the  distribution 
of  ni.  The  following  examples  show  the  destribution  of  ni  in  clauses  with  either  a  tense 
or  aspect  marker. 
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(28) 

a.  *Mai        n      -a  -am  -okya   nyama. 

mother  foe  sm-Ssg  -im.perf  -roast  meat 
"Mother  has  roasted  the  meat." 

b.  *Mai        n     -a  -wem      -okya  nyama. 

mother  foe  8m-3sg  -p.perf.  -roast  meat 
"Mother  had  roasted  the  meat." 

c.  Mki        -i  -4m         -6ky'a  nyama. 
mother  sm-3sg  -im.perf  -roast   meat 
"Mother  has  roasted  the  meat." 

d.  Mai'        -a  -wem      -oky'a  nyama. 
mother  sm-3sg  -p.perf.  -roast   meat 
"Mother  had  roasted  the  meat." 

e.  Mai        n      -a  -i         -oky'a  nyama. 
mother  foe  sm-3sg  -pres  -roast   meat 
"Mother  is  roasting  the  meat." 

f.  Mai        n     -a  -e      -oky'a  nyama. 
mother  foe  sm-3sg  -fut  -roast   meat 
"Mother  will  roast  the  meat." 

g.  Mai        n     -a  -lehi  -oky'a  nyama. 
mother  foe  sm-3sg  -fut-  -roast   meat 
"Mother  will  roast  the  meat." 

h.     Mai'        n     -a  -we  -oky'a  nyama. 

mother  foe  sm-3sg  -pst-eont.  -roast   meat 
"Mother  was  roasting  the  meat." 

Notice  that  only  (28)a  and  (28)b  are  unacceptable,  due  to  the  co-occurrence  of  the 
aspect  markers  am  and  wem  with  ni,  a  restriction  which  does  not  apply  to  other  tense 
or  aspect  markers  (cf.  Moshi  1988).  In  addition,  the  restriction  does  not  apply  to  echo 
questions  or  where  the  focus  marker  does  not  appear  in  the  same  sentence  structure  as 
the  aspect  marker.  Such  are  eases  as  those  described  in  section  3.2  and  which  involve 
non-subject  focusing.  Additional  examples  are  given  below  in  which  (29)  exemplify 
clefted  objects  and  (30)  exemplify  echo  questions. 

(29)  ,  ,  ,^ 

a.     (Ni)    mana  mai         a  -am       -mw  -oky-    i-  a  nyama. 

(foe)  child    mother  sm-3sg  -perf.-  him    roast-  appl-  fv  meat 
"The  child,  mother  roasted  the  meat  for  him." 
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b.     (Ni)    mana  mai         a  -wem       -mw  -oky-    i-         'a  nyama. 

(foe)  child    mother  sm-3sg  -p-perf-  him    roast-  appl-  fv  meat 
"The  child,  mother  has  already  roasted  the  meat  for  him." 

a.  Mai        n      -a  -am     -oky a  nyama? 
mother  foe  sm-3sg  -perf.  -roast   meat 
"Has  mother  roasted  the  meat?" 

b.  Mai        n      -a  -wem     -oky'a  nyama? 
mother  foe  sm-3sg  -p-perf  -roast    meat 
"Had  mother  roasted  the  meat?" 

Considering  that  the  focus  marker  is  not  in  the  same  constituent  as  the  aspect 
marker,  it  is  plausible  to  consider  that  the  perfective  aspect  marker  has  a  focus  of  its 
own  which  blocks  additional  focusing  by  ni.  Thus  eases  like  (28)a  and  (28)b  are  ruled 
out  because  an  assertion  cannot  have  more  than  one  focus. 

The  focus  marker  is  required  in  (30)  which  involves  echo  questions  because  it  provides 
a  contrastive  focus  reading  on  the  predicate.  The  speaker  is  already  focusing  on  the 
assertion  instead  of  on  a  particular  element,  and  the  contrastive  focus  reading  is  intended 
for  the  theme  of  the  assertion,  which  in  this  particular  example  is  nyama. 

To  summarize  this  section,  I  have  shown  that  there  are  several  areas  in  which  the 
appearance  of  the  focus  marker  ni  is  resticted.  These  are  subordinate  clauses,  construc- 
tions with  the  negative  markers  chi  and  pfo,  and  those  which  involve  the  perfective 
aspects  am  and  wem.  I  claimed  that  ni  cannot  appear  in  subordinate  clauses  because 
they  typically  contain  background  information.  As  such,  they  lack  asserted  materials 
which  prompt  the  presence  of  ni.  Negatives  and  the  perfective  aspects  must  be  ana- 
lyzed as  having  an  intrinsic  focus.  This  view  that  negative  markers  have  intrinsic  focus 
has  also  been  expressed  about  other  languages  which  utilize  focus  markers  (Givon  1979; 
Marehese  1983). 

4.  General  Remarks. 

The  paper  has  emphasized  grammatical  considerations  for  the  distribution  of  ni.  Very 
little  attention  was  given  to  pragmatic  considerations.  Without  doubt,  speakers  have  a 
large  amount  of  choice  as  to  what  to  focus  and  when  to  focus  it.  However,  the  restrictions 
that  have  been  discussed  in  this  paper  are  evidence  that  the  language  uses  gammatieal 
control  in  the  distribution  of  the  focus  marker. 

I  have  argued  that  ni  in  Kivunjo  should  be  analyzed  as  a  focus  marker.  It  may 
be  used  to  mark  contrastive  focus  as  well  as  assertive  focus.  As  an  assertive  focus 
marker,  ni  denotes  new  information.  As  a  contrastive  focus  marker,  it  denotes  additional 
information  or  corrects  false  information.  It  is  clear  also  that  Kivunjo  does  not  combine 
assertive  focus  with  contrastive  focus. 


129 


As  an  assertive  focus  marker,  m  may  not  appear  in  subordinate  clauses  (such  as 
relative  clauses)  or  other  embedded  clauses.  It  was  shown  that  the  clause  with  the  focus 
marker  is  to  be  interpreted  as  the  clause  with  the  main  information.  The  clause  ordering 
restriction  supports  this  claim  as  well  as  the  claim  that  the  main  information  tends  to 
be  presented  first. 

Although  Dalgish  (1978,  1979)  distinguishes  what  I  am  calling  the  focus  marker 
from  the  morpheme  ni,  referred  to  as  the  copula,  this  study  did  not  find  significant 
characteristic  differences  between  them.  The  two  morphemes  are  phonologically  similar 
and  share  the  same  syntatic  and  semantic  properties.  Both  may  optionally  appear  in 
the  discourse,  and  may  be  replaced  by  a  H-tone  on  the  initial  syllable  of  the  following 
word  in  the  same  clause.  It  is  plausible,  therefore,  to  claim  that  the  morpheme  ni  in 
Kivunjo  need  not  be  considered  the  copula  even  though  it  may  resemble  or  translate  to 
a  copula  in  other  languages. 

The  question  is  whether  the  ni  that  I  am  designating  the  focus  marker,  and  which 
appears  in  assertions  or  contrastive  focus  constructions,  is  the  same  as  the  ni  which 
appears  at  S-final  position  in  negative  propositions.  Although  the  two  ni's  are  phono- 
logically similar,  they  differ  syntactically  and  semantically.  The  negative  marker  ni  may 
appear  at  S-final  position  and  alternates  with  another  negative  marker  pfo.  The  focus 
marker  may  appear  before  a  focal  element.  A  focal  element  may  be  any  element  that 
appears  at  S-initial  position.  When  the  subject  of  a  proposition  is  the  focal  element,  the 
focus  marker  may  appear  at  S-initial  position  for  contrastive  focus  or  as  a  prefix  on  the 
verb  for  assertive  focus. 

Despite  these  syntactic  and  semantic  differences  between  the  two  nCs  I  am  inclined 
to  consider  the  negative  marker  ni  to  be  a  semantic  allomorph  of  the  focus  marker  ni. 
As  an  allomorph,  it  may  only  appear  at  S-final  position,  but  bears  weak  focus  marking 
effects.  With  its  focus  marking  effects  weakened,  ni  assumes  the  role  of  indicating  a 
negative  proposition.  It  may  also  be  used  to  reinforce  an  already  expressed  negative 
focus.  The  former  will  include  cases  in  which  it  freely  alternates  with  pfo  while  the 
latter  will  include  cases  in  which  it  co-occurs  with  the  negative  focus  marker  chi.  This 
consideration  implies  that  Kivunjo  has  only  one  positive  focus  marker  (ni)  and  only 
one  negative  focus  marker(c/ii).  Both  the  positive  focus  marker  and  the  negative  focus 
marker  may  only  occur  where  the  main  focus  of  the  proposition  is.  This  is  a  phenomenon 
which  is  common  in  other  languages  that  use  overt  focus  markers  (Givon  1975,  Kunene 
1975,  for  Zulu;  Marchese  1983,  for  West  Kru  languages;  Clements  1984a,  1984b,  for 
Kikuyu). 
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*  This  paper  is  a  revised  version  of  a  paper  presented  at  the  African  linguistics  con- 
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read  versions  of  this  paper  for  their  valuable  comments.  I  would  especially  like  to 
thank  Sharon  Inkelas,  Sandra  Thompson  and  Joan  Bresnan  for  their  comments.  My 
gratitude  to  Ladislaus  Semali,  the  Mlay  family  and  Mr.  Makundi  for  sharing  their 
valuable  judgements  on  the  data  used  in  the  paper.  I  am  also  indebted  to  the  anony- 


130 


mous  reviewer  for  the  valuable  comments  that  made  me  re- think  my  earlier  analysis. 
None  of  the  above  are  responsible  for  the  data  or  its  analysis. 

^  Kivunjo  is  on  the  Chaga  dialect  continuum  spoken  in  Tanzania  on  the  slopes  of 
Mount  Kilimanjaro.  There  are  several  dialects  which  can  be  listed  under  the  parent 
language  Chaga,  and  these  are  spoken  in  villages  separated  from  each  other  by  a 
ridge  and/or  a  river.  Some  dialects  are  mutually  intelligible  while  others  are  not,  a 
fact  that  can  be  explained  by  geographical  and  past  pohtical  isolation.  The  dialects 
may  be  distinguished  geographically  by  dividing  them  into  three  major  groups: 

(a)  Western  :  Kisiha,  Kimachame,  and  Kikibosho. 

(b)  Central:  Kimochi  and  Kivunjo. 

(c)  Eastern:  Kirombo  is  the  cover  term  for  the  dialects  of  the  easter  group  and  which 
contains  five  dialects — Kimashati,  Kimkuu,  Kiseri,  Kingaseni,  and  other  non-Vunjo 
dialects  spoken  in  villages  lying  between  Mwika  (Vunjo)  and  Mashati  (Rombo).  A 
non-Vunjo  dialect  will  either  be  closely  related  to  the  central  or  eastern  group  of 
dialects  depending  on  its  geographical  position  on  this  continuum. 

^  Marcel  van  Spaandonck  (1973)  notes  that  many  eastern  Bantu  languages  have  this 
morpheme,  although  Guthrie  (1971)  mentions  no  more  than  ten  attesting  ni. 

*  Cf.  Marchese  (1983:117).  The  particles  include  me,  we,  and  ne  whose  main  funtion 
is  to  indicate  a  focused  element.  These  particles  have  a  tendency  to  reduce  and  may 
become  a  particle,  suffix  or  appear  as  a  tonal  clitic  (mid-high). 

^  The  'S'  in  'S-initial'  implies  main  clause  or  sentence,  depending  on  what  is  most 
appropriate  at  that  point  in  the  discussion. 

*  Tone  shift  moves  all  underlying  tones  rightward  by  one  syllable  (McHugh,  1985a). 
Thus  the  phonetic  tone  on  any  syllable  will  depend  on  the  underlying  tone  of  the 
syllable  before  it. 

^  Similar  observations  were  made  by  Nurse  and  Phillipson  (1977)  who  did  a  study  of 
tones  in  Kimochi,  a  Chaga  dialect  which  is  mutually  intelligible  to  Kivunjo.  Although 
Salone  (1980)  -who  also  studied  Kimochi-  did  not  find  occurrences  of  ni  in  her  infor- 
mants' speech,  her  data  shows  evidence  for  an  underlying  phrase-initial  H-tone.  This 
suggests  a  floating  H-tone  since  there  is  evidence  for  an  abstract  phrase-boundary  L- 
tone  in  Kimochi.  The  logical  explanation  for  this  floating  H-tone  in  Kimochi,  seems 
to  be  the  deleted  ni.  Additional  evidence  for  the  theory  of  a  floating  H-tone  comes 
from  Marcel  van  Spaandonck  (1973)  about  Shambaa,  Tsonga,  Horohoro,  Manyika, 
to  mention  only  a  few.  Similarly  Clements  (1984a,  b)  studied  Kikuyu  and  found 
that  ne,  which  is  similar  to  ni  in  Kivunjo  and  Kimochi,  has  an  underlyingly  H-tone 
which  spreads  onto  the  first  syllable  of  the  word  that  immediately  follows  it. 

'  The  same  is  true  for  any  citation  form,  but  here  I  choose  to  exemplify  nouns.  Many 
Bantuists  (for  example  Meinhof  1948  and  Horton  1949)  would  have  been  content  to 
state  that  (8)b  may  mean  "a  river"  or  "it  is  a  river",  which  is  what  is  attested  here. 
However,  without  knowledge  of  the  function  of  tone,  it  will  be  difficult  to  see  how 
either  reading  is  possible. 
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®  ne  becomes  no  through  vowel  harmony  when  it  combines  with  the  wh-pronoun  o. 
Vowel  harmony  is  also  very  productive  in  Kivunjo  as  can  be  seen  in  example  (19) 
and  (20)  in  which  the  second  person  singular  subject  marker  surfaces  as  o.  It  is  a 
combination  of  u+i  where  u  is  the  real  subject  marker  and  i  is  part  of  the  future 
tense  marker  ichi. 

^°  Relative  clauses  in  Kivunjo  are  not  overtly  marked  by  a  relative  pronoun,  in  contrast 
to  many  other  Bantu  languages  (e.g.  Swahili,  Kamba,  Taita  from  which  Kivunjo 
borrows  extensively).  A  relative  clause  is  marked  segmentally. 

"  Makundi  (personal  conversation),  a  speaker  of  the  same  dialect  but  from  Mamba 
village  (  the  author  comes  from  Kirua  Vunjo,  three  villages  to  the  west  of  Mamba), 
points  out  that  In  his  speech  it  is  possible  to  use  both  chi  and  ni  as  in  ni  chi 
NdesdmbuDo  dleokyd  nydmd  pfo  but  not  chi  ni  NdesdmbuDo  dliokyd  nydmd  pfo. 
This  indicates  a  possibility  of  double  focus  for  emphasis  in  the  case  of  the  former 
while  the  unacceptability  of  the  latter  confirms  the  association  of  ni  with  S-initial 
position  focus  marking.  The  speaker  has  two  strategies:  negating  the  focused  element 
as  well  as  emphasizing  the  contrastive  reading.  The  translation  will  be  something 
like  "It  is  (definitely)  not  NdesambuDo  who  roasted  the  meat." 

*'  Wem  is  a  reduced  form  of  we  and  am,  the  perfective  aspect  marker.  We  on  the  other 
hand,  is  also  a  reduced  form,  a  combination  of  wa,  an  auxiliary  verb  'be',  and  i,  a 
tense  marker  representing  the  moment-of-speech  (cf.  Moshi  1988). 


Abbreviations. 

1. 

asp. 

-*■  aspect 

2. 

pst.- 

->■  past 

3. 

pst.cont.  —*  past  continuous 

3. 

pres. 

.  -»  present 

4. 

fut. 

->■  future 

5. 

im.perf.  —>■  immediate  -  perfect 

6. 

p.perf.  -^  past  -  perfect 

6. 

foe. 

-*  focus 

7. 

fv.- 

-+  final  vowel 

8. 

adv. 

— +  adverb 

9. 

cond 

.  —*  condition 

10. 

au.  - 

■*  subject 

11. 

sm. 

->■  subject  marker 

12. 

om. 

->  object  marker 

13. 

sg.- 

■*  singular 

14. 

pL- 

*  plural 

15. 

cl.- 

*  class 

16. 

neg. 

-*  negative 

17. 

pass 

.  -*  passive 
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Tones 


1. 

d-> 

high 

2. 

6^ 

low 

3. 

o-> 

super 

-  high 

4. 

©-► 

high  ■ 

-  low 

5. 

0- 

low  - 

high 

6. 

'd- 

►  downstep  -  high 

Re 

fere; 

nces. 
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SOME  MORPHOLOGICAL  AND  PHONOLOGICAL  INTERACTIONS  IN  LAKHOTA 

Trudi  A.  Patterson 

Lakhota  phonology  includes  two  processes  that 
seem  to  be  controlled  by  morphological  processes 
but  that  also  appear  to  have  other  peculiar 
aspects.  These  are  ablaut  and  velar 
palatalization.  This  paper  will  attempt  to  show 
that  Lexical  Phonology  will  more  elegantly  account 
for  these  processes  than  past  analyses.  It  will 
also  be  shown,  however,  that  certain  recently 
proposed  constraints  on  Lexical  Phonology  are  too 
strong.  In  addition,  what  appears  to  be  iterative 
application  of  one  of  the  palatalization  rules 
cannot  be  accounted  for  by  appealing  to  cyclic 
phonology;  and  though  it  appears  to  be  a  case  of 
assimilation,  we  do  not  seem  to  be  able  to  appeal 
to  feature  spreading  either. 

Lakhota  is  a  dialect  of  Dakota,  an  American  Indian 
language  in  the  Siouan  family.  Both  ablaut  and  velar 
palatalization  are  well  documented  processes  in  this  language. 
Briefly,  under  certain  apparently  morphologically  controlled 
conditions,  the  velar  stops  (k,  kh,  k')  become  palato-alveolar 
affricates  (c,  ch,  c').*  Since  Lakhota  does  have  this  series 
of  affricates  phonemically,  we  appear  to  have  an  example  of 
a  non-automatic  neutralization  process.  The  segments  that 
appear  to  trigger  palatalization  are  the  high  front  vowel  /i/ 
and  an  underlying  /a/  that  has  become  e  by  the  ablaut 
process,  which  is  itself   morphlogically  conditioned. 

Let  us  first  consider  what  morphology  controls  these 
processes.  This  investigation  will  focus  on  the  verbal 
morphology;  nouns  will  play  a  smaller  role.  Verbal 
morphology  is  very  active,  involving  pref ixation,  inf ixation, 
suffixation,  noun  incorporation  and  the  annexation  of 
enclitics.  Nouns  may  frequently  be  derived  from  verbs.  They 
may  also  be  "compiled"  from  other  pieces  (e.g.  noun+adverb-- 
example  given  below  VIII  C.  )  .  Both  velar  palatalization  and 
ablaut  are  intimately  tied  up  with  all  of  these  processes. 

VERBAL  MORPHOLOGY 

It  may  be  useful  to  give  first  a  brief  outline  of  the 
verbal  morphology,  in  order  to  make  the  following  discussion 
more   intelligible. 

Subjective  and  objective  personal  pronouns  are  affixed 
to  verbs.  The  personal  pronouns  are  as  follows  (-pi  is  the 
plural  suffix) : 


1st 

wa- 

^- 

2nd 
3rd 

r 

r 

-pi 
-pi 
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Subject  Object 

Sg.       PI.  Sg.       PI. 

ma-       1^- 

ni-       ni-  -pi 

(ft  wicha-  -pi 

A  'portmanteau'  morpheme,  chi-,  is  used  for  first  person 
subject  -  second  person  object.  The  "object"  pronouns  are 
used  as  subject  forms   in  neutral  or  stative  verbs. 

Other  verbal  prefixes  include  instrumentals,  locatives, 
and  a  set  of  category  modifying  af f ixes--benefactive, 
possessive,  reciprocal,  and  reflexive.  Among  the  suffixes 
and  enclitics  are  the  plural,  negative,  diminutive, 
conjunctions,  and  sentence  particles. 

Verb  prefixes  and  some  of  the  suffixes  and  enclitics  are 
as  follows  (they  are  given  in  the  order  they  would  be  placed 
on  the  verb,  thus  instrumental  prefixes  in  II  A  are  closest 
to  the  verb,  possessive,  reflexive,  etc  in  II  B  next,  and  so 
on;  the  continuative  suffix/enclitic  would  immediately  follow 
the  verb,  the  plural  would  appear  outside  that,  and  so  on): 

II    PREFIXES: 

A    Instrumentals 

1.  ya-   with  the  mouth 

2.  yu-   by  handling  or  manipulating 

3.  pa-   by  pushing 

4.  pu-   pressure  (indefinite  direction) 

5.  ka-   by  sudden  impact 

B    Possessive,  Reflexive,  Reciprocal,  Benefactive 
ki-         ic'i-      kichi-      ki- 

C    Personal  Pronouns 

wa-,  ya-,  ma-,  ni-,  chi-,  i^(k)-,  wicha- 

D     Instrumentals 

1.  wa-   with  edged  instrument 

2.  wo-  with  pointed  instrument 

3.  na-   by  foot  or  leg 

4.  na-   by  some  "inner  force" 

E    "In  two/in  the  middle" 
khi- 

F    Locatives 

1.  a-   on 

2.  o-    in 

3.  i-   against,  toward,  in  contact  with, by  means 

of 
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III  SUFFIXES/ENCLITICS 

1.  Continuative :  -h^ 

2.  Plural  -pi 

3.  Diminutive  -la/-laka 

4.  'Regularly'  -s'a 

5.  Augmentative  -xca 

6.  Future  -kta 

7.  'Rather'  -ka 

8.  Negative  -sni 

9.  Augmentative  -xci 

10.  'As  though'  -s'e 

11.  'Contrary  to  fact'   -tkha 

12.  Declarative  -(y)(w)e  (lo/le) 

13.  'Evidently'  -lakha 

14.  Quotative  -ska 

15.  Declarative  -' 

Involved  in  these  phonological  processes  is  also  a  set 
of  stems,  called  "dependent  verbs,"'  used  in  verb  compounds 
including  the  following  (selected  here  because  of  their 
initial  velar) : 

IV  -khiya  'to  cause  to'  (causative) 
-khiya  'to  be  of  the  opinion' 
-ki^za           'to  pretend  to' 

-kap^  'to  be  reluctant  to' 

-kho  'to  predict,  prophesy' 

-kinica         'to  be  hardly  able  to  restrain  from' 

These  combine  with  another  verb  to  form  complex  verbs  as  the 
following  example  shows: 

V  witko  'to  be  foolish' 
witko-kijza      'to  pretend  to  be  foolish' 
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AFFECTED  VELARS 
VI 


The  following  examples  show  the  process  of 
palatalization  after  the  underlying  high  front  vowel 
/i/: 


D. 


'to  shoot' 
'He  shoots  at  me' 
'He  shoots  at  you' 
'I  shoot  at  you' 

•to  be  hindered  by' 
' I  am  hindered  by' 

'to  sweep' 
'a  broom' 


Pronoun+verb 

1.  khute 

2.  ma-khute 

3.  ni-chute 

4.  chi-chute 
Locative+verb 

1.  'icaVi 

2.  'imakaVi 
Nominalizer+verb 

1.  kah^^ta 

2.  'icah^te 
Noun+verb  (Object  Incorporation) 

1.  k'a  'to  dig' 

2.  mni  'water' 

3.  mnic'a  'to  dig  for  water' 
E.    Possessive/ref lexive/reci pr oca 1+ verb 

1.  khuwa  'to  chase' 

2.  kichuwa  'to  chase  one's  own' 

3.  'ichichuwa  'to  chase  oneself' 

4.  kichichuwapi        'They  chase  each  other' 

This  process  is  phonologically  conditioned.   The  velar 
will  not   palatalize  if  it  is  followed  by  another  consonant. 

VII       kte  'to  kill' 

nikte  'He  kills  you' 

Not  all  velars  will  undergo  palatalization  after  this 
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vowel.  First,  as  Boas  and  Deloria  (1941)  note,  only  active 
transitive  verbs  with  initial  velars  are  subject  to  this 
rule.  Neutral  (stative)  verbs  do  not  undergo  this  rule. 
Other  velars  which  fail  to  undergo  palatalization  appear  on 
noun  stems.  The  following  examples  show  where  palatalization 
fails  to  apply  to  velars  immediately  following  the  high  front 
vowel  /i/. 


VIII  A.    Neutral  verbs 


1.  khata 

2.  ma-khata 

3.  ni-khata 


to  be  warm' 
I  am  warm' 
You  are  warm' 


Noun  Stems 


1.  kha 
wi-kha 

2.  khi^gi 
ni-khySi 

C.    Adverbs 


'string,  sinew' 
'rope ' 

'grandmother ' 
'your  grandmother' 


1. 


thi 


kaitep 


thi-kaitepa 


'to  dwell,  live 
'diagonally' 


lean-to ' 


Underlying  initial  velars  are  also  palatalized  after  an 
ablauted  vowel,  under  certain  conditions.  Ablaut  is  a 
morphologically  conditioned  phonological  process  which  turns 
certain  a's  into  e.  A  fairly  well-known  fact  about  Dakota  is 
that  major  category  words  may  not  end  in  consonants. 
Consonant  final  nouns  and  verbs  acquire  an  epenthetic  final 
vowel,  a.  Historically,  it  was  just  these  vowels  which 
underwent  ablaut  when  followed  by  certain  suffixes  and 
enclitics  and  the  dependent  verb  stems.  Synchronically, 
however,  some  epenthetic  vowels  have  lost  this  property  while 
some  underlying  final  /a/s  have  acquired  it.  Since  there  is 
now  no  way  to  predict  which  will  suffer  ablaut,  those  stems 
which  do  undergo  it  must  be  marked.'  This  ablauted  vowel, 
then,  but  not  underlying  /e/,  will  trigger  palatalization  of 
certain  following  velars.  Furthermore,  the  velars  that 
undergo  palatalization  after  the  ablauted  a  form  a  disjoint 
set  with  the  velars  that  undergo  this  rule  after  /i/.  The 
following   examples  show  this. 


M 

'the' 

(DET) 

1. 

sapa 

sape 

^i 

2. 

thipi 

M 

3. 

phe  k 

^i 

k' 

eyas 

'but' 

1. 

kta 

140 


'black' 

'the  black  one' 
'the  house' 
'the  sharp  one' 

' future/potential ' 

' iyayi-kte  c'eyas      'he  was  going  to  go  on, 
but. . . ' 
(go  FUT  but) 

wich-o-thi-ta  gli  k'eyaS  'he  came  back  to  the 
camp,  but . . ' 

(people-LOC-live-to 
come  back  but) 

mastica-la  khute  k'eya^  'he  shot  the  rabbit, 
but' (rabbit-DIMshoot 
but) 


•black' 

'kind  of  black' 

'rather  furious' 

3.    phe-ka  'rather  sharp' 

Past  analyses  have  tried  to  squeeze  these  two  actions 
into  one  rule.  But  it  has  been  an  uneasy  fit.  First,  the 
former  velar  palatalization  action  operates  primarily  in  the 
prefix  morphology,  while  the  latter  applies  exclusively  to 
the  suffixes/enclitics,  because  it  is  dependent  upon  the 
ablauting  vowel.  The  latter  rule  is  quite  regular--it  seems 
to  apply  to  all  suffix  and  enclitic  (not  stem)  velars 
following  all  but  only  the  ablauted  vowels,  not  the  high  front 
vowel.  The  former  applies  to  the  high  front  vowel  only  in 
certain  morphologically  constrained  environments.  The 
behavior  of  the  dependent  verbs  is  most  revealing.  If  we 
consider  them  stems,  and  not  suffixes  as  they  have  been 
called,  they  behave  quite  regularly  with  respect  to  both 
velar  palatalization  rules. 


ka 

'qualifier 

1. 

sapa 

sape-ca 

2. 

ohiti-ka 
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Verb 

Comp 

ounds 

A. 

1. 

chapha 

2. 

chaphe-khiya 

B 

1. 

cheya 

2. 

cheye-kinica 

C 

1. 

'apha 

2. 

•aphe-khiya 

3. 

'amaphe-chi-chiya 

D 

1. 

'aphe-ktjza 

2. 

'aphe-chi-c^ze ' 

'to  stab' 

•to  cause  to  stab' 

'to  weep' 

'hardly  able  to  restrain 
from  weeping' 

'to  strike' 

'to  cause  to  strike' 

•I  make  you  strike  me' 

{ [LOG. -me-str ike ] -I   you- 

cause ) 

'to  pretend  to  strike' 

'I  pretend  to  strike  you.' 
( [LOC.-strike]-I  you- 
pretend) 

Notice  that  the  ablauted  vowel  of  the  complement  verb  does  not 
palatalize  the  velar  of  the  matrix  verb  but  the  high  front 
vowel  of  the  personal  pronoun  does  palatalize  the  velar.  The 
derivations  of  "I  pretend  to  strike  you"  'aphechicuze '  (XI  D 
2.)  and  "to  pretend  to  strike"  'aphekuza  (XI  D  1.),  for 
example,  would  proceed  as  follows: 


UR 
A  Personal  Pro. 

Palatalization 
B  Compounding 

Ablaut 

C  Locative 

Ablaut*  and 
Glottal  Epen. 


[a]  [pha]  [Chi]  [kyza] 

(chi  kijza] 

(chi  cyza] 

[pha  chi  cijza  ] 

[phe  chi  cijza] 

[a  phe  chi  ci^za  ] 

[  'a  phe  chi  ci^ze  ] 
'aphechici^ze  ' 


:a]  [pha]  [k\^za: 


[pha  kijza] 
[phe  kyza] 
[a  phe  kyza] 

[  'a  phe  kyza] 
'aphekyza 
the  Locatives  do 


Recall  also  that  as  shown  above  (VI  G. 
trigger  palatalization  of  the  following  velar.  Also  recall 
that  palatalization  is  not  triggered  by  a  high  front  vowel  on 
the  following  velars  of  the  formatives  shown  in  IX.   The 
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latter  is  exactly  the  reverse  of  what  we  see  in  the  verb 
compounds  above. 

It  is  apparent,  therefore,  that  we  have  two  velar 
palatalization  rules.  I  will  refer  to  velar  palatalization 
after  underlying  /i/  as  Velar  Pal  I  and  palatalization  after 
ablauted  a  as  Velar  Pal  II. 

Now  let  us  consider  the  impact  of  our  three  rules — Velar 
Pal  1,  Velar  Pal  II,  and  Ablaut--on  proposed  constraints  in 
Lexical   Phonology. 

The  prefixes  given  above  (II)  are  affixed  to  the  verb  in 
roughly  the  order  given.  Examples  were  given  in  III  showing 
the  operation  of  Vel  Pal  I  between  some  of  these  prefixes  and 
verb  stems.  One  of  the  potential  candidates  for  triggering 
Vel  Pal  I  would  seem  to  be  the  benefactive  prefix,  kl-.  It, 
however,  does  not  trigger  palatalization.  Consider  the 
following  examples: 

XII  A    1.    khuwa  'to  chase' 

2.  kikhuwa  'He  chased  it  for  him' 

B     1.  khute  'to  shoot' 

2.  kikhute  'He  shot  at  it  for  him.' 

C     1.  kaVa  'to  make' 

2.  kika»a  'He  made  it  for  him.' 

Since  this  prefix  appears  in  the  same  position  in  the  verb 
complex  as  the  reflexive,  reciprocal  and  possessive — which 
do  trigger  Vel  Pal  I--we  need  to  determine  how  these  are 
added  to  the  verb  stem  and  how  it  is  that  the  benefactive 
fails  to  trigger  Velar  Pal  I.  Notice  that  however  we  apply 
these,  we  violate  some  proposed  constraint  on  the  phonological 
rules  and  levels  of  Lexical  Phonology.  Let  us  consider  the 
following  potential  ordering: 

XIII  Morphology  Phonology 

1  Possessive  Vel  Pal  I 
Reflexive 

Reciprocal 

2  Benefactive 

3  Personal  Pronouns  Vel  Pal  I 

This  suggestion,  however,  violates  a  constraint  that  Mohanan 
(1986)  proposes,  namely,  that  phonological  rules  apply  on 
continuous  levels: 
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XIV  "The  domain  of  a  rule  may  not  contain  nonadjacent 

strata." 

If,  on  the  other  hand,  we  assume  the  following  strata, 
we  violate  a  proposal  made  by  Kiparsky  (1985)  that  all 
phonological  rules  are  "on"-  at  the  beginning  of  the  cyclic 
phonological  component  and  are  only  "turned  off"  at 
appropriate  times. 

XV  Morphology  Phonology 

1  Benefactive 

2  Possessive,  etc.       Vel  Pal  I 

3  Personal  Pronouns     Vel  Pal  I 

It  will  be  seen  below  that  the  behavior  of  phonological 
rules  with  respect  to  the  suffixes  favors  slightly  the  schema 
in  XV. 

Another  correct  prediction  we  will  be  able  to  derive  from 
this  treatment  is  that  Vel  Pal  I  "turns  off"  before  the  lower 
verb  is  added,  consider  the  following: 


XVI  A 


ani-kapj  'He  is  reluctant  to  walk' 

(walk-be  reluctant) 
(3rd  Sg.=0) 

B    nyw^-ihi^ni-khiya     'He  causes  his  swimming  to  end' 

( swim-end-CAUSE ) 

Notice  in  these  forms  that  the  final  i  of  the  lower  verb  fails 
to  trigger  palatalization.  Using  the  structure  in  XV,  we  can 
then  assume  that  the  complement  verb  is  added  to  the  verbal 
complex  on  some  level  4,  where  Vel  Pal  I  is  no  longer 
applicable . 

Another  peculiar  aspect  of  the  Benefactive  appears  to 
complicate  the  issue  here--it  appears  to  suffer  from 
"Palatalization  Creep."  That  is,  IT  can  become  palatalized, 
and  when  it  does,  it  will  become  a  palatalizing  prefix. 
Consider  the  following: 

XVII      khuwa  'to  chase' 

1.    i.    kikhuwa  'He  chased  it  for  him' 

ii.   wakikhuwa  'I  chased  it  for  him' 

iii.  makikhuwa  'He  chased  it  for  me' 
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i.    nicichuwa 
ii.   chicichuwa 


•He  chased  it  for  you' 
'I  chased  it  for  you' 


If  these  prefixes  are  added  according  to  XV  above,  we  need  to 
explain  how  the  benefactive  prefix  suddenly  acquired  the 
ability  to  palatalize  a  following  velar.  It  has  been 
proposed  by  Carter  (1974)  that  the  "benefactive  prefix  is 
marked  with  a  rule  feature  [-velar  palatalization  trigger! 
...  and  that  application  of  VELAR  PALATALIZATION  changes  the 
value  of  this  feature."  Exactly  how  a  phonological  rule  is 
to  effect  a  phonological  change  as  well  as  a  rule  feature 
change  is  not  outlined.  But  additionally,  we  would  like,  in 
the  framework  of  Lexical  Phonology,  not  to  have  to  appeal  to 
such  ad  hoc  rule  features. 

If  we  assume  the  schema  outlined  in  XV  above,  we  can 
frame  the  Vel  Pal  I  "straightforwardly"  as  follows: 


XVIII   Velar  Palatalization  I 


kh. 


ch,  cV   i 


We  can  then  allow  Vel  Pal  I  to  apply  "across  the  board"  and 
derive  the  correct  forms.  The  exact  nature  of  this  rule  is 
somewhat  problematical,  however.  One  of  the  expectations  for 
the  idea  of  the  cycle  in  phonology  was  that  it  would  explain 
iterative  application  of  phonological  rules.  We  do  not 
appear  to  be  able  to  appeal  to  the  cycle  here,  though.  In 
the  case  of  nicichuwa  (XVII,  2,  i.  above),  we  would  have  the 
following  derivation: 


UR 


II 


Benefactive 
Velar  Pal  I 
Pers  Pronoun 
Velar  Pal  I 


[ni]  [ki]  [khuwa] 
(nil  ((kij  [khuwa]] 

[[ni]  [[ki]  [khuwa]]] 
[[ni]  [[ci]  [khuwa]]] 
*nicikhuwa 


There  appears  to  be  no  way  for  us  to  obtain  palatalization  on 
the  verb   stem  without  iterative  application  of  this  rule. 

One  might  suggest  that  recent  proposals  involving  the 
internal  structure  of  segments  may  be  of  assistance  in 
explaining  this  action.  Without  going  into  great  detail  here, 
Sagey  (1986)  discusses  the  case  of  Fanti  where  alveolars 
"assimilate  in  dorsal  features  to  a  neighboring  vowel."  This 
involves  the  spreading  of  the  dorsal  node  in  the  distinctive 
feature  hierarchy  and  the  delinking  of  the  coronal  node.   We 
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seem  to  have  just  the  reverse  case  in  Lakhota:  the  coronal 
node  may  be  said  to  spread  from  the  high  front  vowel  /i/  in 
the  personal  pronoun  over  the  next  segments  (applying 
"vacuously"  shall  we  say  to  the  intervening  /i/s)  until  the 
high  back  vowel  (in  this  case)  is  reached.  However  initially 
attractive,  the  appeal  to  a  complex  structure  in  the  segment 
does  not  get  us  over  the  boundary  problems  reflected  in  the 
derivation  above. 

One  might  be  tempted  to  propose  a  post-lexical  rule 
acting  "across  the  board."  But  a  rule  of  this  nature  would 
incorrectly  predict  that  all  velars  should  palatalize  after 
/i/--an  obviously  undesirable  result. 

What  we  are  left  with  appears  to  be  a  case  of  iterative 
application  of  the  phonological  rule  Velar  Palatalization  as 
in  XVIII  above  on  levels  2  and  3  of  the  suggested  schema  in 
XV.  Notice,  however,  that  this  proposal  does  not  violate  the 
strict  cycle  condition.  Those  benefactive  forms  shown  in  XII 
are,  on  levels  2  and  3,  underived  forms  and  will  not  be 
subject  to  the  rule.  Those  forms  in  XVII  2,  however,  will 
have  undergone  morphological  rules  on  these  levels,  thus 
allowing  them  to  be  input  to  the  phonological  rule  of  Velar 
Palatalization  I--applied  iteratively. 

Velar  Palatalization  II  and  more  importantly  Ablaut,  in 
addition  to  having  what  is  apparently  a  global  relationship, 
seem  to  cause  other  problems  for  proposed  constraints  in 
Lexical  Phonology. 

Of  the  suffixes  and  enclitics  (IV  above),  some  of  those 
ending  in  a,  like  stems  ending  in  a,  undergo  ablaut  when 
followed  by  other  suffixes/enclitics.  Therefore,  not  only 
do  they  cause  ablaut,  they  may  undergo  it  themselves. 

XIX     Suffixes/Enclitics  and  Ablaut 

Suffix/Enclitic      Triggers  Ablaut    Undergoes  Ablaut 


1. 

-h? 

2. 

-pi 

3. 

-la 

4. 

-s'a 

5. 

-xca 

6. 

-kta 

7. 

-ka 

8. 

-Sni 

9. 

-xci 

10. 

-s  'e 

11. 

-tkha 

12. 

-ye/-we 

13. 

-laka 

14. 

-ska 

15. 

- ' 

N/A 


N/A 
N/A 
N/A 

N/A 

+ 

+ 

N/A 
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The  future  enclitic  triggers  a  special  rule.  It  turns 
ablauting  vowels  into  J,  but  itself  undergoes  "regular" 
Ablaut, 

XX  w^yfka  'to  see' 

w^yfk\  kta  'He  will  see  him' 

The  suffixes  and  enclitics  are  added  to  the  verb  complex 
in  the  order  given  above.  Notice  in  XX  that  once  again  a 
rule,  this  time  Ablaut,  must  not  apply  on  the  very  first 
stratum — Ablaut  is  not  triggered  by  the  first  three  suffixes. 
The  following  forms  show  the  placement  of  some  of  these 
particles  with  respect  to  each  other: 

XXI  1.    'eye-Sni  'He  didn't  say  it' 

'eya-h^-pi  'They  were  saying' 

2.  hokSi-pi-la  'The  ones  who  are  little  boys' 

3.  sapa  'to  be  black' 
sapeca  'It  is  rather  black' 

4.  t'i-kte-sni  'He  is  not  going  to  die' 

5.  miyoglas'l  hiyuya-pi  s'e  'Like  flashes  being  sent' 
(mirror-send-pl-as  though) 

6.  'owehaha-pi-la  s'a    'They  make  jokes  regularly' 

7.  ' iyokiphi-Sni  s'e     'As  though  he  was  displeased' 

8.  nayaxy-sni-ye-lakha  'Evidently  you  have  not  heard' 

9.  ci-ke-"Sni        'He  is  rather  disinclined  to  do  it' 

10.  'opha-xc^-kta  'He  really  wanted  to  join' 

11.  thath^ka   ota   opila-xca-ke  lo 

lo+pi+la+xca+ka  e] 
(buffalo  many  shoot-PL-DIM-AUG-rather  DECL) 
•They  certainly  killed  many  buffalo.' 

12.  chachiphapha  kta  tkha  ye  lo 
(stab  (redup)+I-you  FUT  contrary  DECL) 
'I  could  have  stabbed  you.' 

13.  slolye-lni-xc^       'not  knowing  verily' 

14.  wichowe-pi-la  ye     'They  are  of  common  parentage, 

little  ones. ' 
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The  suffix/enclitic  #12  in  XX  poses  somewhat  of  a 
problem.  First,  this  particular  affix  seems  to  be  simply  -e, 
which  acquires  a  glide  when  it  follows  underlyingly  vowel 
final  stems.  As  Boas  and  Deloria  (1941)  explain,  "The 
particle  ye  is  attached  to  stems  ending  in  unchangeable  a  or 
f,  or  in  e,  i,  j^.  .  .after  ,o,  u,  y,  it  takes  the  form  we." 
Ablauting  vowels  behave  somewhat  differently,  "with 
changeable  a  or  ^  it  is  contracted  to  e."  (Boas  and  Deloria 
1941)  The  interesting  question  to  ask  is  whether  the  vowel 
ablauts  and  is  then  collapsed  with  the  following  vowel  or  if 
the  ablauting  vowel  is  simply  collapsed  without  first  being 
ablauted.   Shaw  (1980)  opts  for  the  former  explanation: 

Specifically,  if  the  final  vowel  in  a  stem  normally 
subject  to  ABLAUT  does  not,  for  some  reason,  undergo 
the  rule,  then  an  epenthetic  glide  is  inserted. 
This  shows,  therefore,  that  it  is  not  just  the 
ablauting  "potential"  of  a  given  vowel  which  will 
trigger  a  merger  with  the  predicative  e;  rather, 
the  stem-final  a  or  f  must  actually  undergo  ABLAUT. 
The  type  of  data  which  show  this  are  reduplicated 
vowel-final  stems,  for  although  ABLAUT  applies  to 
these  stems  in  non-reduplicated  form,  it  never 
applies  to  the  reduplicated  f orms ... .Compare, 
therefore  the  following  data: 

a.  /yatkf=e  lo/  [yatke  lo]   'he  drank  it  (male  spkr) 

/yatk^+tkf=e  lo/  [yatkftk^ye  lo) 

'he  drank  repeatedly'  (male  speaker) 

(Shaw  1980) 

If  we  opt  for  this  solution,  however,  what  we  are  saying 
is  that  ablaut  applies  when  suffixes/enclitics  number  4 
through  10  (in  XX)  are  added  to  the  stem,  ablaut  does  not 
apply  when  number  11  is  added,  but  then  ablaut  applies  once 
again  when  the  particle  -e  is  attached.  That  is,  we  seem, 
therefore,  to  have  the  following  strata: 

XXII       Morphology  Phonology 


Morphology 

I 

-la 

II 

-s'a  through 

-s'e 

III 

-tkha 

IV 

-e  (le/lo) 

V 

-lakha 

Ablaut 
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-ska 


This  turning  rules  off  and  on  again  is  precisely  the  kind  of 
weakening  of  the  theory  that  we  would  hope  to  avoid.  Since 
the  particular  forms  to  which  this  "ablaut  loss"  occurs  are 
just  those  underlyingly  vowel  final  stems  which  did  not 
originally  (diachronically )  undergo  ablaut,  it  would  not  be 
too  surprising  to  see  that  these  forms  loose  this  feature 
under  certain  circumstances.  Also,  it  is  not  uncommon  for 
reduplicated  forms  to  become  "frozen"  in  particular  ways.  In 
addition,  this  particular  vowel  seems  to  "collapse"  with 
another  suffix,  namely  -pi  (plural),  to  form  -pe.  So  it  may 
be  rather  more  likely  that  this  is  a  vowel  deletion  process, 
whereby  the  vowels  of  vowel-final  forms  subject  to  ablaut  are 
simply  deleted,  and  we  need  not  appeal  to  ablaut. 

We  are  left  with  rather  weak  arguments  in  favor  of  either 
analysis  for  the  declarative/predicative  -e  and  ablaut  of  the 
vowel-final  stems.  The  one  strong  argument  in  favor  of  not 
allowing  ablaut  to  apply  here  is  the  fact  that  we  can  thereby 
save  the  continuous  strata  constraint  {XIV  above). 

But,  we  have  just  one  last  problem.  As  Boas  and  Deloria 
(1941)  explain,  "the  end  of  a  declarative  statement  of  a 
commonly  known  fact... is  expressed  by  the  terminal  glottal 
stop. . . .Changeable  terminal  a  becomes  e."  (suffix/enclitic  #15 
in  III  above).   Consider   the  following  data: 

XXIV  1.    tase  nit'api  kte  ka  'Surely  you  shall  not  die' 

2.    ye-la-ka  'He  is  really  going  along' 

'oxloka-la-ke- '      'That  is  some  hole' 

We  see  here  that  the  last  particle  added  to  the  verb 
complex  once  again  triggers  ablaut.  So  while  we  can  add  a 
vowel  deletion  rule  and  ablaut  to  level  III  in  XXIII  above  to 
avoid  the  "turning  off  and  on"  of  ablaut,  we  must  apparently 
"turn  on"  ablaut  one  last  time.  It  is  possible,  however,  that 
this  may  in  fact  be  a  rule  of  the  post-lexical  (phrasal) 
phonology.  I  do  not  have  sufficient  data  to  address  this 
question  here. 

We  have  seen  that  the  phonological  and  morphological 
processes  of  Lakhota  may  be  described  more  efficiently  by 
Lexical  Phonology  and  that  these  processes  have  relevance  for 
proposed  constraints  in  Lexical  Phonology. 
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NOTES 

'  Lakhota  does  not  make  phonemic  use  of  the  distinction 
+/-voice.  It  does  differentiate  between  plain,  aspirated  and 
glottalized  voiceless  stops--thus  the  spelling  Lakhota,  the 
It  is  aspirated,  the  t  is  plain.  I  use  the  notation  Ch  to 
indicate  aspirated  stops  and  C  to  indicate  glottalized  stops. 

*  These  formatives  have  been  called  dependent  verbs, 
incomplete  verbs,  auxilliar ies,  or  even  suffixes  in  Lakhota 
sources.  As  Boas  and  Deloria  (1941:74)  note,  however,  "The 
position  of  the  pronoun  is  always  as  though  they  were 
independent  verbs,  the  verb  to  which  they  refer  being  in 
subordinate  form."  In  more  current  terminology,  these 
"dependent  verbs"  would  be  martix  verbs  and  the  verbs  "to 
which  they  refer"  would  be  complements .  I  will  break  with 
tradition  and  use  the  more  current  terms. 

As  Shaw  (1980)  convincingly  shows,  this  involves  less 
marking  than  other  alternatives. 

Ablaut  is  triggered  here  by  the  "declarative  particle," 
discussed  further  below. 
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ON  DERIVING  SPECIFIERS  IN  SPANISH:  MORPHQ- PHONO- SYTfTAX  INTERACTIONS' 

Uthaiv^Tin  Wong-opasi 


The  Speuiish  masculine  singtilar  definite  article  el 
exemplifies  interactions  between  morphology,  phonology, 
and  syntax.  El  is  claimed  to  be  a  converted  feminine 
form  found  solely  in  front  of  a  feminine  noun  begining 
with  a  stressed  low  vowel  [A]  (cf.  Plank  1984,  Zwicky 
1985).  Its  regular  appearance,  however,  is  before  a 
masculine  singuleu-  noun.  Phonological  rules  such  as 
Apocope,  Vowel  Truncation  and  Epenthesis  must  take  both 
morphological  and  syntactic  information  into  considera- 
tion during  word  formation  in  deriving  the  two  instances 
of  the  definite  article  el.  The  proposed  ansdysis  is 
supported  also  by  the  derivation  of  the  whole  class  of 
Spanish  specifiers. 


Identifying  the  problc 


The  variation  in  forms  of  Spemish  definite  au'ticles,  e.g.  el  m. 
sg.,  los  m.  pi.,  la  f.  sg.,  las  f.  pi.,  and  the  whole  class  of 
specifiers,  has  been  viewed  as  highly  idiosyncratic,  and  thus,  is 
considered  as  needing  memorization.  An  issue  »Aich  has  been  taken  up 
recently  in  the  literature  is  hut  a  narrow  aspect  of  this  wide  range 
of  alternations,  namely,  the  replacement  of  la  by  el,  before  a 
stressed  [A]  of  a  following  feminine  noun.  Plank  1984  proposes 
disagreement  rules  while  Zwicky  1985  suggests  referral  rules.'  As 
criticized  by  Harris  1987,  the  former  position  is  not  justified  for 
this  phenomenon  because  the  converted  eu-ticle,  i.e.  la  — >  el  does 
not  disagree  in  gender  with  a  following  feminine  noun  beginning  with 
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a  stressed  vowel.  The  latter  pxDstulation  is  not  tenable,  either, 
since  Zwicky's  referral  rule  is  not  broad  enough  to  cover  all  the 
differing  memifestations  of  Speuiish  definite  articles.  However, 
Harris'  account,  which  is  tentative  in  nature,  needs  a  more  indepth 
exploration.  Firstly,  his  allomorphy  rule  does  not  explain  why  it  is 
that  only  definite  articles  undergo  the  change.  In  fact,  we  also 
find  simil£ir  alternations  in  other  sets  of  specifiers,  i.e.  an  array 
of  variable  determiners:  6n^  -  Ano  -  iana.  'a(n),  one',  ^ste  -  ^sto  - 
^sta  'this',  and  qusmtif iers:  algfm  -  alg<ino  -  alg6na  'some',  primer 
-  primero  -  primera  'first',  etc.  Secondly,  he  left  unexplored  the 
overlapping  areas  of  morphology,  phonology,  and  syntax.  To  this 
date,  in  the  realm  of  morphosyntax ,  no  attempt  has  been  given  in  the 
literature  to  explain  why  the  binary  masculine  vs.  feminine  contrast 
in  Spemish  must  be  releixed  to  allow  the  third  parameter,  the  neuter 
gender,  e.g.  (el)lo,  and  the  like  forms,  to  show  up  only  in 
determiners.  I  will  also  try  to  improve  on  Harris'  analysis,  e.g. 
that  the  diacritic  [A]  can  be  explained,  that  definite  articles  are 
not  themselves  pronominals  (rather,  they  are  pronominal i zed  via 
deletion  of  the  head  of  NP),  and  that  Harris'  allomorphy  rule  cannot 
distinguish  lo  bu^no  ' the  good  ( thing/part ) '  from  el  butoo  ' the  good 
one'  since  both  articles  are  masculine  pronominals.  T^e  objective  of 
this  paper  is  thus  to  argue  for  a  better,  more  comprehensive 
analysis  of  the  phenomenon. 


2.  Allomorphic  variations: 


Harris  (1987:178-181)  postulates  an  allooorphy  rule, 
responsible  for  the  varying  phonological  segments  selected  from  the 
ordered  sequence  e-l-l-o/a-(s) .  His  rule  is  reprod\*Ded  below. 

(1) 

V  -->  ja-  /  [  X  [...~]c  Y  ]  »P    (  X,  Y  possibly  null  ) 
Condition:  (a)  C  =  Pronominal 
(b)  C  =  Masculine 
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According  to  Harris,  the  notation  of  this  rule  is  intended  to  be 
neutral  over  'syntactic'  and  'phonological'  interpretations. 
Interpreted  syntactically,  (1)  specifies  the  contexts  in  which  *1  , 
el,  aqu^l  occur,  on  the  assunption  that  their  phonological 
representations  are  underived.  Interpreted  phonologically,  (1) 
specifies  the  conditions  under  which  a  vowel  is  deleted,  on  the 
assumption  that  it  is  present  in  the  underlying  phonological 
representation  of  the  items  in  question.  Harris'  approach,  however, 
is  phonological.  Rule  (1)  must  contain  the  diacritic  C  to  exclude 
final-vowel  deletion  to  nonpronominals ,  e.g.  prijno  — >  »prim 
'cousin,  m.',  emd  to  feminine  forms,  e.g.  nominals  prima  — >  «prim 
'cousin,  f . ' ;  nominative  pronouns  ella  — >  »el(l);  accusative 
pronouns  la  — >  *1;  definite  aurticles  la  — >  »1;  and  distal  deictics 
aqu^lla  — >  «aqu^l(l).  The  dropping  of  the  initial  unstressed 
syllable  of  /ellV/  of  accusative  pronouns,  is  described  as  follows: 

V  V 

(2)   $$  ]«rroa   (  S=  stressless  syllable  ) 

i 

To  warrant  the  change  from  the  feminine  to  the  masculine 
article,  i.e.  la  — >  el,  Harris  includes  the  specification  of  Y  sis  = 
[A..lNoy»,  e.g.  «la  Agua  — >  el  Agua. 

Finally,  the  label  NP  is  crucial  since  it  excludes  the 
af>plication  of  apocope  to  the  accusative  masculine  clitic  (i.e., 
ello  — >  lo  — >  »1) .  This  is  due  to  syntactic  distribution  of  these 
lexical  items.  That  is,  only  nominative  pronouns  can  be  heads  of 
hk}un  Phrases,  while  the  surticles  euid  accusatives  can  only  be 
non-heads  of  Noun  Phrases  and  Verb  Phrases,  respectively.  Evidence 
can  be  drawn  from  similar  alternations  in  deictic  forms  which  occur 
syntactically  in  isolation  as  head  of  a  Noun  Phrase  (aqu^l  'that 
one')  and  in  either  pre-  or  post-nominal  position  (aqu^l  libro,  el 
libro  aqudl) . 
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3.  In  the  lexicon 

I  propose  that  alloroorphic  variations  of  definite  articles  as 
exemplified  in  Hsirris  1987  be  accounted  for  in  a  broader 
perspective,  namely,  that  these  changes  in  forms  are  systematic  as  a 
result  of  the  spelling  out  of  a  derivational  stem  (DS)  plus  an 
ending  (or  Word  Marker  in  Harris'  1985  term),  by  the  so-called  "Word 
Marker  Spell"  operative  in  all  other  types  of  derived  lexical  items. 
This  hypothesis  is  supported  on  independent  grounds  by  similar 
variations  in  all  classes  of  specifiers  in  Spanish.  Data  below  show 
the  effect  of  Word  Msu-ker  Spell.  The  WM  is  either  the  unmarked 
masculine  [o]  or  a  more  marked  feminine  [a].  The  [s]  is  the  unmarked 
plural  morpheme. 

(3) 
UR  m.sg.     f.sg.     m.pl.     f.pl.      gloss 

1.  Indefinite  articles: 

An-t-WN  i!ino  Ana  ianos  Anas  'a(n),   one' 

2.  (juantifiers: 

algAn-t-WM  algAno  algAna    algAnos   algrlmas  'seme' 

ningAn+WM  ningAno  ningAna  (ningAnos) (ningAnas)  'none' 

prim^r+WM  priroiro  primira   priro^roa  prim^ras  'first' 

terc^r+WM  terc^ro  terc*r«   terc^ros  terciras  'third' 

postrir+WM  postr^ro  postr^ra  postr^ros  postr^ras  'post' 

Of  interest  here  is  the  fact  that  the  final  vowel  [o]  of  the 
masculine  singular  form  of  all  the  above-listed  indefinite  articles 
and  quantifiers  is  apocopated  before  a  masculine  singular  noun  so 
that  adgAno  — >  algAn,  ningAno  — >  ningAn.  etc.  Following  is  my 
formulation  of  the  apocope  rule,  which  is  described  as  the 
truncation  of  the  association  line  linking  the  vowel  slot  to  the  WM 
in  the  morphological  tier. 
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(4) 

Apocope: 

— > 

WM  / 

V 

Condition 

:  X  = 

Head  N 

Ispec   X 


Else.  sing,  axxi  nonnull 

The  feature  masculine  singulsu-  of  the  following  Head  N''  is 
crucial  to  exclude  the  rule  from  applying  to  the  feminine  gender  and 
the  plural  ntmber.  These  specifiers  must  also  precede  the  head  of  MP 
since  when  they  follow,  no  apocope  occiirs  (cf.  algfai  hiimbre  'some 
man'  vs.  hiSmbre  algiino  'any  man').  It  is  also  mandatory  that  the 
specifiers  and  the  Head  N''  are  sister  nodes  of  the  same  maximal  NP 
to  account  for  the  lack  of  apocope  in  the  accusative  clitics.  (For 
the  syntactic  analysis,  cf.  section  4  below.) 

The  reason  why  only  the  masculine  singular  form  undergoes 
apocope  can  be  explained  as  follows.  Word-final  rhyme  in  Spanish  is 
a  weak  position  where  the  phonetic  segment  contained  therein  may 
suffer  loss,  e.g.  through  apocopation  or  deletion  as  found  in  the 
s-Aspiration  process.  Moreover,  unlike  VMs  in  nominals  which  must  be 
specified  as  [+faninine]  to  bring  out  the  WM  [a],  or  the  more 
constrained  [-low]  for  vowels  other  than  the  masculine  [o]  and  the 
feminine  [a],  e.g.  [e],  [i]  or  [u],  etc.  (cf.  Harris  1985),  WMs  of 
specifiers  and  adjectival  forms  systematically  end  in  the 
contrastive  pair,  i.e.  the  overt  [o]  or  zero  masculine  vs.  the 
more-marked  feminine  [a]  (guapo  -  guapa,  espaflol  -  espafiola ) ,  or 
shGa:*e  a  noncontrastive  marker  for  both  genders  (grande,  cualquiera) . 
"Hierefore,  in  closely-knitted  units  such  aa  Specifier  +  N,  the 
unmeirked  masculine  singular  WM  is  dropped  out. 

The  following  derivation,  which  is  consistent  with  the  cyclic 
proposal  in  Wong-opasi  (1987a,  b),  demonstrates  the  effect  of 
apocope.  TTie  cyclic  approsuih  also  accounts  for  stress  distribution, 
i.e.  the  penultimate  vs.  ultimate  stresses  in  the  full  vs. 
apocopated  forms,  respectively  ( algiino  -  algiin ) . 
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(5) 
faigunlos  +  WM  lsp«c 
algun)D.s  +  V*l 

NY 

S  $ 

algim   +  WM 


alg6n  +  p 

Its 

al£i!ino 
S  $     $ 


al, 


$  S 

U 

algAn 


[papeDos  +  WM  ]» 
papel ) 0  s  +  VW 

$  S 

pap^l  +  WM 

I 


$  $ 

pap^l 
$  S 

M 

pap^l 


papil 


'some  paper' 

Syllabi  ficatic»i 
Stress  Assignment 

Slot  Supply 

WM  Spell 

Resyllabification 
Surface  Represent 


Apocope 
Resyllabification 

Surface  Hepresent 


I  represent  here  the  deriN-ation  of  algi!in(o)  and  pap^l.  first  as 
separate  lexical  entries  (cf.  steps  1-6)  and  later,  as  a  syntactic 
unit,  i.e.  NP,  to  demonstrate  the  effect  of  apocope  (cf.  steps  7-9). 
Step  1  is  syllabification.  Wong-opasi  (1987a,  b)  proposes  the  rule 
of  stress  assignment,  "Stress  the  last  syllable  of  the  rightmost 
deri\-ational  stem  (DS)  unless  marked  otherwise  in  the  lexicon  by 
Extrametricality."  This  is  peu-tially  due  to  the  fact  that  the  final 
syllable  of  DS  is  always  heavy,  under lyingly.  In  effect,  primair 
stress  is  placed  on  the  last  syllable  of  edgun  and  papel  to  give 
algAn  £ind  pap^l  before  the  integration  of  W1  in  step  3  where  a  V 
slot  must  be  supplied  for  the  attachment  of  the  VW  in  step  4. 
Lexical  resyllabification  read,justs  the  outputs  to  include  the  new 
morpheme.  Ihe  surface  representation  (SR)  of  the  specifier  at  this 
point  is  thus  the  parox-ytone  alg6no  although  its  UR  carries  ultimate 
stress.  When  the  specifier  staixis  in  front  of  a  masculine  singular 
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nominal,  apocope  applies  in  step  7  where  the  association  line 
between  the  WM  and  the  skeletal  plane  is  cut  in  the  process  of 
apocope  so  that  the  WM  is  not  realized  phonetically.  Lexical 
resyllabification  reapplies  in  step  8.  The  surface  representation  of 
specifiers  after  apocope  preserves  the  underlying  ultimate  stress 
given  that  the  WM  is  dropped  out.  Thus,  the  surface  contrast  algiiino 
-algrim  is  explained.  Apocope  cannot  apply  to  the  feminine  singular 
given  that  the  Spec  must  precede  a  meisculine  singular  head  N'',  nor 
can  it  affect  the  plural  forms  of  both  genders  since  the  WM  no 
longer  staixls  at  the  edge  of  the  word. 


It  can  be  noted  that  apocope  is  restricted  to  certain  words  in 
Spanish.  Firstly,  only  three  ordinal  nunbers  and  three  adjectives: 
bti^no,  mAlo,  grdnde  undergo  the  final-vowel  deletion  sis  a  result  of 
their  being  preposed  from  the  post-NP  position  to  be  an  adjunct  of 
specifiers.  Otherwise,  apocope  applies  to  specifiers  themselves, 
e.g.  determiners,  deictics,  and  certain  quantifiers,  among  others. 
Hiat  is,  selected  items  within  these  lexical  classes  are  apocopated 
in  accordance  with  the  Possible  Syllable  Structure  Condition  which 
allows  only  sonorants,  i.e.  alveolar  nasal  [n] ,  and  liquids  [1,  r] 
in  word-final  position  in  Spanish.  Thus  it  rules  out  apocope  in 
forms  which  would  otherwise  end  in  a  nonpermissible  coda. 


(6)  I.  Ordinal  Nvmbers: 


priro^ro  — >   primer 

'first' 

seg6ndo  ~>   «segT!ind 

'second' 

terc^ro  ~>   tercer 

'third' 

cuArto     ~>   *cuArt 

' fourth ' 

quinto     ~>   «qulnt 

•fifth' 

six-to       ~>    «s*xt 

•sixth' 

s^ptimo  ~>   »s*ptim 

' seventh ' 

octAvo     —  >    »octAv 

•eighth' 

nov^no     ~>   tnov^n 

'ninth' 

d^imo     — >   «d6cim 

'tenth' 

postriro  ->  postr^r 

'last' 

liltijDo     -->   «AltiJD 

'last' 

ri.  Quantifiers: 

algtino     ~>  algiin 

'some' 

ningrino  —  >  ningiiii 

•none' 

ci^rto     — >   «ci*rt 

'certain' 

tAnto       — >   «tAnt    'so 

much/many' 

fll.  Adiectives: 

bu^no       — >  bu^n 

'good' 

niAlo         ~>  mil 

'bad' 

As  seen,  the  three  apocopated  ordinal  nunbers  end  in  [r,  1,  n).  The 
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11-formed  «nov^n  is  the  only  exception  to  the  rule. 


The  fact  that  ordinal  numbers,  quantifiers,  and  a  few 
ad,jectives  manifest  apocope  when  they  precede  a  noun,  e.g.  bu^n 
h6mbre  'good  man',  mAl  llbro  'bad  book',  but  they  preserve  the  full 
form  postnominally,  e.g.  hAmbre  bu6no  'good  man',  libro  mAlo  'beui 
book',  may  suggest  their  sensitivity  to  the  pre-NP  position. 
However,  this  is  not  generalizable  to  all  prenominal  adjectives, 
e.g.  »p6r  h6mbre  'mere  man'  vs.  h6mbre  p6ro  'pure  man',  »p6br  nlfio 
'unfortunate   child'   vs.   nino   p<!)bre   'penniless 


child' . 
latter 


One 
case 


observation  can  be  pointed  out  here.  Adjectives  in  the 
create  contrastive  meaning  with  respect  to  their  position  with  the 
noun  they  modify  while  those  in  the  former,  apocopating  set  do  not. 
On  the  assumption  that  transformations  cannot  change  meaning,  we 
must  claim  that  contrastive  ad,jectives  are  generated  as  separate 
horoophonous  lexical  entries  and  each  takes  its  corresponding 
position  in  relation  to  the  nominals,  i.e.  Ad.j.  +  N  vs.  N  +  Adj.,  as 
specified  in  the  lexicon.  Nevertheless,  contrastive  vs. 
noncontrastive  meaning  is  not  a  reliable  parameter  since  we  also 
find  lack  of  apocopaticai  in  noncontrastive  preposed  adjectives  such 
as  edificio  Alto  — >  Alto  ediflcio  'tall  building',  Arbol  bonlto  — > 
bonito  Arbol  'beautiful  tree'.  I  advocate,  in  contrast,  that  the 
difference  lies  in  the  fact  that  adjectives  which  allow  apocope  are 
preposed  from  the  postnominal  position  and  ad,joined,  not  to  its 
sister  node  N°,  as  in  the  ceise  of  nonapocopated ,  noncontrastive, 
preposed  adjectives,  but  to  the  higher  Specifier  position.  The 
relevant  structures  are  represented  below. 


el         nlfio  pibre 
'the  penniless  child' 


el       pAbre    nlilo 
'the  unfortunate  child' 
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N'  ' 


I       I 

edificio  alto 


bueno 


As  can  be  seen,  it  is  the  Spec  position  that  triggers  apocope. 
It  is  predicted,  then,  that  when  the  movement  is  not  to  Spec  as  in 
the  rest  of  adjectives  (cf.  (7.11)  or  when  the  adjectives  which 
allow  apxx^ope  (cf.  7. Ill)  form  a  complex  adjectival  unit  with  other 
ad,jectives,  no  apocope  occurs,  e.g.  un  buen  padre  'a  good  father' 
but  un  bueno  y'  carinoso  padre  'a  good  and  loving  father'  (cf .  un 
»buen  y  carifloso  padre)  . 

We  turn  now  to  the  more  complicated  derivation  of  Spanish 
deictics  which  involve  the  rule  of  epenthesis  in  addition  to 
apocope.  Iheir  distribution  is  as  shown  below. 


(8) 

Deictics: 

m.sg. 

neuter  sg. 

f.sg. 

m.pl. 

f.pl. 

*ste 

*sto 

*sta 

*St08 

*stas 

*se 

*so 

*sa 

*SOS 

*sas 

aqu^l 

aqu^Uo 

aqu^lla 

aquillos 

aqu^l 

gloss 

'this/ these' 
'  ttiat/those ' 
' that/those 
over  there' 


The     above  distribution  always  puzzles  language  learners  since 
Spanish  has     caily  two  genders:  —  masculine  and  feminine  —  yet  we 
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find  the  neuter  gender  only  in  deictics  (6sto,  iso,  aqu^llo), 
definite  article  {*llo,  lo),  and  accusative  pronoun  (lo).  I  propose 
that  the  answer  to  this  problem  lies  in  the  morpAiosyntax  of  these 
items.  For  the  moment,  let  us  focus  on  the  morphophonological 
aspects  of  the  contrastive  masculine  singular  and  neuter  forms  of 
deictics . 

The  underlying  representation  of  all  three  deictics  is  ^st+WM; 
^s+WM;  and  aqu^l+WM,  respectively.  I  postulate  that  the  masculine 
singular  deictics,  emother  class  of  specifiers,  when  standing  in 
front  of  a  masculine  noun,  are  subject  to  apocope.  Hence,  6sto  — > 
/^st/ ;  ^so  — >  /^s/;  agu^llo  — >  aqu^l  (the  form  between  two  slashes 
denotes  an  intermediate  stage  of  derivation).  Since  /»4st/  cannot 
surfEKJe  because  of  its  final  consonant  cluster,  the  CV  adjustment 
provides  an  additional  syllable  nucleus  to  break  the  cluster  into 
heterosyllables ,  yielding  ^ste.  Although  cluster  simplification 
cannot  be  claimed  for  the  derivation  of  6se  from  ^so,  one 
observation  is  helpful  here.  That  is,  [s]  is  a  highly-marked 
word-final  rhyme  in  Spanish,  found  mostly  in  morpheme  markers,  i.e. 
plural  ending  of  nonverb  forms  and  person  markers  of  verbs,  and  in  a 
handful  of  lexical  items.  When  encountered  with  a  marked  word- final 
consonantal  coda,  Spanish  provides  two  alternatives,  either  dropping 
the  final  C  or  preserving  it  with  the  help  of  epenthesis,  as 
practiced  in  the  nativization  of  loan  words  such  as  beefsteak  — > 
bist^(k).  check  — >  ch^ke,  ticket  — >  tik^te.  and  so  on.  In  the  case 
of  deictics,  epenthesis  is  chosen  in  order  to  preserve  information 
which  would  otherwise  suffer  further  loss,  i.e.  ^sto  — >  /^st/  — > 
/^3/  — >  t^  and  ^so  — >  /^s/  — >  *i.  Additionally,  there  is  a 
palatalization  rule  of  [1]  before  em  overt  WM,  e.g.  aqu^llo, 
aqu^lla.  etc.  from  aqu^l-t-WM.  and  ^llo,  4illa  from  H-WM  (cf.  (11) 
below) .  6  The  claim  that  the  final  [e]  of  both  ^ste  and  ^se  is 
epenthetic  is  clearly  supported  by  their  plural  forms:  ^stos  and 
6sos,  respectively  (cf.  *^stes;  *^ses) . 

We  will  now  look  at  the  veu-iation  of  definite  articles, 
nominative  pronouns  and  accusative  clitics.  According  to  Harris  1987, 
the  underlying  representation  is  ellV  for  all  these  lexical  classes. 
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However,   1  will  account  also  for  neuter  forms.  Harris'  proposal  is 
given  in  (9)  while  I  offer  a  broader  analysis  in  (11). 


(9) 
Noun 

Non  Pron 

Ace  Pron 

primo 

ella  ello 

ella  ello 

Underlying  Phono.  Repres 

ell 

Rule  (1) 

prljDo 

^lla  «11 

Stress  Assignment 

la    lo 

Rule  (2) 

«1 

Depedatalization 

primo 

*lla  *1 

la    lo 

Output 

The  explanation  is  straightforward.  The  final  vowel  of  the  masculine 
nominative  pronoun  611o  is  not  manifested  phonetically  due  to  the 
effect  of  mle  (1).  Harris  attributes  the  non-application  of  rule 
(1)  to  accusative  pronouns  to  syntax.  That  is,  the  nominative 
pronouns  can  only  be  heads  of  Noun  Phrases  while  the  eirticles  and 
accusative  clitics  can  only  be  non-heads  of  Noun  Phrases  and  Verb 
Phrases,  respectively.  Thxxs  the  status  of  these  three  sets  of  forms 
as  clitics  or  not,  hence  their  ability  to  be  stressed,  is 
predictable  (inversely)  from  their  status  as  heads  of  syntactic 
phrases.  in  effect,  stress  assignment  operates  on  nouns  and 
nominative  pronouns  but  not  on  accusative  clitics.  Subsequently, 
rule  (2)  applies  to  drop  the  initial  stressless  syllable  of  the 
accusative  clitics.  The  intermediate  form  /«*11/,  however,  cannot 
siirface  since  Spanish  bars  word-final  palatals.  Thus, 
depalatalization'  follows  to  render  *1  (cf.  tdonc^ll  — >  donc^l; 
desd^fi  — >  desdto). 

Harris'  observation  of  the  distinction  between  the  nominative 
and  the  accusative  case  for  the  application  and  non-application  of 
apocope  is  valid  since  these  two  cases  are  syntactically  different. 
However,  instead  of  postulating  that  the  apocopated  forms  must  be 
pronominal,  I  propose  that  they  are  specifiers.  Both  the  nominative 
pronouns  and  pronominal i zed  definite  articles  are  specifiers 
originally  but  can  be  pronominal i zed  via  deletion  of  head  NP  (cf . 
section  4).  Moreover,  rule  (2)   is  untenable  since  there  is  no 
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obvious  reason  why  the  initial  stressless  syllable  must  be  dropped. 
Harris'  analysis  is  not  clear,  either,  how  the  sequence  ellV  can  be 
parsed  as  el-la  where  the  two  liquids  do  not  form  a  palatal. 
Furthermore,  assuning  that  the  palatal  consonant  is  in  the  UR, 
Harris'  depalatalization  only  takes  care  of  word-final  palatals  and 
not  those  in  word-initial  position.  Alternatively,  I  suggest  that 
the  UR  of  this  morphosyntactic  entity  is  1+WM.  T^is  accounts  for  the 
opposing  forms  lo,  lo,  la  for  the  neuter,  masculine,  and  feminine 
gender,  in  that  order.  The  monosyllabic  vs.  bisyllabic  alternants, 
e.g.  ello  vs.  lo,  is  due  to  the  universal  requirement  that  a 
phonological  word  must  contain  a  minimal  stress  unit  in  languages 
that  have  stress  contrast  (cf.  McCarthy  and  Prince  1986).  For 
Spanish,  both  the  articles  and  the  accusative  clitics  cannot  exist 
Eis  a  sepeu~ate  word  phonologically  while  nominative  pronominals  can 
be  found  in  isolation.  Therefore,  nominative  pronominals  are 
required  to  carry  stress.  However,  the  underlying  monosyllabic  l+WM 
poses  some  problems.  Since  stress  is  assigned  before  the 
manifestation  of  WM  (cf.  Wong-opasi  1987a,  b) ,  it  follows  that 
stress  cannot  be  placed  on  WM.  Tlius,  the  phonetically  empty  stressed 
syllable  of  the  nominative  pronominals  must  be  filled  and  the 
default  vocalic  nucleus  for  Speuiish  is  the  epenthetic  [e].  This 
process  is  illustrated  in  (10).  Peculiarly,  the  [1]  before  an  overt 
>*1  is  palatalized  (cf.  aqu^l-fo/a  — >  aqu^llo/a). 

(10)  Syllable  Addition: 

Word  Word 


Dbs  +  Wl  *  Dos  +  WM 

Following  is  the  full  derivation  of  all  the  forms  in  question. 

(11) 
N  Ncm  M  Noo  F  Norn   N  Art  M  Art  F  Art  N  Ace  M  Ace  F  Ace 

1.  1+WM   1+WM   1+VW   1+WM   1+WM   1+WM   1+WM   1+VW   1+VM  UR 

2.  $-l+VW  $-l+WM  $-l+WM  SA 
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3.  *-l+WM  *-l+WM  *-l+VgM  10 

4.  *-lo   6-lo   6- la     lo     lo     la     lo     lo     la   WM 


5. 

4-1 

«1 

Ap 

6. 

el 

Ep 

7.  *llo 

411a 

Pa 

8.  $  $ 

\A 

9.  *llo 

S 

\ 

41 

$  $ 

i 

$ 

\ 

el 

$ 

$ 

h 

lo 

i 

$ 

k 

RS 
SR 

Forms  shown  in  step  1  are  the  underlying  representation  (UR):  1+WM. 
Step  2  is  the  stress  assigninent  (SA)  and  it  only  applies  to 
nominative  pronouns  but  to  neither  definite  articles  nor  accusative 
clitics  since  the  latter  two  are  in  [-stress]  positicm  and  they 
cannot  exist  as  independent  words,  i.e.,  nonpronominsLlized  articles 
and  clitics  cannot  be  used  in  isolation  without  the  head  of  the  NP 
and  VP,  respectively,  e.g.  tel  'the'  (cf.  el  libro  'the  book'),  *lo 
'it'  (cf.  lo  termini  'I  finished  it').  However,  given  the  underlying 
nonsyllabic  1,  rule  (10)  applies  to  add  a  stressable  unit  to  the 
prosodic  word  here.  Regular  WM  is  spelled  out  in  step  4.  Apocope 
(Ap)  takes  effect  only  on  the  masculine  nominative  pronoun  and 
article  but  not  on  their  accusative  counterpart.  (This  point  will  be 
exemplified  later.)  After  apocope,  the  masculine  singular  definite 
article  in  step  5  cannot  be  left  stranded  as  «1;  therefore,  the 
consonant  is  first  parsed  as  a  possible  coda  according  to  the 
unmarked  Possible  Syllable  Structure  Condition  of  Spanish,  hence  its 
nucleus  vowel  is  filled  in  with  the  Spanish  unspecified  vowel  [e]. 
Finally,  the  [1]  of  specifiers  is  palatalized  when  it  stands  before 
an  overt  WM.  Lexical  Resyllabification  (RS)  in  step  8  brings  out  the 
surface  forms. 

Tlie  above  derivation  also  accounts  for  the  stressed  masculine 
singular  nominative  pronoun  41  and  the  unstressed  masculine  singular 
definite  article  el.  Tliat  is,  the  epenthetic  form  naturally  does  not 
receive  stress  given  its  stressless  position.  Postulating  the 
epenthetic  [e]  of  definite  article  is  also  supported  by  the  fact 
that  the  vowel  in  question  does  not  show  up  in  contraction  with  a 
monosyllabic  preposition  which  precedes  such  as  a  'at,  to'  and  6& 
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'of,   from',   e.g.  a  «el — >  al;  de  «el  — >  del  while  contraction  is 
impossible  in  stressed  forms,  e.g.  a  ^1  — >  <al;  de  ^1  — >  «del. 

The  epen thesis  to  a  stranded  nonsyllabic  consonant  readily 
accounts  for  the  change  from  the  feminine  definite  article  la  to  el 
in  front  of  a  stressed  [A]  of  a  following  feminine  noun.  I  propose 
to  dispense  with  Heirris'  diacritic  [&...1noui.  First,  it  is  peculiar 
why  it  is  only  the  stressed  [A]  and  not,  say,  unstressed  [a]  or 
other  [+/-stressed]  vowel.  Secondly,  there  is  no  explanation  why  the 
switch  should  be  to  the  masculine  singular  form  and  not  others  even 
though  the  gender  of  the  article  in  question  is  still  feminine  in 
accordance  with  its  following  noun.  Finally,  I  claim  that  apocope 
and  truncation  before  [4...]n  Sire  separate  processes.  Moreover,  the 
two  rules  are  ordered  before  and  after  the  rule  of 
pronominalization,  respectively.  Hence  the  distribution:  el  Alto,  lo 
Alto,  and  la  Alta  (but  el  Alma,  »la  Alma)  is  accounted  for.  The 
first  two  points  are  described  sis  follows. 

The  WM  of  the  feminine  singular  definite  and  indefinite 
articles  la  and  una  is  truncated  in  front  of  a  following  stressed 
vowel  of  exactly  the  same  shape  (cf.  rule  (12))  while  other  vowels 
will  not  cause  vowel  deletion.  (13a,  b)  are  sample  derivaticais  of 
the  la  -  el  replacement. 

(12)  Truncation: 

a  — >  er  /   ]5p.c   X 

Condition:  X  =  head  N'',  feminine  sing,  and  beginning  with  [A] 

(13) 

$  $  $  S  I  $  $ 


a.  1+a  Agua  !     b.       urt+a  Aina 

u 


UR 


c  V  V  ;     c  V  V 

1   Agua  ;  un   Ama        Truncation 

II  II 

C   V  i       C   V 
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el  I         -  Bpenthesis 

$   $       ;       $   )  Resyllabif. 


/I  I  M 

el       Agu&  I  un       ama 


iigua. 

After  truncation  of  the  feminine  WM,  the  remaining  onset  [1]  is 
stranded  in  the  same  fashion  03  the  {1]  of  the  masculine  singular 
definite  article  after  apocope  has  applied.  Hence,  epenthesis 
follows  automatically  to  remedy  the  situation.  It  is  not  accidmtalt 
then,  that  the  resulting  form  of  definite  airticles  in  both  cases  is 
the  same  el  given  that  the  WM  of  lo  and  la  suffers  phonetic  loss 
(through  apocope  and  vowel  truncation,  respectively)  and  the 
epenthetic  vowel  to  prevent  further  loss  is  the  unspecified  vowel 
[e].  An  apparent  switch  in  gender  of  the  indefinite  article  una  — > 
\m  before  a  stressed  [A]  is  adso  due  to  vowel  truncation  and  not  by 
either  the  disagreement  rules  or  the  referral  rules. 

It  must  be  distinguished  that  apocope  is  a  more  general  rule 
which  is  obligatory  to  all  specifiers  or  the  materials  inside  Spec, 
while  vowel  truncation  is  obligatory  only  to  determiners  and 
optional  to  other  specifiers.  The  optional ity  of  vowel  truncation  is 
examplified  in  m6cha  Agua  'much  water',  tAnta  (h)ambre  'a  lot  of 
hunger',  ^sta  Ama  'this  mistress',  where  the  adjacoit  low  vowels  can 
be  pronounced  as  two  separate  vowels  or  eis  one  single  vowel. 
Harris 'observation  of  the  differing  aqu^lla  Agua  vs.  aqu^l  Agua 
'that  water'  is  borne  out  from  our  prediction  (Harris  1987:181,  fn. 
7).  Of  course  with  an  additional  palatalization  of  (1]  where  no 
vowel  truncation  applies  according  to  the  present  analysis. 


4.  Postlexicon 


Lexical  Allomorphy  Rule  and  Postlexical  Apocope,  Vowel 
Truncation  and  Epenthesis: 
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As  mentioned  earlier,  the  feature  Spec  for  the  applicability 
vs.  nonappl icabi  1  i ty  of  apocope  to  the  UR  l-t-W  can  be  explained  in 
syntfictic  terms.  Specifically,  the  rule  cannot  operate  on  accusative 
clitics  because  they  neither  originate  in  nor  can  they  be  preposed 
to  Spec  position.  Accusative  pronouns  are  clitics  and  they  cannot 
exist  as  sepeo^te  words  but  are  cliticized  to  verbs.  I  follow 
Zagona's  1987  proposal  for  the  structural  description  of  the 
cliticization  rule  as  shown  below. 

(14) 

V'  a)  Clitic  bears  Case  feature. 


b)  Clitic  Spellout:  [+CASE]  — > 
[ o( PERSON , ^ NUMBER , ^  GENDER ] 


V+CLi 

Tlie  following  example  is  an  approximation  to  Zagona's  rule. 


(el  libroi) 

(Assignment  to  N"  sister  in  syntax) 

The  accusative  noun  (el  libro)  contrasts  with  the  acctjsative 
clitic  (lo)  in  that  the  specifier  of  the  noun  form  is  subject  to 
apocope  due  to  its  being  a  specifier  before  a  masculine  sin^fular  N 
both  belonging  to  the  same  maximal  projection,  i.e.  N",  while  the 
clitic  must  be  spelled  out  by  the  cliticization  rule  which  spells 
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out  the  case  assigned  by  the  verb  maintaining  the  same  person, 
nunber,  arxd  gender  as  its  antecedent.  In  morpho-phono- syntactic 
terms,  the  form  chosen  is  lo/a(s)  and  not  the  stressed  form  because 
clitics  are  in  [-stress]  position.  Moreover,  there  is  no  apocope 
here,  given  that  the  clitic  pronoun  is  cliticized  to  the  verb  which 
does  not  meet  the  structural  description  of  the  rule,  regardless  of 
the  position  of  the  clitic  to  the  verb,  as  proclitic  or  enclitic. 

The  alloroorphy  rule  of  specifiers  is  the  same  lexical  rule  as 
the  one  which  selects  the  proper  WM  of  a  lexical  item,  including 
that  of  the  specifiers.  This  predicts  that  in  the  lexicon,  we  have 
the  [+stress]  selection  ^llo,  ^lla,  ^llos,  ^llaa  as  well  as  the 
[-stress]  alternants  lo,  la,  los.  las.  In  contrast,  variations  such 
as  lo  -  el  and  la  -  el  (before  [A] )  which  are  caused  by 
apocope /vowel  truncation  and  the  subsequent  epenthesis  are 
roorphosyntactic  rules  since  the  structural  description  for  their 
operation  must  include  syntactic  information.  Tliua,  the  latter 
processes  are  found  in  the  postlexical  component. 


4.2.  Neuter  =  pro-Head  N"  and  Pronominal ization: 


In  the  following  section,  three  points  will  be  argued  for.  (i) 
to  replace  the  feature  [+neuter]  gender  of  specifiers  and  pronouns, 
(^llo,  lo;  ^sto,  ^so.  and  aqu^llo),  with  the  notion  "pro-Head  N"" 
(or  "headless  N''").  Tliis  correctly  accounts  for  the 
ungrainnaticality  of  forms  such  as  «(^l)lo  hombre*  'the  man'  (cf.  el 
hombre ) ,  *^sto  libro  'this  book'  (cf.  ^ste  libro),  etc.  (ii)  to 
falsify  the  appeu-ent  existence  of  the  neuter  gender  given  that  the 
neuter  exponence  is,  in  fact,  a  morphophonological  variant  of  the 
unnKU-ked  masculine  singular  gender  and  nimber  with  all  its 
morpho-phono-syntactic  properties  still  left  intact,  e.g.  lo  bueno 
(«lo  buena,  »lo  buenos,  »lo  buenas;  »la  bueno,  »los  bueno,  >las 
bueno).  Ihe  masculine  singular  features  are  clearly  seen  in  the 
correlation  between  the  headless  specifier  and  the  adjective.  (iii) 
to  prove  that  specifiers  are  not  themselves  pronominals  but  rather 
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they  can  be  pronoounalized  via  deletion  of  the  head  of  NP, 
hombre  bueno  'the  good  nan'  — >  el  bueno  'the  good  one'. 


e.g.  el 


A.  Lexical  lo/a  vs.  Postlexical  el 


Points  (i)  and  (ii)  above  su^  illustrated  below.  Since  the  Spec 
position  is  a  stressless  position,  the  form  chosen  must  be  the  weak 
one:  l+WM  (and  not  the  strong  counterpart:  ^11-t-WM) .  We  will  consider 
the  derivation  of  specifiers  in  the  structure:  Spec  +  Head  N '  '  ■*■  X" 
(X  =  any  complenient/niodifier  phrase). 


(16) 


_S£ 


1+WM 

libr+WM  buen+WM 

;  l+WM  Alm+WM  buen+WM 

1. 

lo 

libro   bueno 

!  la 

Alma   buena 

2. 

1 

; 

- 

3. 

- 

!  1 

4. 

el 

iel 

5. 

el 

libro   bueno 

!el 

dim   buena 

'the  good  book' 

;'the 

good  soul' 

neuter  sg. 

1+WM  pro  buen+VW  UR 

lo   pro  bueno   VW 

Ap 

VT 

Bp 

lo      bueno   SR 

'the  good  (thing)' 


All  WMs  are  spelled  out  in  step  1:  the  m.sg.  [o],  the  f.sg.  [a],  and 
in  the  presence  of  a  'pro'  head,  the  default  m.sg.  [o]  for  the 
so-called  "neuter  sg."  (cf.  also  the  [o]  ending  of  the  strong  form 
^llo  and  the  deictics:  ^sto,  ^so,  and  aqu^llo) ■  TTie  masculine  and 
feminine  singular  definite  articles  lose  their  WMs  via  apocope  and 
vowel  truncation,  respectively  (steps  2  and  3)  while  the  two  rules 
are  inapplicable  to  the  neuter  article  since  it  stands  before  a  pro. 
{The  head  is  phonetically  absent  given  that  it  is  neither 
identifiable  nor  is  there  a  corresponding  overt  lexical  noin  as  it 
often  refers  to  an  abstract  idea,  usually  rendered  as  'thing/part' 
in  English.)  Tlie  above  derivation  also  shows  the  agreement  of  the 
Spec  with  the  Head  N'  '  and  X' '  (or  the  ad,iective  here). 
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B.   Pronominal ization  euid  Lack  of  Apocope/Vowel  Trvmcation 

I  postulate  that  pronominal  ization*  in  Spanish  is  done  via 
deletion  of  Head  N''.  It  is  possible  in  Spanish  but  not  in  English 
due  to  the  rich  inflection  on  the  Spanish  determiners.  In  syntactic 
terms,  this  is  done  by  coindexing  the  determiners  with  the  head, 
thereby,  encoding  features  such  as:  [o(  CASE, 'S  NUMBER, J  GENDER)  onto 
the  specifiers.  Thus,  structurally,  specifiers  must  be  represented 
under  N'',  or  N"»»,  for  reasons  of  c-coninand.  Giorgi  1985  argues  for 
a  structure  similar  to  the  one  represented  below. 


(17)  N 


Spec 


eli 

vasoi 

lleno  de  agua 

la* 

chica, 

con  unas  rojas 

los. 

mo  linos, 

de  Espafia 

lasi 

frutasi 

que  llegaron 

(X' 


S') 


'the  glsiss  filled  with  water' 
'the  girl  with  red  nails' 
'the  windmills  from  Spain' 
•the  fruits  that  arrived' 


e  Pronominalization 

Giorgi  1985  proposes  the  above  structure  for  the  NP  in  Italian. 
What  concerns  us  here  is  the  fact  that  such  a  structure  maintains 
the  basic  regularity  expressed  in  (18): 

(18)  If  an  anaphor(/-  c-comnands  its  antecedent 'p  no  principle  B 
or  C  violation  arises. 


It  is  clear  from  (18)  that  Spec  c-coramands  everything  else  dominated 
by  N'.  Tlierefore,  deletion  of  the  antecedent  ijnder  N»  is  possible 
after  coindexing  it  with  the  specifier,  while  in  English,  the 
position  must  be  filled  by  a  pronominal  due  to  lack  of  inflectional 
percolation),  e.g.  el  lleno  de  agua,  la  con  uftas  rojas,  los  de 
Espafei,   las  que  llegaron,  and  so  on.  Coindexing  between  the 
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specifier  and  the  Head  N''  in  the  process  of  pronominal ization  also 
accounts  for  the  obligatory  retention  of  specifiers  e.g.  Voy  a  ver 
la  pelicula  clAsica  y  la  modema.  'I  am  going  to  see  the  classic 
fiLn  and  the  modem  one.  '  (cf y  tmodema.  ) . 

The  intervening  node  N'  is  important  due  to  the  fsict  that  such 
a  structure  allows  the  N'  to  take  a  complement  under  it  whereas 
modifiers  such  as  adjectives  must  be  dominated  by  N".  In  addition, 
representing  distinct  levels  of  domination,  as  shown  below,  predicts 
the  ungramnaticality  of  la  chica  con  ufias  rojas  »bella  'the  pretty 
girl  with  blue  eyes'  given  that  movement  of  adjectives  must  be  done 
within  the  same  domain  (cf.  la  chica  bella/la  bella  chica  con  ufias 
rojas  in  (19. I. a,  b) )  or  to  account  for  their  differing  meaning  to 
la  chica  con  unas  rojas  bellas/bellas  uflas  rojas  'the  girl  with 
beautiful  red  nails'  in  (19. II. a,  b). 


la  chica  bella  con  unas  rojas  la  bella i  chica  t,  con  trflas  rojas 


la  chica  con  ufias  rojas  bellas  la  chica  cai  bellasi  uflas  rojas 


TTie  proposed  structure  generalizes  the  contrast  between  the 
[+specified]  vs.  the  [-specified]  gender  form  of  specifiers 
according  to  the  distinction  deleted  vs.  null  head  of  N"  in  complex 
NP  structures  (cf.  20.1)  and  headless  relative  clauses  (20.11).  That 
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is,  for  the  [+specified]  gender  and  nunber,  the  allomorphy  rule 
correctly  supplies  the  masculine/ feminine  WM  depending  c«  a  given 
case  whereas  the  [-specified]  gender  and  ntmber,  i.e.  [o],  is 
supplied  in  accordance  with  'pro'  which  is  [-specified]  also  for 
gender  and  nunber. 


(20) 
.  1+WM  hombre  bueno 
lo 
I 
el 


el  bueno 

II. a.  1-i-WM  hombre  que  es  bueno 
lo 


el 


el  que  es  bueno 


b.  1+fcW  pro  bueno 
lo 


lo  bueno 


UR 
VJM 
Ap 


b.  1+fcW  pro  que  es  bueno  UR 

lo  WM 

Ap 


Pron 


lo  que  es  bueno 


Resistance  to  apocope  £tf*i/or  vowel  truncation  also  finds 
explanations  in  the  deletion  of  head  N".  Thus,  the  proposed 
analysis  predicts  the  application  of  vowel  truncation  in  forms  like 
la  AJjna  — >  el  Alma  'the  soul'  (where  there  is  no  intervening 
pronominalization) ,  but  lack  of  it  as  a  consequence  of  the 
intervening  pronominalization  la  Alma  p6ra  — >  la  pAra  'the  pure 
soul  ~>  the  pure  one'  (»el  p6ra)  and  la  Alma  Alta  — >  la  Alta  'the 
high  soul  — >  the  high  one'  ( «el  Alta) ,  provided  that  the  following 
[A]  is  not  a  head  of  N' '  or  that  the  specifier  has  a  zero  block  up. 
To  illustrate  the  distinction,  I  offer  the  derivation  of  el  Alto 
'the  tall  (one),  m. '  as  well  as  lo  Alto  'the  tall  (thing)',  and  la 
Alta  'the  high  (one),  f.'.  (Harris'  rule  cannot  distinguish  el  Alto 
vs.  lo  Alto  and  it  must  be  specified  as  [+prononiinal/  -Hnasculine]  to 
bar  the  incorrect  form  »el  Alta  for  la  Alta. 
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(21) 
a.  1+WM  chic+WM  dlt+WM 

1 .  lo   chico   ^to 

2.  1 
3. el 

.         i 

5. 

6. el  Alto 


b.  1+WM  pro  Alt+WM 
lo   pro  Alto 


lo  Alto 


M  Alm+WM  Alt+WM 

ALna 

Alta  WM 

- 

Ap 

e 

Bp 
Pr 

- 

VT 

Alta 

SR 

T^e  WMs  of  all  the  specifiers  are  spelled  out  as  expected.  Apocope 
is  applicable  to  only  specifiers  before  a  non-null  mascxiline 
singular  Head  N"  ,  thereby  ruling  out  any  change  in  (21b,  c). 
Epenthesis  automatically  adjusts  the  apocopated  specifier. 
Pronominalization  has  the  effect  of  deleting  the  head.  Thei'efore, 
the  resultant  forms  of  (21a,  c)  at  this  point  are  el  Alto  and  la 
Alta,  respectively.  At  step  5,  vowel  truncation  cannot  take  effect 
in  any  of  the  three  representations  because  the  specifiers  no  longer 
stand  before  a  Head  N' '  .  Nevertheless,  in  case  there  is  no 
pronominalization  in  step  4,  the  prediction  of  vowel  deletion  on  /la 
Alma/,  yielding  61  Alma  is  bome  out  given  an  overt  head. 


Finally,  the  proposal  made  in  this  paper  additionally  accounts 
for  the  fact  that  certain  quantifiers  and  deictics  are  sometimes 
used  as  postnominal  adjectives  with  an  appeu-ent  resistaixje  to 
apocope  in  the  example  in  (22.1)  and  with  the  co-occurrence  of  a 
determiner  and  a  quantifier/deictic  in  both  examples. 


(22) 

I.  a. 

un  hombre 

'a  man' 

~>  b.  el  hombre  Ano 

'the  #1  man 

11.  a. 

este  hombre 

'this  man 

— >  b.  el  hombre  *ste 

'this  man' 

Specifically,  (22. I. a)  contrasts  (22. I. b)  in  meaning  since  'uno'  has 
two  (or  more)  separate  lexical  entries:  'a/an,  indefinite 
article/quantifier',  and  'nunber  one',  among  others,  smd  each  takes 
up  its  corresponding  place  as  specified  in  the  lexicon.  Tliat  is,  the 
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quantifier  precedes  the  noun  and  is  sensitive  to  apocope  while  the 
latter  is  employed  as  a  postncminal  modifier.  No  movement  is 
involved  here  (cf.  *el  hombre  6n).  However,  for  (22.II.b),  the 
specifier  moves  out  of  its  original  position  to  obtain  more  stress 
relative  to  the  head  of  NP.  Tliis  is  patterned  after  the  structure 
Head  N '  '  ■*■  AdJ .  where  the  head  is  not  the  peak  of  the  prosodic 
phrase  but  the  attribute  is  since  Spanish  is  focus-final. '"  This  is 
confirmed  by  the  fact  that  contrastive  adjectives  that  differentiate 
the  nouns  they  modify  from  the  rest  in  their  classes  are  postncminal 
and  ne\er  prenominal ,  e.g.  el  profesor  generoso  ' the  generous 
professor',  given  that  preposed  adjectives  cannot  make  such  a 
contrastive  distinction.  The  only  effect  achieved  by  the  stylistic 
preposing  of  adjectives  is  to  emphasize  the  specific  quality 
conveyed  by  the  meaning  of  the  employed  adjective,  e.g.  el  generoso 
profesor.  Tliis  is  done  via  the  marked  word  order,  i.e.  Ad.j.  ■»  Head 
N'  '  which  is  irregular  for  descriptive  adjectives,  and  not  by  the 
change  of  the  stressed  element.  Tlius,  preposed  adjectives  are 
prosodically  procliticized  to  the  stressed  unit  which  usually  is  at 
the  end  of  a  prosodic  phrase. 


5.  Conclusion 


I  hope  to  have  outlined  a  full  account  of  the  specifier  class  in 
order  to  supplement  Harris'  1987  proposal.  The  distribution  in  forms 
of  these  specifiers  is  independently  motivated  as  suj^port  can  be 
drawn  from  other  related  phenomena:  apocope,  epenthesis,  and 
pronominal ization  of  specifiers' ^  regulated  by  the  postlexical 
phonologj'  and  syntax.  It  is  also  to  confirm  Harris'  claim  that  the 
change  of  the  definite  article  from  la  to  el  before  (A]  is  not  dOT»e 
via  a  disagreement  rule  or  referral  rule.  The  apocope  and  the 
prosody  for  the  truncaticm  of  the  masculine  WM  and  the  addition  of 
an  initial  syllable  of  the  UR  H-WM,  respectively,  can  be  translated 
in  syntactic  terms  in  that  these  morpho-phono-syntax  interactions 
are  sensitive  to  the  Spec  position  and  a  nonnull  mEiscuIine  singular 
Head  N"  . 


174 


NOTES 


*  I  wish  to  thank  Michael  Kenstowicz  for  his  valuable  connents 
on  an  earlier  draft  of  this  paper.  All  errors  of  fact  or 
interpretation  are  my  responsibility. 

*  The  problem  struck  rae  as  interesting  and  I  was  able  to  draw  a 
lot  of  generalizations  while  I  was  working  on  the  relations  between 
stress  assignment  and  apocope  in  forms  like  algdmo  — >  alg6n.  This 
paper  reacts  to  a  later  reading  of  Harris  1987. 

'  In  suidition,  Posner  1985  approaches  the  problon  from  a 
diachronic  viewpoint. 

3  The  use  of  a  written  accent  mark  in  this  paper  is  simply  to 
indicate  the  stressed  syllable.  It  is  not  intended  to  correspond  to 
the  orthographic  accent  mark  in  Spanish,  ^k)reov•e^,  phonetic 
transcription  is  employed  when  it  is  necessary  to  illustrate  a  given 
point. 

*  I  employ  the  feature  WM  to  cover  also  the  apocope  found  in 
word-final  vowel  other  than  [o).  Examples  are  grande  'big,  adj.  m. 
f.',  and  cualquier-a  'whatever,  adj.  m.  f.'  which  are  sporadically 
apocopated  to  gran,  and  cualquier,  respectively,  before  a  nominal  of 
either  gender.  TTiese  two  lexical  items  are  highly  idiosyncratic  to 
the  proposed  generalization  here  given  that  pre-  vs.  post-nominal 
grande  do  contrast  in  meaning,  e.g.  gran  hombre  'great  man'  vs. 
hombre  gi-ande  'big  man'.  Moreover,  the  Possible  Syllable  Structure 
Condition  would  rule  out  apocope  in  grande  — >  »grand  since  it  would 
end  in  a  consonant  cluster.  Two  alternatives  are  then  available 
here.  Grande  either  undergoes  cluster  simplification  by  dropping  out 
the  final  consonant  or  it  breaks  from  the  preceding  syllable  to  form 
a  heterosyllable  with  an  epenthetic  [e].  The  results  are  gran  and 
the  form  apparently  resistant  to  apocope  grande,  e.g.  gran/grande 
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hombre ,  gran/grande  ocas i An  'great  occasion,  f.',  gran/grande 
sacrificio  'imnense  sacrifice,  m. ' ,  gran/grande  desgracia 
'tremendous  misfortune,  f.'  However,  according  to  Ramsey  1956,  this 
is  due  to  the  fact  that  "the  natural  effect  of  emphasis  is  to  give 
to  a  word  its  fullest  form."  (Ramsey  1956:137).  Additionally,  the 
plural  form  of  cualquiera  is  the  exceptional  apocopated  cualesquier 
preceding  a  plural  nominal  and  the  full  form  cualesquiera  both 
before  and  after  a  noun.  Another  idiosyncratic  apocope  is  found  in 
sAn  from  siLnto  which  retains  its  full  form  in  the  phonological 
environment  of  to-  or  do-  of  a  following  head  of  MP,  e.g.  SAn  PAblo, 
SAn  AntAnyo  but  SAnto  TomAs,  SAnto  Domingo,  etc. 

*  Unlike  English,  a  series  of  two  or  iDore  adjectives  must  be 
coordinated  by  x  'and',  in  Spanish. 

*  Harris  1983  proposes  depalatalization  to  account  for  the 
relationship  of  underlying  [11]  to  surface  [1]  and  eQso  [fi]  — >  [n]. 

'  Following  Harris  1987,  I  leave  the  problem  of  petlatalization 
in  this  case  to  future  research. 

*  Although  the  expression  lo  hombre  'the  manhood'  may  be 
acceptable  to  some  speakers,  it  is  not  to  be  confused  with  the 
ungrajunatical  form  here.  Lo  hombre  patterns  after  lo  4  ad.) . ,  e.g.  lo 
mejor  'the  best  part',  and  the  nominal  hombre  here  is  adjectivalized 
by  zero  affixation.  The  noun  must  be  employed  as  an  adjective.  One 
coraroon  example  is  lo  cabal  lero  'characteristics  of  being  a 
gentleman'.  Some  speakers  accept  less  coomon  exsunples  such  as  lo 
nifto  'childishness',  and  lo  aujer  'characteristics  of  being  a 
wanan' ,  while  others  reject  them. 

»  Such  a  transformation  is  possible  since  there  is  no  real 
change  in  meaning  between  the  full  NP  and  the  pronominal i zed 
version.  Tliat  is,  the  deleted  noun  is  always  recoverable  at  the 
discourse  level,  e.g.  el  hombre  bueno  'the  good  man'  vs.  el  bueno 
'  the  good  one ' ,  las  canciones  antiguas  '  the  old  songs '  va .  las 
antiguas  'the  old  ones,  etc.  In  lo  bueno,  on  the  contrary,  no 
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transformation  is  claimed  given  the  presence  of  a  null  head  both 
underlj-ingly  and  on  the  surface.  The  contrast! ve  meaning  el  bueno 
vs.  lo  bueno  is  in  its  underlying  overt  vs.  'pro'  head  of  NP. 

»o  At  the  sentential  level,  Contreras  1978  claijns  that 
Spanish  is  'focus-final'.  By  implication,  this  may  also  apply  to 
phrasal  units.  I  hope  this  proposal  will  stimulate  future 
investigations . 

'I  Harris'  1987  morphological  explanation  of  cases  of  apparent 
application  of  the  el-la  replacement  in  compound  nouns  such  as  el 
aguafu^rte  'nitric  acid'  (from  Agua-ff u^rte ) ,  el  aguamAla  'jellyfish' 
(from  Agua+mAla ) ,  and  in  diminutives,  e.g.  el  agOlta  'water,  dim.' 
(from  JLgua-t-lta)  ,  el  amita  'mistress,  dim.'  (from  Ama-Uta)  reflects  a 
stratified  lexicon,  similar  to  the  OTie  suggested  in  Wong-opasi 
1987a.  We  must  either  claim  that  these  items  are  formed  at  the 
syntactic  level  or  they  reenter  the  lexicon  with  the  article 
assigned  to  the  base  representations.  "Hie  former  positicxi  is 
probably  assumed  by  Harris  for  his  distinction:  stem-  vs.  word-level 
affixation,  while  lexical ization  in  the  latter  view  is  allowed  in 
Wong-opasi  1987a. 
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SYLLABLE  PATTERNS  IN  LEVANTINE  ARABIC* 

Issam  M.  Abu-Salim  and  Hassan  R.  Abd-el-Jawad 

A  quantitative  analysis  of  the  actual  word  syllable-patterns  that 
occur  phonetically  in  Levantine  Arabic  is  given  in  this  article.  The 
analysis  is  based  on  an  examination  of  the  syllable  structure  of  all  the 
words  found  in  Stowasser  and  Ani  1964.  The  frequency  and 
distribution  of  the  five  phonetic  syllable  types  in  all  the  syllable 
patterns  are  calculated  and  discussed.  It  is  shown  that  the  frequency 
and  distribution  of  syllables  are  determined  by  their  weight  and  the 
degree  to  which  they  are  altered  by  phonological  rules.  It  is  also 
shown  that  the  application  of  phonological  rules  may  be  influence  in 
some  cases  by  the  availability  or  absence  of  certain  syllable  patterns. 

Introduction 

This  study  aims  at  describing  and  analyzing  the  syllable  structure  of 
words  in  Levantine  Arabic  (henceforth,  LA)  in  terms  of  the  syllable  patterns  they 
have.  The  term  'syllable  pattern'  is  used  here  to  refer  to  the  number  and  type  of 
syllables  words  consist  of.  Words  like  katab  'he  wrote',  ?akal  "he  ate',  and 
samak  'fish',  for  instance,  have  the  bisyllabic  pattern  CV.CVC,  whereas  words 
like  katab-u  'they  wrote',  ?akal-o  'he  ate  it  (m.)',  and  samak-e  'a  fish'  have  the 
trisyllabic  pattern  CV.CV.CV.  All  the  phonetic  syllable  patterns  found  in  LA 
along  with  examples  illustrating  them  are  given  in  the  appendix  at  the  end  of 
this  paper. ■> 

The  data  used  for  this  research  was  drawn  primarily  from  Stowasser  and 
Ani  1964.2  All  the  words  in  this  dictionary  were  examined  in  order  to  find  out 
all  the  possible  combinations  of  syllables  that  LA  words  may  consist  of.  In 
addition,  we  had  to  consider  inflected  and  derived  forms  of  lexical  items  so  as 
to  exhaust  all  the  possible  syllable  patterns.  These  included  the  following: 

a)  The  regular  dual  and  plural  forms  of  nouns  and  adjectives,  which  are 
formed  by  the  addition  of  the  dual  and  plural  suffixes,  respectively,  to  the 
singular  forms,  as  in  maktab-een  'two  offices',  ta9ban-iin  'tired  (m.pl.)',  and 
za9/an-aarangry  (f.pl.)'. 

b)  The  possessive  forms  of  singular,  dual,  and  plural  nouns,  which  are 
formed  by  the  addition  of  pronominal  suffixes  to  the  nominal  stems,  as  in 
Tawilt-o  'his  table*,  Taawlit-ha  'her  table',  and  Taawl-aat-hum  'their  tables'. 


c)  The  definite  forms  of  nouns  and  adjectives,  which  are  formed  by  the 
addition  of  the  definite  article  /I-/  to  the  nondefinite  form,  as  in  ?il-walad  'the 
boy'  and  ?il-9aali  'the  high  (m.sg.)'.^ 

d)  The  conjugated  forms  of  perfect  and  imperfect  verbs  with  all  the 
possible  subject  and  object  pronominal  affixes,  such  as  sa?l-at  'she  asked', 
sa?al-u  'they  asked',  sa?al-ha  'he  asked  her',  sa?al-uu-hum  'they  asked  them', 
sa?l-at-o  'she  asked  him',  bi-s?al'he  asks',  and  bi-s?al-u  'they  ask".  Verbs  with 
the  negative  suffix  /-$/  were  also  considered,  such  as  bi-s?al-i$^  'he  doesn't 
ask',  and  bi-s?al-uu-$'ihey  don't  ask'. 

The  phonetic  syllable  types  that  occur  in  LA  are  CV,  CVV,  CVC,  CVVC, 
and  CVCC  (Abu-Salim  1982,  Cowell  1964,  Haddad  1984.  Kenstowicz  1985, 
1986).  These  syllable  types  are  illustrated  by  the  following  examples: 

(i: 


CV 

katab 

'he  wrote' 

CVV 

Taalab 

'he  demanded' 

CVC 

mal9ab 

'playground' 

CVVC 

muftaaH 

'key' 

CVCC 

?akaltha 

•1  ate  it  (f.)' 

Heavier  syllables,  such  as  CVVCC,  do  not  occur  in  LA  at  the  phonetic 
level.  The  CVVCC  syllable  may,  however,  arise  at  the  underlying  level,  but  it  is 
subsequently  reduced  to  CVCC  by  a  rule  of  nucleus  reduction  that  applies  as 
part  of  the  syllabification  mechanism  to  ensure  that  the  output  is  properly 
syllabified,  given  the  five  phonetic  syllable  types  above.  A  word  like  $afilha  {< 
/$aaf-l-haa/)  'he  saw  for  her'  is  accordingly  derived  as  follows: 

(2)     a.      /$  a  a  f-l-h  a  a/ 


b.       $  a  a  f  I  h  a  a  syllabification 


Aa 


c.       $  a  f  I  h  a  a  nucleus  reduction 


k 


d.       $afilha  other  rules 


3 

In  (2a)  the  dative  suffix  /-I/  can  on'y  be  syllabified  in  the  coda  of  the  first 
syllable  due  to  a  constraint  limiting  the  number  of  consonants  in  the  onset  to 
just  one.  This  syllabification  method  results  in  a  CVVCC  syllable  which  is  not 
allowed  phonetically  as  mentioned  earlier.  The  nucleus  reduction  rule  sub- 
sequently applies  to  the  outcome  of  (2b),  changing  the  CVVCC  syllable  into  the 
acceptable  CVCC  syllable.  (For  more  details  on  this  analysis  see  Abu-Salim 
1982,  and  for  an  alternative  analysis  see  Kenstowicz  1986.) 

The  examination  of  all  the  words  considered  for  this  study  showed  that 
words  in  LA  may  consist  of  1-7  syllables.  Theoretically,  the  number  of 
combinations  of  the  five  syllable  types  into  words  1-7  syllables  long  is  5  , 
which  is  equal  to  78,125  different  syllable  patterns.  But  the  actual  phonetic 
syllable  patterns  found  in  LA  are  just  440,  which  is  only  0.56%  of  the 
theoretically  possible  patterns.  This  low  percentage  of  the  actually  occurring 
phonetic  syllable  patterns  to  the  theoretically  possible  patterns  is  due  to  a 
number  of  factors,  one  of  which  is  the  application  of  phonological  rules.  We 
will  see  that  rules  such  as  vowel  shortening  and  syncope  result  in  reducing 
words  and,  consequently,  decreasing  the  number  of  syllable  patterns  in  LA. 
The  diachronic  loss  of  inflectional  endings  and  the  neutralization  of  some 
affixes  and  function  words  may  also  be  cited  as  possible  explanations  for  the 
small  number  of  syllable  patterns  currently  found  in  LA.  It  is  beyond  the  scope 
of  this  study,  however,  to  discuss  the  diachronic  factors  in  detail. 

The  remainder  of  this  paper  is  organized  as  follows:  First,  the  frequency 
of  the  five  syllable  types  in  the  440  actual  syllable  patterns  is  given.  This  is 
followed  by  a  section  on  the  frequency  of  the  seven  syllable-pattern  categories. 
The  distribution  and  frequency  of  every  syllable  type  in  each  syllable-pattern 
category  and  each  position  of  the  word  are  then  given  and  briefly  discussed. 

Syllable  frequency 

The  total  number  of  syllables  found  in  the  440  patterns  is  1910  syllables  of 
the  five  types  given  above.  The  frequency  of  each  syllable  type  was  determined 
by  calculating  the  number  of  its  occurrences  in  every  category  of  word-size 
patterns  (monosyllabic,  bisyllabic,  etc.).  In  the  five  monosyllabic  word  patterns, 
each  syllable  occurs  only  once,  so  the  five  syllable  types  may  be  said  to  have 
the  same  frequency  in  this  category  of  word  patterns.  In  longer  syllabic 
patterns,  the  five  syllable  types  differ  in  their  frequency,  as  shown  in  Table  1 
(next  page),  where  M,  B,  T,  Q.  P,  H.  and  S  are  used  to  stand  for  mono-,  bi-,  tri-, 
quadri-,  penta-,  hexa-,  and  septasyllabic  word  patterns,  respectively. 

The  table  shows  that  the  CV  syllable  is  by  far  the  most  frequent  (701 
occurrences)  and  that  the  CVCC  syllable,  as  expected,  the  least  frequent  (130 
occurrences).  The  second  frequent  syllable  is  the  CVC  syllable,  followed  by  the 
CVVC,  and  then  the  CVV  syllable.  That  the  CV  syllable  is  the  most  frequent  and 
the  CVCC  syllable  the  least  frequent  does  not  require  much  of  an  explanation. 
The  CV  syllable  is  crosslinguistically  the  most  natural,  or  least  marked,  whereas 


the  CVCC  syllable  is  much  less  natural,  or  more  marked  (Sloat  et  al. 
Cairns  and  Feinstein  1982).5 


1978, 


M 

B 

T  Q 

P   H 

S  Total 

cv 

1 

10 

68  174 

272  139 

37  701 

cvv 
cvc 

1 
1 

7 
10 

40  50 
61  153 

55  22 
237  145 

9  184 
45  652 

cvvc 

1 

9 

43  81 

81   23 

5  243 

CVCC 

1 

8 

31  46 

35   7 

2  130 

Total 

5 

44 

243  504 

680  336 

981910 

Table  1:     Syllable  frequency 

But  the  lower  frequency  of  the  CVV  syllable,  when  compared  to  that  of  the 
CVVC  syllable,  requires  an  explanation.  One  reason  behind  the  relatively  low 
frequency  of  the  CVV  syllable  is  that  it  systematically  undergoes  a  vowel- 
shortening  rule,  whereby  its  long  vowel  is  reduced  to  short,  thus  changing  a 
CVV  syllable  into  CV  and,  consequently,  increasing  the  frequency  of  the  CV 
syllable  relative  to  that  of  the  CVV  syllable,  in  particular,  and  the  other  syllables, 
in  general.  In  Palestinian  Arabic,  for  instance,  CVV  syllables  are  generally 
shortened  when  they  occur  in  unstressed  weak  positions:  baab  'door',  babeen 
(<  /baa. been/)  'two  doors',  mabsuuT  'happy  (3  m.sg.),  mabsuTiin  (< 
/mab.suu.Tiin/)  'happy  (3  m.pl.)',  sarfi9a  'fast  (3  f.sg.)',  sahSaat  {<  /sa.rii.9aat/) 
'fast  (3  f.pl.)'.  The  CVV  syllable  also  undergoes  reduction  in  word-final  position: 
sa?al-u  'they  asked"  (cf.  sa?al-uu-ha  'they  asked  her'),  sa?al-ni  'he  asked  me' 
(cf.  ma-sa?al-nii-$  'he  didn't  ask  me').  (For  more  information  on  vowel 
shortening  in  Palestinian  Arabic  see  Abu-Salim  1986.)  Note  that  long  vowels 
shorten  only  in  weak  open  syllables;  in  checked  syllables,  they  remain  long. 

Table  1  also  shows  that  there  is  no  one-to-one  correspondence  between 
syllable  weight  and  frequency  of  occurrence.  Although  both  CVV  and  CVC 
syllables  are  considered  as  having  the  same  weight  (McCarthy  1979),  the  CVC 
syllable  is  systematically  more  frequent  in  all  syllable  patterns.  In  addition  to  the 
vowel-shortening  rule  which  lowers  the  frequency  of  the  CVV  syllables  as 
illustrated  above,  there  is  a  syncope  rule  in  LA  that  increases  the  frequency  of 
the  CVC  syllable.  This  rule  deletes  short  high  vowels  in  unstressed  open 
syllables  (Brame  1973,  1974,  Cowell  1964,  Haddad  1984,  Kenstowicz  and 
Abdul-Karim  1980,  among  others)  as  the  following  examples  show: 


(3)  a.  $irib  'he  drank' 
b.  $irb-at  'she  drank' 

(4)  a.  fihim  'he  understood' 

b.  fi'hm-ak  'he  understood  you  (m.sg.)' 

(3b)  and  (4b)  have  the  underlying  representations  /$irib-at/  and  /fihim-ak/, 
respectively,  and  both  have  the  underlying  syllable  pattern  CV.CV.CVC.  When 
syncope  applies  to  the  penultimate  vowel  in  both  examples,  the  onset  of  the 
second  syllable  is  resyllabified  with  the  preceding  syllable,  thus  resulting  in  the 
phonetic  syllable  pattern  CVC.CVC,  with  an  Initial  CVC  syllable  instead  of  the 
underlying  CV  syllable. 

The  same  discrepancy  holds  true  for  CVVC  and  CVCC,  which  are  also 
viewed  as  having  the  same  weight  (McCarthy  1979).  The  former  syllable  is 
systematically  more  frequent  than  the  latter  in  all  syllable  patterns.  This  is  partly 
due  to  the  application  of  the  syncope  rule  mentioned  above,  which  results  in 
creating  CVVC  syllables  from  underlying  CVV  plus  the  onset  of  the  following 
syncopated  CV  syllable,  as  the  following  examples  show: 


(5)     a  raa.ylH  'going  (3  m.sg.)' 

b.  raay.H-a  'going  (3  f.sg.)' 

c.  raay.H-fin  'going  (3  m.pl.)' 

d.  raay.H-aat  'going  (3  f.pl.)' 


The  underlying  form  of  the  stem  in  (5b,c,d)  is  /raayiH-/,  which  has  an  initial  CVV 
syllable  followed  by  a  CV  syllable  (H  is  syllabified  in  the  onset  of  the  last 
syllable.).  The  application  of  syncope  and  resyllabification  to  the  underlying 
forms  results  phonetically  in  initial  CVVC  syllables,  as  the  following  partial 
derivation  of  (5c)  shows: 

(6)  /raayiHiin/ 

raa.yi.Hiin  syllabification 

raa.yi.Hiin  stress 

raa.y.Hiin  syncope 

raay.Hfin  resyllabification 

Another  phonological  rule  that  lowers  the  frequency  of  CVCC  and, 
consequently,  increases  the  relative  frequency  of  CVVC  syllables  is  epenthesis. 
In  LA,  this  rule  inserts  a  high  front  vowel  between  two  unlike  consonants  when 
they  occur  in  word-final  position  or  followed  by  a  consonant-initial  suffix  (Abu- 
Salim  1980,  Brame  1973,  Haddad  1984,  Kenstowicz  and  Kisseberth  1979, 
among  others).  In  other  words,  the  heavy  coda  of  a  CVCC  syllable  is  split  up  by 
an  epenthetic  vowel  in  the  environment  described  above,  as  illustrated  by  the 
following  examples: 

(7)  a.      jfsir  'bridge' 

b.  jisir-na      'our  bridge' 

c.  jisr-een     'two  bridges' 


Epenthesis  and  resyllabification  in  (7a,b)  result  in  the  decomposition  of  an 
underlying  CVCC  syllable  into  phonetically  two  syllables  of  the  form  CV  and 
CVC.  This  has,  of  course,  the  consequence  of  increasing  the  frequency  of  both 
the  CV  and  CVC  syllables  and  decreasing  the  frequency  of  the  CVCC  syllable 
compared  to  that  of  the  CVVC  syllable. 

To  sum  up,  a  number  of  phonological  rules  conspire  to  bring  about  the 
discrepancy  observed  in  the  frequency  of  the  heavy  syllables  CVV  and  CVC,  on 
the  one  hand,  and  the  superheavy  syllables  CVVC  and  CVCC,  on  the  other. 

Syllable-pattern  frequency 

Table  2  below  gives  the  number  of  syllable  patterns  in  each  word-size 
category  and  its  percentage  relevant  to  the  total  number  of  all  syllable  patterns. 


Word  category 

No.  of  patterns 

Percentage 

Monosyllabic 

5 

1.13% 

Bisyllabic 

22 

5.00% 

Trisyllabic 

81 

18.41% 

Quadrisyllabic 

126 

28.64% 

Pentasyllable 

136 

30.91% 

Hexasyllabic 

56 

12.73% 

Septasyllabic 

14 

3.18% 

Table  2:     Syllable-pattern  frequency 

This  table  shows  that  the  pentasyllable  patterns  are  the  most  frequent  and 
the  monosyllabic  patterns  are  the  least  frequent.  The  order  of  the  other  patterns 
in  terms  of  frequency  is  as  in  the  following  hierarchy: 

Most  frequent  Least  frequent 

P       Q        T        H        B        8        M 

Table  2,  however,  does  not  imply  anything  about  the  frequency  of  words 
under  each  of  the  440  syllable  patterns.  It  simply  compares  the  number  of 
syllable  patterns  in  each  word-size  category  to  those  of  the  other  categories  and 
to  the  total  number  of  patterns  in  all  categories. 


Syllable  distribution 

In  this  section,  the  distribution  of  every  syllable  type  in  word-initial,  medial, 
and  final  is  examined  in  order  to  find  out  the  frequency  of  these  syllables  in  the 
three  word  positions  and  to  see  if  there  are  any  constraints  governing  their 
distribution  in  all  syllable  patterns. 

a)  The  CV  syllable 

Table  3  below  gives  a  summary  of  the  number  of  occurrences  of  the  CV 
syllable  in  word-initial,  medial,  and  final  position  in  all  syllable  patterns,  except, 
of  course,  the  monosyllabic  patterns.  The  table  shows  that  there  are  no 
constraints  on  the  distribution  of  the  CV  syllable  in  any  position  in  any  word 
patterns.  It  occurs  freely  in  initial,  medial,  and  final  position  in  all  word  sizes. 
The  CV  syllable  is,  of  course,  more  frequent  in  the  pentasyllable  word  patterns 
because,  as  we  have  seen  above,  the  pentasyllable  patterns  are  the  most 
frequent  of  all  the  syllabic  patterns.  It  is  also  more  frequent  in  medial  position 
due  to  the  fact  that  in  long  words  there  are  more  medial  than  initial  or  final 
syllables.  This  is  in  harmony  with  the  fact  that  this  syllable  type  is  the  most 
natural  and  least  marked  syllable  in  language. 


B 

T 

Q   P 

H 

S  Total 

1 

5 

24 

59  86 

42 

4  220 

M 

- 

20 

75  148 

84 

27  354 

F 

5 

24 

40  38 

13 

6  126 

Total 

10 

68 

174  272 

139 

37  700 

Table  3:    CV  distribution 

b)  The  CVV  syllable 

There  are  some  constraints  governing  the  distribution  of  the  CVV  syllable 
in  LA.  The  distribution  of  this  syllable  type  is  summarized  in  Table  4  (next 
page).  The  table  shows  that  this  syllable  is  less  frequent  in  final  position  than  in 
the  other  two  positions,  and  that  it  is  much  more  frequent  in  medial  position  than 
in  the  other  two  positions.  There  are,  moreover,  some  restrictions  on  its  occur- 
rence in  long  words.  It  does  not  occur  in  initial  position  in  hexa-  and  septa- 
syllabic  words,  and  in  final  position  it  is  limited  to  just  the  bi-  and  trisyllabic 
words.  Table  4  also  shows  that  this  syllable  is  more  frequent  in  pentasyllable 
patterns  than  in  the  other  patterns,  and  that  it  is  least  frequent  in  bi-  and 
septasyllabic  patterns. 


B 

T 

Q 

P 

H 

S  Total 

1 

4 

16 

11 

2 

- 

-      33 

M 

- 

14 

39 

53 

22 

9    137 

F 

3 

10 

- 

- 

- 

-      13 

Total 

7 

40 

50 

55 

22 

9    183 

Table  4:    CVV  distribution 

The  reason  behind  the  high  frequency  of  the  CVV  syllable  in  nnedial 
position  is  that  there  are  more  medial  than  initial  or  final  syllables  in 
multisyllabic  words.  The  low  frequency  of  this  syllable  in  final  position,  on  the 
other  hand,  is  due  to  the  vowel-shortening  rule  mentioned  earlier.  In  final 
position,  underlying  CVV  syllables  in  Arabic,  along  with  the  CVC  syllables, 
behave  as  light  for  purposes  of  stress  assignment,  and,  therefore,  they  do  not 
receive  stress  (Kenstowicz  1983,  McCarthy  1979,  1980).  The  absence  of  stress 
on  final  CVV  syllables  leads  to  their  reduction  to  CV  by  vowel  shortening  (Abu- 
Salim  1980),  as  illustrated  by  the  following  examples: 

(8)  a.      rama         (</ramaa/)        'he  threw' 
b.      katabu       {<  /katabuu/)     'they  wrote' 

The  only  situation  where  CVV  syllables  remain  phonetically  strong  in  final 
position  is  when  they  are  underlyingly  followed  by  a  segment  that  is  later 
deleted.  In  Brame  (1971)  and  McCarthy  (1979),  for  instance,  examples  like 
ramaa  "he  threw  it  (m.)'  and  ?axuu  'his  brother'  are  analyzed  as  having 
underlying  structures  with  final  /h/,  which  means  that  at  that  point  such 
examples  have  final  CVVC  syllables.  Stress  is  first  assigned  to  the  final 
superheavy  syllable,  followed  by  the  deletion  of  the  final  /h/.  This  results  in  a 
representation  with  a  final  stressed  CVV  syllable,  to  which  vowel  shortening 
cannot  apply,  as  illustrated  by  the  following  derivation  of  ramaa:  (For  more 
details  on  /h/  deletion  see  Brame  (1971)  and  McCarthy  1979.) 

(9)  /ramaah/ 

ramaah  stress 

ramaa  /h/  deletion 


vowel  shortening 


c)  The  CVC  syllable 

This  syllable  is  like  the  CV  syllable  in  its  distribution:  There  are  no 
constraints  whatsoever  on  its  occurrence  In  any  position  in  the  word  or  in  the 
syllabic  patterns,  as  the  following  table  shows: 


B 

T 

Q   P 

H 

S  Total 

1 

5 

22 

47  46 

12 

10  142 

M 

18 

71  146 

107 

31  373 

F 

5 

21 

35  45 

26 

4  136 

Total 

10 

61 

135  237 

145 

45  651 

Table  5:     CVC  distribution 

This  table  shows  that  the  CVC  syllable  occurs  freely  in  word-initial,  medial, 
and  final  position  regardless  of  the  size  of  the  syllabic  pattern.  It  also  shows  that 
the  CVC  syllable  is,  as  expected,  more  frequent  in  medial  than  in  other 
positions,  and  also  more  frequent  in  the  pentasyllabic  patterns  than  in  the 
others.  The  apparent  explanation  for  this  distribution  is  that  there  are  more 
medial  than  final  or  initial  word  positions,  especially  in  quadrisyllable  and 
longer  patterns.  The  CVC  syllable  is  also  expected  to  occur  more  in  the 
pentasyllabic  patterns  since  they  are  the  most  frequent  in  LA. 

A  comparison  between  the  two  heavy  syllables  CVV  and  CVC  shows  that 
the  former  is  by  far  more  limited  in  distribution.  We  have  seen  that  the  vowel- 
shortening  rule  is  largely  responsible  for  this  limitation. 

d)  The  CVVC  syllable 

More  constraints  are  found  on  the  distribution  of  superheavy  syllables. 
Table  6  below  summarizes  the  distribution  of  the  CVVC  syllable: 


p 

1      4 

T 
11 

Q 

4 

P 

H 

S  Total 
-   19 

M 

15 

44 

49 

11 

1  120 

F     5 

17 

33 

32 

12 

4  103 

Total  9 

43 

81 

81 

23 

5  242 

Table  6:     CVVC  distribution 

This  table  shows  that  this  syllable  is  less  frequent  in  word-initial  position 
than  in  medial  or  final  position.  It  does  not  occur  at  all  in  initial  position  in  words 


10 

5-7  syllables  long.    It  is  also  less  frequent  in  the  long  hexa-  and  septasyllabic 
patterns  than  in  the  shorter  patterns. 

e)  The  CVCC  distribution 

There  are  also  some  constraints  governing  the  distribution  of  this 
extraheavy  syllable,  as  shown  in  the  following  table: 


B 

T 

Q 

P 

H 

S  Total 

1 

4 

8 

5 

2 

2 

-      21 

M 

- 

14 

23 

12 

- 

2      51 

F 

4 

9 

18 

21 

5 

-      57 

Total 

8 

31 

46 

35 

7 

2    129 

Table  7:     CVCC  distribution 

This  syllable  is  not  as  frequent  in  initial  position  as  in  the  other  positions.  It 
does  not  occur  initially  in  the  septasyllabic  pattern,  and  its  frequency  in  initial 
position  in  the  penta-  and  hexasyllabic  patterns  is  very  low.  Unlike  the  other 
syllables,  this  syllable  is  most  frequent  in  final  position.  Moreover,  this  syllable 
is  more  frequent  in  patterns  3-5  syllables  long  than  in  the  other  patterns. 

One  reason  behind  the  low  frequency  of  this  syllable  is  the  epenthesis  rule 
mentioned  earlier,  which  inserts  a  high  front  vowel  between  the  two  consonants 
in  the  coda  of  this  syllable  when  they  are  nongeminate,  thereby  decomposing 
this  superheavy  syllable  into  light  CV  and  heavy  CVC  syllables  in  examples  like 
jisir  (<  /jisr/)  'bridge'.  This  rule  of  epenthesis  not  only  lowers  the  relative 
frequency  of  the  CVCC  syllable,  but  also  raises  that  of  the  other  lighter  syllables 
such  as  CVC. 

Conclusion 

We  have  tried  in  this  paper  to  give  a  statistical  analysis  of  the  phonetic 
syllable  patterns  that  occur  in  LA.  This  analysis  has  shown,  among  other  things, 
that  the  syllable  frequency  and  distribution  are  partly  determined  by  syllable 
weight:  the  lighter  the  syllable  the  most  frequent  it  is  and  the  fewer  the 
constraints  on  its  distribution.  This  generalization  works  well  for  the  light  CV 
and  the  superheavy  CVCC  syllables.  We  have  seen  that  the  CV  syllable  is  the 
most  frequent  and  that  there  are  no  constraints  whatsoever  on  its  distribution  in 
the  word.  The  CVCC  syllable,  on  the  other  hand,  is  the  least  frequent  and  has 
more  limitations  on  its  distribution. 


Another  factor  that  plays  a  major  role  in  determining  the  frequency  and 
distribution  of  syllables  is  the  application  of  phonological  rules.  We  have  seen 
that  the  vowel-shortening  rule  has  lowered  the  frequency  of  the  CVV  syllable  to 
the  extent  that  it  has  become  lower  than  that  of  its  heavy  counterpart  CVC  and 
even  lower  than  that  of  the  heavier  syllable  CVVC.  We  have  also  seen  that  the 
syncope  rule  has  lowered  the  frequency  of  the  CV  syllable  and  simultaneously 
raised  that  of  the  CVC  syllable,  thus  narrowing  the  difference  in  the  frequency  of 
the  two  syllables.  The  same  holds  true  for  the  CVCC  syllable  whose  frequency 
is  reduced  and  whose  distribution  limited  by  the  epenthesis  rule. 

The  application  of  phonological  rules  may,  on  the  other  hand,  be 
influenced  by  the  phonetic  syllable  patterns  found  in  the  language.  The  applic- 
ability or  inapplicability  of  certain  phonological  rules  may  be  triggered  by  the 
presence  or  absence  of  certain  syllable  patterns.  The  vowel-shortening  rule 
mentioned  earlier  shortens  the  long  vowel  of  a  CVV  syllable  when  it  occurs  in  a 
weak  metrical  position  (Abu-Salim  1986,  Kenstowicz  1983).  This  rule  applies  to 
examples  such  as  SaHibto  (<  /SaaHibato/)  'his  friend  (f.)',  where  the  underlying 
initial  syllable  of  the  form  CVV  is  reduced  to  CV,  but  fails  to  apply  to  examples 
such  as  kaatabato  {<  /kaatabato/)  'she  corresponded  with  him',  where  the  initial 
CVV  syllable  escapes  shortening  although  it  occurs  in  a  weak  metrical  position. 
It  is  important  to  note  that  both  types  of  examples  have  the  same  underlying 
syllable  pattern  and  the  same  surface  stress  pattern.  In  spite  of  this,  the  rule  has 
applied  in  the  first  case  but  not  in  the  second.  Examples  such  as  kaatabato 
have  been  considered  exceptions  to  the  vowel-shortening  rule  (Abu-Salim 
1986).  This  exceptional  behavior  has  been  explained  as  follows:  If  vowel 
shortening  applied  to  kaatabato,  it  would  yield  the  structure  *katabato  with  the 
syllable  pattern  CV.CV.CV.CV.  which  does  not  phonetically  exist  in  LA.  The 
vowel-shortening  rule  has,  therefore,  been  blocked  so  as  not  to  yield  a  syllable 
pattern  that  does  not  phonetically  occur  in  the  language. 


NOTES 

*An  earlier  version  of  this  paper  was  presented  at  the  lirst  annual 
symposium  on  Arabic  Linguistics,  University  of  Utah,  Salt  Lake  City,  April  1987. 
We  would  like  to  thank  Hans  Henrich  Hock  and  Charles  Kisseberth  for  their 
valuable  comments  and  suggestions.  The  following  special  symbols  have  been 
used  in  the  transcription  of  Arabic  words:  /3/  =  voiceless  interdental  fricative.  /$/ 
=  voiceless  alveopalatal  fricative,  IZI  =  voiced  alveopalatal  fricative,  /8/  =  voiced 
velar  fricative,  /H/  =  voiceless  pharyngeal  fricative,  /9/  =  voiced  pharyngeal 
fricative,  /T/  =  emphatic  voiceless  alveolar  stop,  /D/  =  emphatic  voiced  alveolar 
stop,  and  /S/  =  emphatic  voiceless  alveolar  fricative. 

i|t  is  to  be  noted  here  that  not  all  the  examples  given  in  the  appendix  are 
found  in  one  particular  variety  of  LA. 
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2The  term  'Syrian  Arabic'  in  the  title  of  this  dictionary  is  used  to  refer  to  the 
'dialects  and  subdialects  spoken  by  the  sedentary  population  of  Greater  Syria', 
which  comprises  present-day  Jordan,  Lebanon,  Palestine,  and  Syria  (Stowas- 
ser  and  Ani  1964:i).  We  have  decided  to  use  the  term  "Levantine  Arabic"  in  the 
title  of  this  paper  so  as  not  to  limit  its  scope  to  just  the  dialect(s)  spoken  in 
present-day  Syria. 

3The  vowel  l\l  preceding  the  definite  article  l\l  is  inserted  in  some  colloquial 
Arabic  dialects  to  prevent  generating  syllables  with  two  consonants  in  the  onset. 
Since  vowel-initial  syllables  are  not  allowed  in  Arabic,  the  glottal  stop  /?/  is  then 
inserted  before  the  epenthetic  vowel.  Only  in  phrase-initial  position,  however, 
are  two-consonant  onsets  phonetically  possible  in  LA  (Abu-Salim  1980,  Cowell 
1964,  Haddad  1984,  Kenstowicz  1985).  Due  to  this  limitation  on  their  dis- 
tribution, we  decided  to  exclude  all  words  beginning  with  two  consonants. 

'^The  vowel  l\l  preceding  the  negative  suffix  /$/  is  also  epenthetic. 

SThere  have  been  some  proposals  for  excluding  the  CVCC  syllable  from 
being  part  of  the  underlying  syllable  inventory  of  Arabic.  McCarthy  1979,  for 
instance,  considers  this  syllable,  as  well  as  the  CVVC  syllable,  as  extraheavy, 
or  superheavy,  and,  based  on  the  constraints  on  its  distribution  in  Classical 
Arabic  and  in  some  colloquial  varieties  which  limit  its  occurrence  to  prepausal 
position,  he  proposes  the  derivation  of  this  syllable  from  underlying  CVC  plus  a 
following  stranded  unsyllabified  consonant.  This  consonant  is  later  adjoined  to 
the  preceding  heavy  syllable  by  a  rule  to  give  the  following  representation  for 
the  phonetic  CVCC  syllable:  ^ 


CVCC 

Selkirk  1981  also  argues  for  excluding  the  CVCC  syllable  from  the  underlying 
syllable  inventory  of  some  Arabic  dialects.  She  offers,  however,  a  different 
analysis,  where  the  final  consonant  is  treated  at  the  underlying  level  as  a 
syllable  by  itself,  but  with  an  empty  nucleus,  as  shown  below: 


Ax  A     hh 

CVC-C      orCVCC- 

The  empty  nucleus  is  later  filled  by  an  epenthetic  vowel  l\l  in  some  colloquial 
dialects  of  Arabic.  (For  more  details  on  these  proposals  see  McCarthy  1979 
and  Selkirk  1981.) 
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APPENDIX 

The  actual  syllable  patterns  in  LA 
1  =  CV.    2  =  CW.   3  =  CVC,   4  =  CWC,   5  =  CVCC 
Monosyllabic  patterns 

PATTERN    EXAMPLE  GLOSS 


1 

wa/?u 

•and' 

2 

$uu 

■what" 

3 

min 

■from' 

4 

haab 

'door' 

5 

bint 

'daughter,  girl' 

Bisyllabic  patterns 

1.1 

ma$a 

'he  walked,  went' 

1.2 

ramaa 

'he  threw  it  (m.)' 

1.3 

?asad 

•lion' 

1.4 

ma9aa$ 

'salary' 

1.5 

mamarr 

'aisle' 

2.1 

raaHu 

'they  went' 

2.3 

Zaamig 

'mosque' 

2.4 

baaloon 

•balloon' 

25 

saabaTt 

'1  raced' 

3.1 

?bra 

'needle,  pin' 

3.2 

$uftuu 

'you  (pi.)  saw  him" 

3.3 

maktab 

•office^ 

3.4 

fusTaan 

■dress' 

3.5 

SaTranZ 

'chess' 

4.1 

kaasto 

•his  glass' 

42 

taabloo 

'dashboard' 

4.3 

Taabtak 

■your  (m.sg.)  (foot)bair 

4.4 

SaaTriin 

'clever  (m.pl.)' 

5.1 

bi9tha 

•1  sold  it  (f.)' 

5.3 

?ultlak 

'1  toW  you  (m.sg.)' 

5.4 

?ingliiz 

'Englishmen^ 

5.5 

mist9idd 

•prepared  (m.sg.)' 

Trisyllabic  patterns 

1.1.1 

katabu 

1hey  wrote' 

1.1.2 

sa?aluu 

'they  asked  him' 

1.1.3 

masakak 

'he  caught  you  (m.sg.)' 

1.1.4 

Talabaat 

•orders,  application  forms' 

1.1.5 

makatapS 

•he  didn't  write' 

1.2.1 

?afii(iha 

'her  father' 

1.2.2 

nxibaaraa 

'match,  game' 

1.2.3 

bawaaxir 

•steamers' 

1.2.4 

wilaayaat 

'states' 

1.3.1 

maHaTTa 

•station' 

1.3.2 

masaktuu 

•you  (pi.)  caught  him' 

1.3.3 

sa?a«^ 

•|  asked  you  (m.sg.)' 

1.3.4 

'banister' 
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1.3.5 

tamanTa9$ 

'eighteen- 

1.4.1 

Habiibha 

•her  lover* 

1.4.2 

biraasluu 

they  correspond  with  him" 

1.4.3 

makaatbak 

'your  (m.sg.)  offices' 

1.4.4 

mubiilyaat 

'furniture' 

1.4.5 

bi$aawri$$ 

'he  doesn't  consult  him' 

1.5.1 

?akalha 

'1  ate  it  (f.)' 

1.5.2 

bikassmu 

"they  smash  rt  (m.)' 

1.5.3 

Tabaxthum 

'!  cooked  them' 

1.5.4 

?iliktroon 

'electron' 

1.5.5 

bakattbi$$ 

'1  don1  make  him  write" 

2.1.1 

booyaZi 

'bootblack' 

2.1.2 

raasakju 

'they  correpond  with  him" 

2.1.3 

?aabali< 

'he  met  you  (fsg.)' 

2.1.4 

saaba?uuk 

'they  raced  you  (m.sg.)' 

2.1.5 

baaromitr 

'barometer' 

2.2.1 

?aaraami 

'Aramaic' 

2.2.3 

beenaaton 

'between  them' 

2.2.4 

baaloonaat 

'balloons' 

2.3.1 

?aaballo 

'1  met  him' 

2.3.2 

Taawa9tii 

"you  (f.sg.)  obeyed  him' 

2.3.3 

jaawabtak 

•I  answered  you  (m.sg.)' 

2.3.4 

Siikiyyiin 

'Czechs  (m.pl.)' 

2.4.1 

?aa$uurya 

"Assyria" 

2.4.3 

baaluunkum 

"your  (m.pl.)  balloon' 

2.5.1 

naawaltha 

"1  handed  (sth.)  over  to  her" 

2.5.3 

9aatabthum 

'1  blamed  them' 

3.1.1 

maklabe 

"library" 

3.1.2 

bax$a$uu 

■they  bribed  him' 

3.1.3 

muTtamar 

"conference" 

3.1.4 

maTrabaan 

"jaf 

3.1.5 

?istafatt 

"1  benefited" 

3.2.1 

fallaaHa 

"peasant  woman" 

3.2.3 

ba?duunis 

"parsley" 

3.2.4 

?i9laanaat 

"ads" 

3.3.1 

musta$fa 

"hospital" 

3.3.2 

farraZtuu 

'you  showed  him  around" 

3.3.3 

musta?Zr 

'tenant  (m.)' 

3.3.4 

?isti9maar 

'colonization' 

3.3.5 

?isla?balt 

'1  welcomed' 

3.4.1 

?almaanya 

"Germany" 

3.4.3 

Zibnaalhum 

•we  brought  (to)  them" 

3.4.4 

dazziinteen 

"two  dozens' 

3.4.5 

bitkaatbi$$ 

'she  doesn't  write  to  him' 

3.5.1 

wa??aftha 

'1  stopped  her* 

3.5.2 

bit9allmuu 

'you  (pi.)  teach  him" 

3.5.3 

?allabthum 

"1  turned  them  over* 

3.5.4 

mitHayyriin 

"bewildered  (m.pl.)* 

3.5.5 

bitmarr?i$$ 

"she  doesnl  let  him  pass* 

4.1.1 

?aasyawi 

"Asian,  Asiatic  (m.sg.)* 

4.1.3 

xaaSSatan 

•particulariy" 

4.1.4 

baanyohaat 

'bathtubs' 

4.2.1 

SaaHbaaii 

'my  friends  (f .)' 

4.2.3 

$aayfiinak 

"we  see  you  (m.sg.)" 
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4.3.1 

?aabliyye 

'ability- 

4.3.4 

Zaam9iyyaat 

■university  graduates  (f.)* 

4.4.1 

SaaHbaatna 

•our  friends  (f.)* 

4.4.3 

Zaam9aatkum 

'your  (pi.)  universities' 

4.5.1 

?aabliyyto 

'his  ability 

4.5.3 

?aabliyytak 

'your  (m.sg.)  ability" 

5.1.1 

wibraka 

'crankshaft' 

5.1.4 

haydroZiin 

'hydrogen' 

5.2.1 

Hammluuli 

'load  (sth.)  for  me' 

5.2.3 

wass9uuhum 

'widen  them' 

5.3.1 

mistgidde 

'prepared  (f.sg.)' 

5.3.4 

mistwiyyaat 

•ripe  (f.pl.)' 

5.4.1 

?ustraalya 

'Australia' 

5.4.3 

wazznuulhum 

'weigh  for  themi' 

Quadrisyllable  patterns 

1.1.1.3 

mmatizim 

'rheumatism' 

1.1.1.4 

vitaminaat 

'vitamins' 

1.1.1.5 

maraHuli$$ 

•they  didn't  go  to  him' 

1.1.2.1 

Tarabeeza 

'table' 

1.1.2.3 

biZamaatak 

'your  (m.sg.)  pajamas' 

1.1.2.4 

?abaZooraat 

•blinds' 

1.1.3.1 

Hanafiyye 

'faucet' 

1.1.3.3 

?alaminyom 

'aluminum' 

1.1.3.4 

telivizyoon 

'television' 

1.1.3.5 

makatapti$$ 

'1  didn't  write  it  (m.)' 

1.1.4.1 

banTaanya 

•Britain' 

1.1.4.3 

bawariidhum 

•their  guns^ 

1.1.4.4 

marahuulhaaS 

•they  didn^t  go  to  her* 

1.1.5.1 

naDariyyto 

•his  theory^ 

1.1.5.3 

dakatirthum 

•their  doctors^ 

1.1.5.4 

maDarapthaaS 

•1  didn^t  hit  her" 

1.1.5.5 

makatap{hum$ 

•1  didnl  write  them' 

1.2.1.1 

7a7aa?iri 

•Algerian^ 

1.2.1.3 

bi$aawiri$ 

•he  doesn't  consulf 

1.2.1.4 

muraasalaat 

'correspondence^ 

1.2.1.5 

maraasali$$ 

•he  didn't  write  to  him' 

1.2.3.1 

bi$aawir1ia 

'he  consults  her* 

1.2.3.3 

bi?aabilhum 

'he  meets  them' 

1.2.3.4 

bi$aawirtiaa$ 

'he  doesn't  consult  her' 

1.2.3.5 

bikaatibhum$ 

'he  doesn't  write  to  them' 

1.2.5.4 

maHaarapthaaS 

'1  didn't  fight  her 

1.2.5.5 

ma?aabalthum$ 

'1  didn't  meet  them' 

1.3.1.1 

mufakkara 

'agenda' 

1.3.1.3 

mawiSlati$ 

'she  hasn't  arrived' 

1.3.1.4 

darabzinaat 

'banisters' 

1.3.1.5 

maDatbati$$ 

'she  didn't  hit  him' 

1.3.2.1 

dimuqraaTi 

'democrat  (m.)' 

1.3.2.3 

bukutbuulak 

'they  write  to  you  (m.sg.)' 

1.3.3.1 

dimaSqiyye 

'Damascene  (f.sg.)' 

1.3.3.3 

sa?altilhum 

'1  asked  for  them' 

1.3.3.4 

holandiyyiin 

'Dutchmen' 

1.3.3.5 

makatbatlakS 

'she  didn^t  write  to  you  (m.sg.)' 

1.3.4.1 

dizinTaarya 

'dysentery' 

1.3.4.3 

mabikaatbiS 

'he  doesn't  write  letters' 

1.3.4.4 

busub8uulhaa$ 

'they  don't  dye  to/for  her* 

1.3.4.5 

mabit?aabli$$ 

'she  doesn't  meet  him" 

1.3.5.4 

ma9allamthaa$ 

'1  didn't  teach  her" 

1.3.5.5 

makallamthum$ 

'1  didn't  talk  to  them' 

1.4.1.3 

bi$aawruki$ 

'they  don't  consult  you  (m.sg.)' 

1.4.1.4 

bi9aatbuhaa$ 

'they  don't  blame  her" 

1.4.1.5 

bikaatbuhum$ 

Ihey  don't  write  to  them' 

1.4.2.1 

?araaybiino 

•his  relatives' 

1.4.2.3 

?araaybiinak 

'your  (f.sg.)  kin/relatives' 

1.4.3.1 

?uroobbiyye 

'European  (f,)' 

1.4.3.4 

?uroobbiy7aat 

•Europeans  (f.)' 

1.4.4.1 

?araaybiinha 

'her  kin/relatives' 

1.4.4.3 

?araaybiinhum 

'their  kin/relatives' 

1.4.4.4 

biHaartuulhaaS 

'they  don't  fight  for  her" 

1.4.4.5 

biHaarbuulkum$ 

'they  don't  fight  for  you  (pi.)' 

1.5.1.4 

?akoordyonaat 

'accordions' 

1.5.2.1 

?iliktrooni 

'electronic' 

1.5.2.3 

bikattbuuhum 

'they  make  them  write' 

1.5.3.1 

bi9allmi$$i 

'he  doesn't  teach  him' 

1.5.3.3 

makalthumrni$ 

•1  didn't  eat  them' 

2.1.1.1 

?aaxirihi 

'in"?ila  ?aaxirihi  'etc.'"' 

2.1.1.3 

saaba?atak 

'she  raced  you  (m.sg.)' 

2.1.2.1 

raafa?uuni 

'they  accompanied  me' 

2.1.2.3 

Haarabuukum 

'they  fought  you  (pi.)' 

2.1.3.1 

?aadamiyye 

'human  being  (f.)' 

2.1.3.3 

raafa?attxim 

"she  accompanied  them' 

2.1.3.4 

keelomitraat 

'kilometers' 

2.1.4.1 

Saawaruulha 

"they  consulted  for  her' 

2.1.4.3 

Taalabuulkum 

"they  demanded  for  you  (m.pl.)' 

2.2.3.1 

miizaaniyye 

•budget' 

2.2.3.4 

kiimaawiyyiin 

'chemists' 

3.1.1.1 

muttaHida 

'united' 

3.1.1.3 

kattabatak 

'she  made  you  (m.sg.)  write' 

3.1.1.4 

maTrabanaat 

•jars' 

3.1.2.1 

?istimaara 

•application  form' 

3.1.2.3 

?istafaadat 

'she  benefited' 

3.1.2.4 

?istimaaraat 

•application  forms^ 

3.1.3.1 

madfa9iyye 

•artillery' 

3.1.3.3 

watfarathum 

'she  saved  them' 

3.1.3.4 

maSribiyyaat 

'Moroccans  (f.)' 

3.1.3.5 

bitjibilhumS 

'she  doesn't  bring  to  them' 

3.1.4.1 

?i9tiraafha 

'her  confession' 

3.1.4.3 

?i9tiraaDhum 

•their  objection' 

3.1.5.1 

?imkaniyyto 

•his  ability' 

3.1.5.3 

?imkaniyytak 

'your  (m.sg.)  ability' 

3.2.1.1 

limlaakame 

'the  boxing  fight' 

3.2.1.3 

bit$aawiri$ 

'she  doesn't  consult' 

3.2.3.1 

lubnaaniyye 

'Lebanese  (f.sg.) 

3.2.3.4 

lubnaaniyyiin 

'Lebanese  (m.pl.)' 

3.3.1.1 

mustaSmara 

'colony' 

3.3.1.3 

baTbuxlaki$ 

'1  don't  cook  for  you  (m.sg.)' 

3.3.1.4 

musta$fayaat 

'hospitals' 

3.3.1.5 

?il?ur3uduks 

Ihe  Orthodox' 

3.3.2.1 

?istibdaadi 

•autocratic' 

3.3.2.3 

Qallamtiihum 

'you  (f.sg.)  taught  them' 

3.3.2.4 

?isti3naa?aat 

'exceptions' 

3.3.3.1 

katirabjiyye 

'electricians' 

3.3.3.3 

wa??attillak 

'1  stopped  for  you  (m.sg.)' 

3.3.3.4 

?il?arZantiin 

'Argentina' 

3.3.3.5 

bitkattibhumS 

'she  doesnl  make  them  write' 

3.3.4.1 

wa??aftiilha 

'you  (f.sg.)  stood  up  for  her* 

3.3.4.3 

wa??aftiilhum 

'you  (f.sg.)  stood  up  lor  them' 

3.4.1.4 

bitraasluniiS 

'you  (pi.)  don't  write  to  me' 

3.4.1.5 

bitraasluhum$ 

'you  (pi.)  don't  write  to  them' 

3.4.2.1 

bit9aaynuuha 

'you  (pi.)  examine  it  (f.)' 

3.4.2.3 

bitwaa9duuhum 

'you  (pi.)  promise  them' 

3.4.4.1 

bitraasliiulha 

'you  (pi.)  correspond  for  her" 

3.4.4.3 

bitHaarbuulhum 

•you  (pi.)  fight  for  them' 

3.4.4.4 

bitraasluulhaa$. 

•you  (pi.)  don't  correspond  for  her* 

3.4.4.5 

bitHaarbuulhum$ 

•you  (pi.)  don't  fight  for  them' 

3.5.1.4 

matrakkbuhaa$ 

'don't  give  her  a  ride'                   ^ 

3.5.2.1 

?irTa?smaali 

•the  capitalistic^ 

3.5.2.3 

bit9allmuuhum 

•you  (pi.)  teach  them^ 

3.5.3.1 

binkattbi$$i 

'we  don^t  make  him  write' 

3.5.4.1 

bi1?arrimnuulha 

'you  (pi.)  trust  her- 

3.5.4.3 

bit?a$$ruulhum 

•you  (pi.)  waveforthem^ 

3.5.4.4 

mat9allmuulhaa$ 

•don^t  teach  for  her 

3.5.4.5 

mat(allmuulhum$ 

•don^t  teach  for  them' 

4.1.3.1 

?aasyawiyye 

•Asian  (f.)' 

4.1.3.4 

?aasyawiyyaal 

'Asians  (f.)' 

4.3.3.1 

?aabliyyitha 

'her  ability' 

4.3.3.3 

?aabliyyithum 

'their  ability' 

5.1.1.4 

wibrakayaat 

'crankshafts' 

5.1.3.1 

?art59iniyye 

'centipede' 

5.1.3.4 

?ingliziyyaat 

'English  women' 

5.2.3.1 

ra?smaaliyye 

'capitalism' 

5.2.3.4 

ra?smaaliyyiin 

'capitalists' 

Pentasyllable   patterns 

1.1.1.1.3 

maraHulakiS 

'they  didn't  go  to/for  you  (m.sg.)' 

1.1.1.1.4 

karikateraat 

'caricatures' 

1.1.1.1.5 

makatabuhum$ 

'they  didn't  write  them^ 

1.1.1.2.1 

?otomatiiki 

•automatic^ 

1.1.1.3.1 

bariTaniyye 

•British  (f.)^ 

1.1.1.3.4 

bariTaniyyiin 

•British  (m.pl.)' 

1.1.1.4.4 

makatabuulhaa$ 

•they  didn^t  write  to  hef 

1.1.1.4.5 

makatabuulhum$ 

'they  didn't  write  to  them' 

1.1.2.3.1 

fidiraaliyye 

•federal  (!.)• 

1.1.2.3.4 

fidiraaliyyiin 

•federal  (pi.)' 

1.1.2.3.5 

mabikaatibhumS 

'he  doesn't  write  to  them' 

1.1.3.1.3 

maTalablakiS 

'he  didn't  order  for  you  (m.sg.)' 

1.1.3.1.4 

tilivizyonaat 

'television  sets' 

1.1.3.1.5 

masa?alnali$$ 

'we  didn't  ask  for  him^ 

1.1.3.2.1 

baladiyyaati 

•coming  from  my  town^ 

1.1.3.2.3 

baladiyyaatak 

■coming  from  your  (m  sg.)  town' 
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1.1.3.3.1 

danimar1<iyye 

•Dane.  Oanish  (f.sg.)' 

1.1.3.3.3 

baladiyyitkum 

•your  (pi.)  municipalit/ 

1.1.3.3.4 

danimarkiyyiin 

"Danes,  Danish  (m.pl.)' 

1.1.3.3.5 

makatahtilli$$ 

*l  didn't  write  to/for  him' 

1.1.3.4.1 

baladiyyaatna 

'our  municipalities' 

1.1.3.4.3 

baladiyyaatkum 

'your  (pi.)  municipalities' 

1.1.3.4.4 

masa?alnaalhaa$ 

'we  didn't  ask  for  her^ 

1.1.3.4.5 

makatabnaalkum$ 

•we  didn^t  write  to/for  you  (pi.)' 

1.1.4.1.3 

mabiraaslaki$ 

'he  doesn't  correspond  with  you  (m.sg.) 

1.1.4.2.1 

Ziyofiizyaa?i 

'geophysical' 

1.1.4.3.3 

maraHuulhummiS 

'they  didn't  go  to  them' 

1.1.4.4.4 

mabiHaartxiulhaaS 

'they  don't  fight  for  her* 

1.1.4.4.5 

mabiHaarbuulkumS 

•they  don1  fight  for  you  (pl.)^ 

1.1.5.1.4 

mabikattbuhaa$ 

•they  don1  make  her  write^ 

1.1.5.1.5 

mabikattbuhum$ 

•they  don't  make  them  write' 

1.1.5.3.3 

maDarapthutiuiii$ 

'1  didn't  hit  them' 

1.2.1.1.3 

maraasalatiS 

'she  didn't  correspond' 

1.2.1.1.4 

maHaarabuhaa$ 

'they  didn't  fight  her" 

1.2.1.1.5 

makaatabuhum$ 

'they  didn't  write  to  them' 

1.2.1.2.1 

muraasilaato 

'his  correspondence  (f.)' 

1.2.1.3.1 

bi$aawirilha 

'he  waves  to  her- 

1.2.1.3.3 

biHaaribilhum 

'he  fights  for  them' 

1.2.1.3.4 

Zazaa?iriyyaat 

•Algerians  (f.)' 

1.2.1.3.5 

maHaarabatli$$ 

•she  didn't  fight  for  him' 

1.2.1.4.1 

nxiraasalaatha 

'her  correspondence' 

1.2.1.4.3 

muraasilaathum 

'their  correspondence  (f.)' 

1.2.3.2.1 

SalaaHiyyaato 

'his  authorities' 

1.2.3.2.3 

SalaaHiyyaatak 

•your  (m.sg.)  authorities^ 

1.2.3.3.3 

bi$aawirtiumnrii$ 

•he  doesn't  consult  them' 

1.2.3.4.1 

SalaaHiyyaatha 

'her  authorities' 

1.2.3.4.3 

SalaaHiyyaatkum 

'your  (pi.)  authorities' 

1.2.3.4.4 

makaatabnaalhaaS 

•we  didn't  write  to  her* 

1.2.3.4.5 

makaatabnaalkumS 

•we  didnl  write  to  you  (pi.)' 

1.2.5.3.3 

maHaarapthumrni$ 

'1  didn't  fight  them' 

1.3.1.1.3 

makattabatiS 

'she  didn't  dictate' 

1.3.1.1.4 

makassaratcx)$ 

•she  didnl  smash  it  (m.)' 

1.3.1.1.5 

ma9allamat$$ 

•she  didn't  teach  him" 

1.3.1.2.1 

$ikoslofaaki 

'Czechoslovakian' 

1.3.1.2.3 

mubarriraatak 

■your  (m.sg.)  justifications' 

1.3.1.2.4 

?arisToTaaliis 

•Aristotle' 

1.3.1.3.1 

bi?amminilha 

'he  tmsts  her* 

1.3.1.3.3 

budursuhurtuiii$ 

•they  don^t  study  them^ 

1.3.1.3.4 

katidra?iyyaat 

'Cathedrals' 

1.3.1.3.5 

madarrasathum$ 

'she  didn't  teach  them' 

1.3.1.4.1 

$cikoslofaakya 

'Czechoslovakia' 

1.3.1.4.3 

mubarriraathum 

1heir  justifications' 

1.3.2.1.1 

bilimwaafa?a 

'approvingly' 

1.3.3.1.3 

maTabxatlaki$ 

'she  didn't  cook  for  you  (m.sg.)' 

1.3.3.1.4 

ma9allamtuhaa$ 

'you  (pi.)  didn't  leach  her* 

1.3.3.1.5 

ma9allamnahum$ 

'we  didnl  teach  them' 

1.3.3.2.1 

?aristuqraaTi 

'Aristocrat(ic)' 

1.3.3.3.1 

madarrastt$$i 

'1  didn't  teach  him' 

1.3.3.3.3 

mabuktubhummiS 

'he  doesn't  write  them' 

1.3.3.3.4 

rTtawaffartilhaa$ 

'1  didn't  save  for  her* 
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1.3.3.3.5 

mabitkattibhum$ 

"she  doesnl  make  them  write' 

1.3.3.4.4 

mawa??aftiilhaa$ 

•you  (f.sg.)  didn't  stand  for  her* 

1.3.3.4.5 

mawa??afnaalhum$ 

•we  didn't  stand  for  ttiem^ 

1.3.4.1.4 

mabilraaslunii$ 

•you  (pi.)  don't  write  to  me' 

1.3.4.1.5 

mabitraasluli$$ 

'you  (pi.)  don't  correspond  for  him" 

1.3.4.3.3 

maZibnaalhummi$ 

'we  didn't  bring  to  them' 

1.3.4.4.4 

mabitraasluulhaa$ 

•you  (pi.)  don^t  correspond  tor  her 

1.3.4.4.5 

mabitraasluulhum$ 

•you  (pi.)  don^t  correspond  (for  them^ 

1.3.5.3.3 

ma9allamthummi$ 

•1  didn't  teach  them' 

1.4.1.1.3 

biHaartxjIakiS 

•they  don^t  fight  for  you  (m.sg.)' 

1.4.1.3.3 

bikaatbuhummi$ 

'they  don^t  write  to  them^ 

1.4.1.3.4 

?ameert<aniyyaat 

•Americans  (f.)^ 

1.4.4.3.3 

biHaarbuulkummi$ 

•they  don^t  fight  for  you  (pi.)' 

1.5.1.3.1 

?iliktroniyye 

•electronic  (f.)^ 

1.5.1.3.3 

bikattbuhummi$ 

'they  don't  make  them  write' 

1.5.1.3.4 

?iliktroniyyaat 

'electronics' 

2.1.1.3.1 

Haarabaliiha 

'she  fought  for  her* 

2.1.1.3.3 

Haarahatilhum 

•she  fought  for  them' 

3.1.1.1.4 

kartxiretoraat 

•carburetors^ 

3.1.1.2.1 

?imbaraToora 

•empress^ 

3.1.1.3.1 

?il9arabryye 

•the  Arab  (f.)' 

3.1.1.3.3 

bitHaribilkum 

•she  fights  for  you  (pi.)' 

3.1.1.3.4 

bittilivizyoon 

•on  television^ 

3.1.1.4.1 

baskalitaatha 

•her  bicycles' 

3.1.1.4.3 

baskalrtaalhum 

'their  bicycles' 

3.1.2.3.1 

dihliimaasiyye 

'diplomacy' 

3.1.2.3.3 

?ii1iZaaliyyan 

'on  the  spur  of  the  moment' 

3.1.2.3.4 

?iqtiSaadiyyiin 

•economists' 

3.1.2.5.4 

?ittifaa?iyyteen 

'two  agreements' 

3.1.3.1.1 

?ilbanafsa2i 

'the  violet  (m.)' 

3.1.3.1.3 

bitHariblakiS 

•she  doesn't  fight  for  you  (m.sg.)* 

3.1.3.2.1 

?imkaniyyaato 

'his  abilities' 

3.1.3.2.3 

?imkaniyyaatak 

'your  (m.sg.)  abilities" 

3.1.3.3.1 

?imkaniyyitha 

•her  ability 

3.1.3.3.3 

bit$awirhummi$ 

'she  doesm  consult  them* 

3.1.3.4.1 

?imkaniyyaatha 

'her  abilities' 

3.1.3.4.3 

?imkaniyyaathum 

'their  abilities' 

3.2.1.3.1 

binkaatibiiha 

•we  correspond  for  her" 

3.2.1.3.3 

binHaaribilhum 

'we  fight  for  them' 

3.2.1.3.4 

binTaalibilhaa$ 

'we  don't  demand  for  her* 

3.2.1.3.5 

bit$aawirilhum$ 

'she  doesn't  consult  tor  them' 

3.2.3.2.1 

binkaatibhaalo 

•we  correspond  her  for  him^ 

3.2.3.2.3 

binHaaribhaalak 

•we  fight  her  for  you  (m.sg.)' 

3.3.1.1.4 

?ilbaskalitaat 

'the  bicycles' 

3.3.1.2.1 

bitkart<ibuuha 

•you  (pi.)  mess  it  (f.)  up^ 

3.3.1.2.3 

bitkar1<ibuuhum 

•you  (pi.)  mess  them  up^ 

3.3.1.3.1 

bil?aktariyye 

•largely 

3.3.1.3.3 

binkatlibilhum 

'we  dictate  for  them' 

3.3.1.3.4 

?iZZumhuriyyaat 

•the  republics' 

3.3.1.3.5 

bin?amminilhum$ 

'we  don't  tnjst  them* 

3.3.1.4.1 

itHallatuuIha 

'they  threaten  her* 

3.3.1.4.3 

bitwajhanuulhum 

'they  flatter  them' 

3.3.1.4.4 

bitHallafuulhaaS 

'they  don't  threaten  her* 

3.3.1.4.5 

bitwajhanuulhum$ 

'they  don't  flatter  them' 
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3.3.2.3.1       ?istihlaakiyye 

■consumer-' 

3.3.2.3.4      ?istibdaadiyyiin 

•autocratic  (pi.)' 

3.3.3.3.3      bit9allimhummi$ 

'she  doesn't  teach  them' 

3.3.4.3.3      matfar$uulhummi$ 

'don't  brush  for  them' 

3.4.1.3.1       ?issoovyatiyye 

'the  Soviet  (f.)' 

3.4. 1 .3.3      matraasluhummi$ 

'don't  write  to  them' 

3.4.1.3.4      ?issoovyatiyyiin 

'the  Soviet  (m.pi.)' 

3.5.1.3.1       mat9allmuli$$i 

■don't  teach  for  him" 

3.5.1.3.3      matrakkbuhummi$ 

'don't  give  them  a  ride* 

3.5.4.3.3      matjamm9uulhummi$ 

■don^t  collect  for  them' 

5.1.1.2.1       ?an3ropolooZi 

'anthropologic(al)' 

5.1.1.4.1       ?an3ropolooZya 

'anthropology' 

Hexasyllabic   patterns 

1.1.1.1.1.3  makatabulaki$ 

'they  didn't  write  to  you  (m.sg.) 

1.1.1.1.3.1  ?otomatikiyye 

•automatic  (f.)' 

1.1.1.1.3.3  maDarabuhummiS 

'they  didn^t  hit  them' 

1.1.1.1.3.4  ?otomatikiyyiin 

•automatic  (m.pl.)^ 

1.1.1.3.3.3  masa?alilhummi$ 

•he  didn't  ask  for  them' 

1.1.1.4.3.3  maTalabuulhummi$ 

'they  didn't  order  for  them' 

1.1.2.3.3.3  mabiraasilhummi$ 

'he  doesn't  correspond  with  them' 

1.1.3.1.1.3  mamasaknalakiS 

'we  didn't  hold  for  you  (m.sg.) 

1.1.3.1.3.1  Ziyofizya?iyye 

'geophysical  (f.)' 

1.1.3.1.3.3  maDarabtuhummi$ 

'you  (pi.)  didn't  hit  them' 

1.1.3.3.1.3  maTabaxtillaki$ 

'1  didn't  cook  for  you  (m.sg.)' 

1.1.3.3.3.3  maHamaltilhummiS 

'1  didn't  canry  for  them' 

1.1.3.4.3.3  maba9atnaalkummi$ 

'we  didn't  send  to  you  (pi.)' 

1.1.4.4.3.3  mabiHaarbuulkummi$ 

'they  don't  fight  for  you  (pi.)' 

1.2.1.1.2.1  maHaarabuli$$i 

'they  didn't  fight  for  him' 

1.2.1.1.3.3  maraasaluhummi$ 

'they  didn't  correspond  with  them' 

1.2.1.1.3.4  maHaaraha1ilhaa$ 

'she  didn't  fight  for  her* 

1.2.1.1.3.5  maHaarabatilhum$ 

'she  didn't  fight  for  them' 

1.2.1.3.1.3  masaaba?atlaki$ 

'she  didn't  race  for  you  (m.sg.)' 

1.2.1.3.3.1  mabaarakatli$$i 

'she  didn't  bless  for  him' 

1.2.1.3.3.3  biraasilihummi$ 

'he  doesn't  correspond  for  them' 

1.2.3.4.3.3  makaatabnaalhummiS 

'we  didn't  write  for  them' 

1.3.1.1.3.3  ma9allamuhummi$ 

•they  didn^t  teach  them^ 

1.3.1.1.3.4  madarrasatilhaaS 

'she  didn't  teach  for  her* 

1.3.1.1.3.5  ma(assaratilhum$ 

'she  didn't  explain  to  them' 

1.3.1.2.3.1  $ikosluvaakiyye 

'Czechoslovakian  (f.)' 

1.3.1.2.3.4  $ikosluvaakiyyiin 

•Czechoslovakians  (m.)' 

1.3.1.3.1.3  mabitsabi?laki$ 

'she  doesn't  race  for  you  (m.sg.)' 

1.3.1.3.3.3  maHamlatilhummi$ 

•she  didn't  carry  for  them' 

1.3.2.1.3.4  mabilHaaribilhaaS 

'she  doesn-t  fight  for  her* 

1.3.2.1.3.5  mabitHaaribilhum$ 

'she  doesn't  fight  for  them* 

1.3.3.1.3.1  makassamali$$i 

'we  didn't  break  for  him' 

1.3.3.1.3.3  maHa??a?nahummi$ 

'we  didn't  achieve  them' 

1.3.3.1.3.4  mabinkart<ibilhaa$ 

'we  don't  mess  up  for  her* 

1.3.3.1.3.5  mabin8arbililhum$ 

'we  don't  sift  for  them' 

1.3.3.1.4.4  mabrtkarkibuulhaa$ 

'you  (pi.)  don't  mess  up  for  her" 

1.3.3.1.4.5  mabit8arbiluulhum$ 

'you  (pi.)  don't  sift  for  them' 

1.3.3.2.3.1  ?aristoqraaTiyye 

•aristocracy* 

1.3.3.2.3.4  ?aristoqraaTiyyiin 

*aristocrats  (m.)* 
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1.3.3.3.3.3 
1.3.4.1.3.3 
1.3.4.4.3.3 
3.1.1.2.3.1 
3.1.1.2.3.4 
3.1.1.3.3.3 
3.1.3.1.3.1 
3.1.3.1.3.3 
3.1.3.1.3.4 
3.3.1.2.3.1 
3.3.1.2.3.4 
3.3.1.3.3.3 
3.3.1.4.3.3 
3.3.3.1.2.1 
3.3.3.1.3.1 
5.1.1.2.3.1 
5.1.1.2.3.4 


mabit9allimhummi$ 

mabitraasilhummiS 

mabitraasluulhummi$ 

?imbaraTooriyye 

?imbaraTooriyyaat 

bi1Haribilkummi$ 

?ilbanafsa2iyye 

bitkarikbuhummi$ 

?ilbanafsaZiyyaat 

?il?i$tiraakiyye 

?il?i$tiraakiyyiin 

malwaSSililhummi$ 

matkarkibuulhummi$ 

?il?iskandaroona 

?ilmustanSiriyya 

?an3ropolooZiyye 

?an3ropolooZiyyiin 


Septasyllabic   patterns 


1.2.1.1.3.3 
1.3.1.1.3.3 
1.3.3.1.3.3 
1.3.3.1.4.3 
3.1.1.1.1.3 
3.1.1.3.1.3. 
3.1.3.1.2.3 
3.1.3.1.2.3 
3.1.3.3.2.3. 
3.1.3.3.2.3. 
3.3.1.1.2.3. 
3.3.1.1.2.3. 
3.5.1.1.2.3 
3.5.1.1.2.3, 


3  maHaarabatilkummi$ 
3  mabitHaribilkummi$ 

3  mabinkarkibilhummi$ 
.3  mabitkarkibuulhummi$ 
1  ?il?otomatikiyye 
1  ?ilZiyofizya?iyye 
1  ?i$$ekoslovaakiyye 
,4  ?i$$ekoslovaakiyyiin 
1  ?il?aristoqraaTiyye 

4  ?il?aristoqraaTiyyiin 
1  ?il?imbaraTooriyye 
4  ?il?imbaraTooriyyaat 
.1  ?il?an3ropolooZiyye 
4  ?il?an3ropolooZiyyiin 


'she  doesn't  teach  them' 

'she  doesn't  correspond  with  them' 

'you  (pi.)  don't  correspond  for  them' 

'empire' 

'empires' 

'she  doesn't  fight  for  you  (pi.)* 

'the  violet  (f.)' 

'you  (pi.)  don't  mess  them  up' 

'the  violets  (f.pl.)' 

'the  socialist/socialism' 

'the  socialists' 

'don't  send  to  them' 

'don't  mess  up  for  them' 

'Alexandaretta' 

'Mustansiria' 

'anthropological' 

'anthropologists' 


'she  didn't  fight  for  you  (pi.)' 

'she  doesn't  fight  for  you  (pi.)' 

'we  doni  mess  up  for  them' 

'you  (pi.)  don't  mess  up  for  them' 

'the  automatic  (f.)' 

'the  geophysical  (f.)' 

'the  Czechoslovakian  (f.)' 

'the  Czechoslovakians  (m.)' 

'the  aristocracy' 

'the  aristocrats  (m.)' 

the  empire' 

"the  empires' 

'the  anthropological' 

'the  anthropologists' 
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ON  THE  NOTION  OF  'SECOND   POSITION'   IN   GREEK 

Richard  Cervin 

The  notion  'second  position'  plays  an  important  role  in  the 
syntax  of  many  languages.  Depending  on  the  language,  the  position 
may  be  occupied  by  various  elements,  such  as  AUX,  pronominal 
clitics,  adverbial  particles,  medals.  It  has  been  generally  argued  that 
'sentential  second  position'  is  a  syntactic  phenomenon.  More  recent- 
ly, Radanovic-Kocic  (1988)  has  argued  that  'second  position'  is  a 
phonological  phenomenon  in  Serbo-Croatian.  In  this  paper  1  argue 
that  in  Ancient  Greek,  second  position  cannot  be  exhaustively  defined 
either  by  the  syntax  or  the  phonology  but  requires  consideration  of 
pragmatic  factors. 

0.  Introduction^ 

Many  languages  exhibit  clitics  of  one  sort  or  another  and  it  is  not 
uncommon  for  such  clitics  to  be  placed  in  what  is  termed  the  'second  position' 
of  the  clause  or  sentence.  But  what  exactly  is  'second  position'?  'Second 
position'  (or  2P)  may  be  defined  in  various  ways:  After  anything  which  is  fronted 
(Hock  1985),  after  the  first  word  and/or  the  first  constituent  of  the  sentence 
(Steele  1975,  1976;  Klavans  1982,  1985;  Kaisse  1985),  after  the  first  syllable 
(Tegey  1977),  after  the  first  phonological  phrase  (Radanovic-Kocic  1988). 

The  purpose  of  this  paper  is  to  examine  the  notion  of  'second  position'  with 
respect  to  Ancient  Greek,  a  language  which  may  place  its  2P  particles  after  the 
first  word,  or  after  the  first  constituent  of  the  clause,  or  after  non-initial  elements 
within  the  first  constituent.  I  will  argue  that  2P  in  Greek  is  sensitive  not  merely  to 
syntax,  but  also  to  pragmatic  factors. 

The  first  section  will  provide  relevant  background  material  specific  to 
Ancient  Greek,  such  as  which  elements  occupy  'second  position',  how  clitics 
are  characterized  in  Greek,  and  how  2P  is  defined.  Section  2  will  discuss  the 
two  major  approaches  to  the  definition  of  second  position,  the  syntactic  and  the 
phonological  ones,  and  how  neither  of  these  is  sufficient  for  Greek.  In  section  3, 
I  will  argue  for  a  pragmatic  approach  to  2P  in  Greek. 

1.  General  background  and  definitions. 

This  paper  is  not  about  clitics  per  se,  yet  any  discussion  of  'second 
position'   requires  a  discussion  of  clitics,   especially  in  the  light  of  such 
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Statements  as  Kaisse's:  'All  languages  with  S'  clitics  place  those  clitics  in 
second  position."  (1985:81,  emphasis  mine).  Now,  traditional  Greek  grammars 
make  a  distinction  between  "enclitics',  words  which  attach  to  the  preceding  word 
and  affect  the  accentuation  of  their  host,  and  "postpositives",  words  which  may 
never  begin  a  sentence  or  phrase.  (Smyth  1920;  Goodwin  1894;  Denniston 
1954.)  Such  a  distinction  is  necessary,  since  not  all  clitics  are  "postpositive",  nor 
is  every  "postpositive"  word  a  clitic.  Failure  to  understand  this  distinction  has 
resulted  in  erroneous  analyses  and  claims. 

1.1.  'Postpositives".  "Postpositive",  as  the  term  is  used  by  the  Greek 
grammars,  simply  means  a  word  that  may  never  begin  a  sentence  or  a  clause. 
The  term  is  non-committal  regarding  the  actual  positioning  of  the  word  in  the 
sentence.  All  of  the  postpositives  in  Greek  belong  to  a  class  of  words 
traditionally  called  "particles'.  These  particles  are  various  types  of  conjunctions 
and  sentence  adverbs  with  a  variety  of  uses:  inferential,  adversative,  correlative, 
ironic,  etc.  (For  a  list  of  these,  together  with  their  functions  and  meanings,  see 
the  Appendix.)  These  postpositive  particles  seem  to  fall  into  Zwicky's  (1977) 
category  of  'bound  words",  inasmuch  as  they  must  have  some  host  to  their  left. 
Most  of  these  postpositive  particles  are  always  accented,  and  as  such  cannot 
be  considered  clitics.^ 

Following  are  a  few  illustrative  sentences  which  contain  one  or  more  of 
these  postpositive  particles. 

(1)    a.      en   Elidi  men        gar    kai       en  Boiolois... 

in    Elis      -  for     and     among        Boeoteans... 

(PISym.  182b). 

"For  in  Elis  and  among  the  Boeoteans...' 

b.  ei  ^ra  ho  Eros      ton       kalon 

if    consequently  the-N      Eros-N  the-G  beauty-G 

endees       esti... (ibid.  201c). 

lacking-N  is-3s... 

"Consequently,  if  Eros  lacks  beauty...' 

c.  logisamenoi    dl  sunnoesomen  to        pathos... 
thinking- N     indeed  understand-subj-lp  the-A  experience-A 
(PI  Polit.  270b). 

'Indeed,  thinking  (thus),  we  would  understand  the  experience...' 

1.2.  Clitics.  Clitics  are  generally  conceded  to  be  morphemes  which  are 
intermediate  between  words  and  affixes,  i.e.,  they  act  like  words  in  some 
respects,  yet  like  affixes  in  others.  It  is  also  generally  agreed  that  clitics  are 
stressless  morphemes  which  attach  to  a  'host'  word,  although  there  is  some 
discussion  of  clitics  bearing  stress  (Zwicky  1977;  Klavans  1982;  Kaisse  1985). 

1.2.1.  Proclitics.  There  is  some  debate  in  the  Greek  grammatical 
tradition  regarding  which  words  are  proclitic.  While  the  Ancient  Greek 
grammarians  did  not  recognize  such  a  category  (Chandler  1881:263-83), 
modern  Greek  scholars  admit  a  small  class  of  proclitics,  about  a  dozen  words 
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which  are  all  unaccented  monosyllables  (Smyth  1920:41;  Goodwin  1894:31). 
Modern  linguists,  however,  such  as  Sommerstein  (1973)  and  Klavans  (1982) 
treat  all  prepositions,  articles,  and  certain  other  words  as  proclitic. 

Regardless  of  their  status  as  proclitics  or  not,  all  prepositions  and  all  forms 
of  the  article  may  serve  as  host  for  clitics,  and  may  thus,  in  effect,  occupy  'first 
position'  as  in  the  following  examples. 

(2)  a.      ho       de    KCiros      idon  auton  ...(Xen  Cyr.  III.1.8) 

the-N  and  Cyrus-N  seeing-N  him-A  ... 
'And  Cyrus,  seeing  him...' 

b.  tei        de    husteraiai...(ibid.  III. 3.34). 
the-D  and     next-D... 

'And  on  the  next  (day)...' 

c.  en  de    toi      tou       leimonos     telei...(A.T.  1.1.6). 
at  and  the-D  the-G  meadow-G  end-D... 

'And  at  the  end  of  the  meadow...' 

1.2.2.  Enclitics.  For  the  purposes  of  this  paper,  I  shall  define  enclitics  in 
a  rather  tight  fashion  following  the  traditional  Greek  definition.  Accordingly, 
enclitics  are  those  words  which  attach  to  the  right  end  of  a  host  word,  and  affect 
the  accent  of  their  host.  For  example, 

(3)  a.      anthropostis  (from:  anthropos  +  tis) 'a  certain  man' 

b.  doulos  =te  kai  eleutheros  (from:  doulos  +  te)  'both  a 
slave  and  a  freeman' 

c.  moros  =ge  =esti  (from:  moros  +  ge  +  esti) 'he's  a  fool' 

Thus,  a  crucial  factor  in  determining  whether  a  word  is  enclitic  or  not  is  this 
property  of  affecting  the  accentuation  of  the  host  word  (see  Smyth  1920:42-3 
and  Chandler  1881:263-7).  This  is  important,  at  least  for  Greek,  because  in 
some  modern  work  on  clitics  (e.g.  Kaisse  1985;  Klavans  1982;  Sommerstein 
1973),  certain  Greek  particles  are  considered  to  be  clitic,  such  as  oun  'there- 
fore', gar 'for',  d§  Indeed',  although  these  words  do  not  affect  accentuation, 
fvloreover,  as  noted,  these  particles  are  always  accented.  According  to  Greek 
tradition,  both  ancient  and  modern,  such  particles  are  not  enclitics,  even  though 
they  are  postpositives. 

The  unaccented  postpositives  (see  the  appendix),  are  all  enclitic  inasmuch 
as  they  affect  an  accentual  change  on  the  host.  Greek  possesses  other  enclitics 
besides  these  postpositive  particles,  such  as  the  indefinite  pronoun  tis  'anyone', 
certain  temporal  particles  such  as  pote  'at  one  time',  various  enclitic  pronouns, 
and  a  couple  of  enclitic  verbs. 

1.2.3.  Pronominal  clitics.  Pronominal  clitics  in  Greek,  both  as 
grammatical  subject  and  object,  need  not  occur  in  2P.  Kaisse  (1985:80)  gives 
an  example  of  an  object  clitic,  nin,  in  2P  along  with  what  she  calls  an  S'-clitic, 
de.   Her  citation  is  given  as  (4)  below: 
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(4)  ego  =cie  =nin  hos  enoesa. 

I        =but  =him  so     perceived. 

'But  I  perceived  him  thus.'  (Theoc  [Idyll  2.103]). 

A  few  pages  later  she  states,  'A  glance  at  the  Tagolog  examples  or  the  Ancient 
Greek. ..reveals  that  these  languages  also  place  their  object. ..clitics  in  second 
position.'  (1985:83)  This  statement  is  misleading  for  Greek  because  object 
clitics  need  not  appear  in  2P.  Wackernagel  (1892)  demonstrated  a  tendency 
for  pronominal  enclitics  to  occur  in  2P,  but  that  tendency  has  percentage  rates 
of  only  40-65%  as  far  as  I  can  see.^  It  is,  however,  by  no  means  obligatory  for 
pronominal  enclitics  to  appear  in  2P.  As  the  following  examples  show, 
pronominal  enclitics  may  occur  at  any  point  within  a  given  sentence,  except 
initially. 

(5)  a.     thermos  gar   eros      auto  =me    kataithei.  (Theoc  7.56). 

hot-N       for     love-N  here    me-A  burns-3s. 
'For  hot  love  burns  me  here.' 

b.  phoiteis  d'  auth'  houtos  hokka  glukus  hupnos 
wander-2s  but  again  thus  whenever  sweet-N  sleep-N 
ekhei  =me.  (Theoc  1 1 .22). 

take-subj-3s  me-A 

'But  you  wander  off  again  whenever  sweet  sleep  takes  me.' 

c.  e  d6    Puthfe       =oi      khrai  fade: ... 
the-N  and  Puthia-N  him-D  prophesied-3s    thus 
(Hdt  1.55.2). 

'And  the  Pythian  (priestess)  prophesied  to  him  thus:  ...' 

Some  pronouns  have  accented  full  forms  which  may  occur  clause  initially.  In 
the  following  example,  the  pronoun  se  is  clearly  clause  initial  as  is  shown  by  the 
2P  particle  men  which  follows  it.  Note  that,  as  in  other  Indo-European  lan- 
guages, the  vocative,  Zeu,  is  extra-clausal,  i.e.,  does  not  count  as  occupying  first 
position. 

(6)  (Zeu),  se  men  Idafoisin  en  ouresi  =phasi 
Zeus-V,  you-A  -  Idaian  in  mountains  say-3p 
genesthai.   (Call  Hymn  1.6). 

to-be-born. 

'Zeus,  they  say  you  were  born  in  the  Idaian  mountains.' 

1.2.4.  Clitic  verbs.  Greek  also  has  a  couple  of  clitic  verbs.  These  verbs 
are  peculiar  in  that  they  are  enclitic  only  in  the  first  and  third  person  of  the  present 
tense.  The  verbs  in  question  are  eimi'Xo  be'  and  phemi'Xo  say,  affirm'.  They  can 
be  identified  as  clitics  by  their  property  of  affecting  the  accentuation  of  their  host. 
However,  they  are  not  limited  to  2P.  For  example: 

(7)  a.      oudeis       gdr  outo  anoetos    =esti.  (Hdt  1.87.4) 

no-one-N    for    so      stupid-N      is-3s. 
'For  no  one  is  so  stupid.' 
(anoetos  =esti  <  anoetos  +  esti) 
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b.      Perseon       men    nun      oi       logioi  Phoinikas 

Persians-G      -       now       the-N  historians-N  Phoenicians-A 

aiti'ous      =phasi  genesthai    tes       diaphores. 

causes-A  say-3p   to-be  the-G  quarrel-G. 

(Hdt  1.1.1). 

'Now  the  Persians'  historians  say  (that)  the  Phoenicians  are  the 

cause  of  the  quarrel.' 

(aitious  =phasi  <  aitfous  +  phasi) 

The  clitic  forms  of  eimi  and  phemf  need  not  occur  in  2P,  although  they 
frequently  do  so.  esti  'is-3s'  also  has  an  accented  full  form  which  is  usually 
impersonal. 

(8)  esti     =moi     kala  pais.    (Sap  132). 
is-3s  me-D     beautifui-N    daughter-N. 
'There  is  to  me  a  beautiful  daughter.' 

=  'I  have  a  beautiful  daughter.' 

The  verb  phemf  a\so  has  accented  full  forms: 

(9)  phasi     de    =min      Delphoi  Theodorou       tou 
say-3p  and     it-A       Delphians-N    Theodonjs-G    the-G 
Sami'ou       ergon     einai.  (Hdt  1.51.3). 

Samian-G  work-A  to-be. 

'And  the  Delphians  say  (that)  it  is  the  work  of  Theodorus  the  Samian.' 

It  is  beyond  the  scope  of  this  paper  to  discuss  whether  these  clitic  verbs 
are  generated  in  their  surface  positions  or  moved  there.  My  point  is  to  illustrate 
that  Greek  clitic  verbs  can  occur  in  a  wide  variety  of  positions. 

1.3.  Defining  2P  for  Greek.  Second  position,  in  its  most  general  sense, 
is  after  the  first  word  of  a  sentence  or  clause  in  Greek.  So  for  example,  the  2P 
conjunction  de  will  occur  most  often  after  the  first  word.  (Note  that  de  is  frequently 
elided  to  d'  before  a  vowel). 

(10)  a.       hfppeue  de    ho    pais    en    agroi. 

was-riding-horseback    and  the  boy     in    field 

b.  ho  de  pais  en  agroi  hfppeue. 

c.  en  d'  agroi  hippeuen   ho  pais. 

'And  the  boy  was  riding  horseback  in  a  field." 

In  (10a),  the  verb  hfppeue  is  the  first  word,  as  well  as  the  first  constituent,  and  de 
is  placed  immediately  thereafter.  In  (10b  and  c)  de  also  occurs  after  the  first 
word.  In  so  doing,  it  interrupts  the  initial  constituent,  the  subject  NP  in  (b)  and 
the  PP  in  (c).  The  Greek  sentences  cited  thus  far  may  serve  as  illustrations  of 
this  principle. 

However,  a  second  principle  is  at  work  in  Greek,  that  of  placing  a  2P 
particle  after  the  first  constituent,  not  after  the  first  word.  Thus.  (1  Ob  and  c)  could 
also  be  written  as 


(10)  d.      ho  pats  d'  en  agroi  hippeue. 
e.      en  agroi  d6  ho  pais  hippeue. 


Following  are  a  couple  of  such  examples  from  Greek  literature  to  further 
illustrate  the  point: 

(11)  a.      [ppdiatinos  allou]  oQn    eti    prosdokas    an 

by  what  other    so      yet  expect-2s     would 
auta     mathein?  (PI  Crat.  438e). 
them    to-learn? 

'So  by  what  other  (way)  would  you  expect  to  learn  these 
things?' 
b.      [pp  ep]   tin      Leukippen  ]  dd     ho       strategos... 
at    the-A  Leucippe-A   but   the-N  general  N... 
(AT.  IV.3.1). 
'...  but  the  general  (was  looking)  at  Leucippe.' 

Furthermore,  the  notion  of  2P  in  Greek  is  not  confined  to  the  sentence,  but 
is  relevant  also  to  sub-clauses  of  sentences.  For  example,  a  conditional 
sentence  is  usually  comprised  of  two  clauses,  an  'if  clause  and  a  'then'  clause. 
In  Greek,  each  of  these  clauses  may  have  its  own  2P  particle  as  in  the  following 
sentence: 

(12)  In    d6    me    ekhosi       hugron    meden,       oi        de 
if       -      not     have-3p    wet-A     nothing-A,    the-N     - 
tes         khamathen      spodou     labontes      leikhousi. 
the-G     from-ground    dust-G      taking-N       lick-3p. 
(Hdt  IV.  172.4). 

'If  they  (a  tribe  called  Nasamones)  do  not  have  anything  wet  (to 
drink),  they  lick  the  dust  from  the  ground.' 

Note  that  a  small  subset  of  2P  particles  can  be  employed  even  in  combination 
with  phrases  (NPs  or  PPs),  as  the  following  example  shows.  Here  the  contrast  is 
between  Philip  promising  'you'  (i.e.  the  Athenians)  'peace'  and  promising  'them' 
(i.e.  the  Thebans)  'help',  and  the  correlatives  which  clarify  this  contrast  are  in 
their  appropriate  second  position.  Examples  of  this  type  suggest  that  a  complete 
discussion  of  Greek  2P  particles  will  have  to  consider  not  just  the  level  of  the 
sentence  or  clause.  However,  to  simplify  matters  I  will  in  the  following  limit  myself 
to  the  use  of  particles  in  sentential  or  clausal  second  position. 

(13)  Philippos...[humTn  mdn  eirenen]  [ekeinois  dd 
Philip-N...  you-Dp  -  peace-A  them-Dp  but 
boetheian]  epengeflato.  (Dem  18.19). 

help-A  promised-3s. 

'Philip.. .promised  you  peace  and  them  help.' 

1.4.  Collocation  of  2P  particles.  It  frequently  happens  that  two  or 
more  postpositives  cooccur  in  2P.    This  sounds  like  a  contradiction  in  terms: 
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two  different  words  in  a  linear  sequence,  each  in  'second'  position?  That  gar 
'for'  is  a  2P  postpositive  can  be  seen  from  example  (5a)  above.  Similarly,  de 
"indeed,  in  fact'  is  a  2P  postpositive,  as  illustrated  in  (1c)  above. 

What  happens  if  gar  and  dJ'are  used  together  in  a  sentence?  They  are 
stacked  up  one  after  the  other  as  gar  di  (usually  not  digai).^  and  this  'stacked 
up'  collocation  as  a  unit  occurs  in  2P.  (Cf.  Hock  1982  for  a  similar  stacking  or 
'doubling'  up  of  particles  in  Vedic  Sanskrit.)  Denniston,  commenting  on  the 
collocations  of  postpositive  particles,  says  that  conjunctive  particles  tend  to 
precede  adverbial  particles  (1954:lx). 

(14)  6s  gar  de  diexelthoi  o  kenjx ... 
when  for  in-fact  came-3s  the-N  herald-N 
(Hdt  1.196.3). 

'For  when  the  herald  In  fact  came  ..." 

Collocations  of  postpositive  particles  may  in  turn  be  followed  by 
pronominal  and  other  clitics,  as  the  next  examples  show. 

(15)  a.      outoi        men  de  =sphi       gnomas        toiasde 

these-N  then       -     them-D    opinions-A  these-A 
apedexanto.  (Hdt  1.171.1). 
proved-3p. 

'Then  they  proved  these  opinions  to  them.' 
b.      patroion        =te    gar  =toi     =esti     kai     proseti 
ancestral-N  both  for     you-D  is-3s    and    besides 
rome  uparkhei.  (Hdt  1.41 .3). 

strength-N     exist-3s. 

'For  it  is  your  (toi)  ancestral  (duty),  and  besides,  you  possess 
strength." 

2.  Explanations  of  second  position. 

Most  of  the  work  on  the  notion  'second  position'  assumes  that  2P  is  a 
syntactic  phenomenon  with  little  or  no  direct  reference  to  phonology  (Steele 
1976;  Klavans  1982,  1985;  Kaisse  1985).  Radanovic-Kocic  (1988)  however, 
argues  that  phonology  is  very  relevant  to  the  notion  of  2P.  These  two  different 
views  will  be  summarized  here. 

2.1.  2P  as  a  syntactic  phenomenon.  Steele  (1976)  describes 
Luisefio,  an  Amerindian  language,  where  AUX  occupies  the  second  position  of 
the  sentence.  She  calls  this  'sentential  second  position'  because  the  AUX 
element  has  'scope'  over  the  entire  sentence,  and  not  just  over  a  part  of  the 
sentence.  (She  never  defines  the  term  'scope'.)  'Second  position'  in  Luiseiio  is 
very  much  like  what  it  is  in  Greek,  i.e..  2P  is  reckoned  either  after  the  first  word  or 
after  the  first  constituent  of  a  sentence.  To  capture  this  notion  of  second  position 
for  Luiseiio,  Steele  posits  two  principles  (1976:596): 
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(16)  Principle  1: 

Put  the  auxiliary  after  the  first  word. 
Principle  2: 

Put  the  auxiliary  after  the  first  constituent. 

However,  as  I  will  demonstrate  in  section  3,  these  principles  are  not  enough  to 
account  for  2P  placement  in  Greek. 

Steele  further  posits  that  Luisefio  AUX  is  generated  in  sentence-initial 
position  and  is  either  moved  rightward  in  accordance  with  her  two  principles,  or 
some  word/constituent  is  fronted  around  the  AUX.  For  Steele,  'sentential 
second  position  is  very  much  a  surface  phenomenon'  (604). 

Kaisse  (1985)  also  treats  2P  as  a  syntactic  phenomenon.  For  Kaisse, 
Greek  postpositives  such  as  gar,  de,  and  dl  are  'sentential  clitics'  (i.e.  S'  clitics) 
which  'have  scope  over  the  entire  sentence'  (77).  Again,  'scope'  is  not  defined. 
Such  S'  clitics  are  generated  in  COMP  and  are  somehow  moved  to  their 
appropriate  syntactic  position.  Kaisse  says  that  she  wants  'to  remain  neutral  on 
the  issue  of  how  some  or  all  clitics  arrive  at  their  surface  position'  (76). 

Klavans  (1982,  1985)  has  developed  a  theory  of  clitics  which  is  designed 
to  'constrain  clitic  positioning  in  a  universal  framework,  regardless  of  the 
language-particular  origins  of  clitics'  (1985:96).  Insofar  as  some  languages, 
like  Greek,  have  2P  clitics,  Klavans's  theory  is  relevant  to  the  notion  of  second 
position.  Klavans  also  argues  that  the  syntax  and  phonology  of  clitics  are  inde- 
pendent of  each  other. 

Klavans  goes  on  to  discuss  her  proposal,  which  involves  three  parameters, 
the  purpose  of  which  is  to  define  clitics  in  syntactic  terms  and  to  constrain  their 
positioning  at  S-structure.  Her  three  parameters  are  as  follows  (1985:97-8): 

(17)  P(arameter)  1 :  Initial/Final:    Refers  to  the  first  or  last  constituent 

of  the  sentence  which  acts  as  the  clitic's  host  under  the 

clitic's  domain. 
P2:  Before/After:    Refers  to  the  location  of  the  clitic,  either  before  or 

after  its  host. 
P3:  Proclitic/Enclitic:    'Refers  to  the  direction  of  phonological 

liaison. 

Klavans  then  introduces  a  notion  of  Domain  of  Cliticization  which  she  defines 
thus  (1982:68): 

A  node  N  is  the  Domain  of  Cliticization  for  a  clitic  cif  the  syntactic 
position  of  c  is  determined  with  respect  to  the  immediate  constituents 
of  the  designated  node  N. 

So,  for  example,  in  a  structure  such  as  (18),  the  domain  of  position  a  is  NP, 
while  the  domain  of  position  b  is  S.  As  far  as  I  can  tell  from  Klavans,  her  notion 
of  Domain  is  merely  syntactic,  not  semantic  or  pragmatic. 
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(18) 


DET=a     N=fj 

These  parameters,  along  with  Klavans's  notion  of  Domain,  predict  the 
possible  placements  of  postpositive  particles  discussed  so  far.  According  to 
Klavans's  theory,  the  Greek  postpositive  conjunction  dk  would  be  categorized 
as  follows: 

(19)  Parameters  of  de: 
PI  :  initial. 

P2  :  after  its  host,  i.e.  postpositive. 

P3  :  this  parameter  is  inapplicable,  since  de  is  an  accented 
particle  and  not  a  clitic. 

In  the  sentences  in  (10),  reproduced  below,  de  occurs  after  the  initial  constituent 
within  the  domain  of  S  in  (a,  d,  e),  but  de  is  within  the  domain  of  the  NP  in  (b) 
and  within  the  domain  of  the  PP  in  (c). 

(10)  a.  [hfppeue]  d6  ho  pais  en  agroi. 

b.  [ho  dd  pais]  en  agroi  hfppeue. 

c.  [en  d'  agroi]  hippeuen  ho  pais. 

d.  [ho  pais]  d'  en  agroi  hfppeue. 

e.  [en  agroi]  d6  ho  pais  hfppeue. 

Klavans's  theory,  however,  does  not  predict  the  entire  range  of 
possibilities.  Take,  for  example,  an  S  initial  PP  of  the  structure  in  (20): 

(20) 


For  this  structure,  Klavans's  theory  allows  clitics  to  go  only  into  position  1 
(initial/after  within  the  domain  of  P1),  and  position  4  (initial/after  within  the 
domain  of  S).  Positions  2  and  3  are  disallowed  by  Klavans's  theory  because 
they  are  positions  within  the  sub-structure  of  the  initial  constituent,  PI. 
Klavans's  Parameter  1,  therefore,  cannot  refer  to  either  N2  or  N1,  and  thus 
positions  2  and  3  are  ruled  out. 
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But  the  Greek  postpositives  can  occur  in  all  four  of  these  positions  as  can 
be  seen  in  the  following  examples. 

(21)  a       (kai)[NP  hoi  alloi     d'       Hellenes] ...  (Dem  18.43). 
also  the  other  and    Greeks 

'And  the  other  Greeks  also  ..." 

b.  [NP  tan     glaukan    de    thalassan]      ea 

the-A  grey-A     but    sea-A  permit-imper-2s 

poti    kherson    orekhthein.  (Theoc  1 1 .43). 
to       dry-land     to-lap. 
'But  permit  the  grey  sea  to  lap  the  dry  land.' 

c.  [PR  ek  tou  autoCi  men  khorfou]  e     orme     estai. 

at  the  same        -     place      the  attack  shall-be. 
(Hdtl.11.5). 
'The  attack  shall  be  at  the  same  place.' 

d.  [pp  en  toutoi     dS         toi        khronoi]    ho 

in    this-D  indeed    the-D  time-D      the-N 
presbuteros  pais...      prosliei  ...(Xen  Cyr.  III. 1.7). 
elder-N  son-N...  came-forth-3s 

'At  this  time,  the  elder  son. ..came  forth  ...' 

In  (21a  and  b),  the  particle  occurs  in  the  position  under  N1  (position  3  in  (20)) 
which  is  ruled  out  by  Klavans's  theory.  Note  that  kai  in  (21a)  stands  outside  the 
constituent  in  the  same  way  as  the  vocative  is  outside  the  constituent  in 
example  (6)  above. ^  In  (21c)  the  particle  stands  in  position  3  from  (20)  within 
the  PP,  while  (21  d)  shows  the  particle  in  position  2.  These  examples  show  that 
2P  in  Greek  is  defined  not  just  in  terms  of  the  margins  of  constituents  as 
Klavans's  theory  suggests,  but  may  be  located  anywhere  within  the  first 
constituent  or  immediately  following  it. 

For  Klavans's  theory  to  work  for  Greek,  it  would  have  to  be  modified  so  that 
2P  postpositives  may  be  placed  anywhere  within  the  initial  constituent.  More- 
over, it  would  have  to  be  permitted  to  apply  both  to  clitic  and  to  non-clitic 
particles.  It  is  not  clear  to  me  at  this  point  how  such  modifications  could  be 
made  without  seriously  changing  the  entire  framework. 

2.2.  2P  as  a  phonological  phenomenon.  In  her  thesis  on  clitics  in 
Serbo-Croatian,  Radanovic-Kocic  (1988)  takes  a  different  approach  from 
Steele,  Klavans,  and  Kaisse.  Whereas  the  latter  attempt  to  solve  the  problem  of 
2P  clitics  by  exclusively  syntactic  means,  Radanovic-Kocic  claims  that  clit- 
icization,  in  Modern  Serbo-Croatian  at  least,  is  a  post-syntactic  phenomenon 
which  is  effected  in  PF.  Serbo-Croatian  has  several  categories  of  clitic  ele- 
ments: auxiliaries,  pronominals,  a  question  particle,  sentential  adverbs,  all  of 
which  occur  in  2P.  After  surveying  the  evidence  and  considering  alternative 
interpretations,  Radanovic-Kocic  comes  to  the  conclusion  that  2P  must  be 
defined  as  the  position  which  follows  the  first  phonological  phrase  (1988:118, 
141-3).  Accordingly,  she  posits  the  following  clitic  movement  rule  (134): 
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(22)  Clitic  Movement 

Move  all  [+clitic]  elements  within  an  l[ntonational] 
P[hrase]  into  the  position  after  the  first  P[honological 
phrase]  of  the  same  l[ntonational]  P[hrase]. 

Radanovic-Kocic  is  probably  right  in  claiming  that  syntax  alone  is  not 
enough  for  cliticization.  However,  the  constraints  operative  in  Serbo-Croatian 
are  not  directly  applicable  to  Greek.  The  only  grammatical  category  which  must 
obligatorily  occupy  2P  in  Greek  is  the  postpositive  particles.  Such  is  not  the 
case  in  Serbo-Croatian.  Any  complex  constituent  may  be  interrupted  by  the 
Greek  postpositives,  but  this  is  not  so  in  Serbo-Croatian.  Moreover,  there  is 
nothing  in  the  attested  evidence  of  Ancient  Greek  that  requires  a  phonological, 
as  opposed  to  a  syntactic,  solution. 

3.  A  pragmatic  account  of  2P. 

As  we  have  seen,  a  mere  syntactic  analysis  of  second  position  in  Greek 
runs  into  difficulties  because  2P  postpositive  particles  may  occur  anywhere 
within  the  first  constituent,  as  well  as  after  the  first  constituent.  Klavans's  theory 
predicts  only  some  of  the  various  options  for  2P  in  Greek.  Moreover,  even  if  it 
could  be  made  to  work  for  Greek,  it  would  not  address  the  question  of  why  one 
option  should  be  employed  against  another.  It  is  my  contention  that  the  various 
possible  syntactic  interpretations  of  2P  in  Greek  can  be  motivated  by  pragmatic 
concerns  such  as  topicalization,  focus,  or  emphasis. 

Steele  (1976:608-10)  raises  the  possibility  of  topicalization/emphasis  as 
related  to  the  2P  auxiliary  in  Luiseno,  but  does  not  go  into  a  lot  of  details  and 
leaves  the  matter  open  for  further  consideration. 

However,  in  Greek  literature,  there  are  many  passages  where,  within  the 
overall  discourse  context,  the  placing  of  a  postpositive  particle  either  late  in  a 
complex  constituent,  or  after  a  complex  constituent,  unambiguously  signals  that 
what  precedes  is  emphatic  or  in  focus.  To  illustrate  this  claim,  I  will  cite  a  few 
examples  with  extended  context. 

(23)  ophelen     =s*       6  tuphle     Ploute      mete      gel 
would-3s  you-A  o  blind-V  Pluto-V    neither  earth-D 
mit'  en  thalassei 

nor    in    sea-D 

mit'  en     epeiroi        phanemen,    alia  Tartaron       =te 

nor    on    mainland-D  to-appear,     but    Tartarus-A  both 

naiein         k'     Akheronta; 

to-inhabit  and  Acheron-A; 

[dia  se]     g^r    pant" 

because-of  you-A  for     everything-N 

aien        anthropois    kakl.    (Tim  731). 

always     men-D         evil-N 

'O  blind  Pluto,  would  (that)  you  not  appear  on  earth  or  sea  or 

mainland,  but  (may  you)  inhabit  Tartarus  and  Acheron;  for  (it  is) 

your  fault  (that)  everything  (is)  always  evil  for  mankind.' 
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In  the  last  sentence  of  this  short  poem,  se  is  emphatically  accented,  and  gar 
follows  the  entire  constituent,  dia  se.  This,  plus  the  overall  tenor  of  the  poem, 
justifies  the  translation  I  have  given,  'for  it  is  your  fault ..." 

(24)  [kalei]       dl     pros  ten       thean  hemas    ho 
invite-3s  then  to      the-A    spectacle-A  us-A        the-N 
strategos,     kai    he        Leukippe       sumparen. 
general-N,  and  the-N  Leucippe-N    was-present-3s. 
[hemeis]  men    oOn  epi  to         therion     tous       ophthalmous 
we-N          -       so      at    the-A   beast-A   the-A    eyes-A 
eikhomen,  [epi  ten       Leukippen]  de     ho       strategos; 
held-3p,       at    the-A  Leucippe-A  but   the-N  general-N; 

kai     euthus    healokei.      (AT.  IV.3.1). 

and  quickly    was-captivated-3s. 

Then  the  general  invited  us  to  the  spectacle,  and  Leucippe  was 

present.  So  we  were  looking  at  the  beast,  but  the  general  (was 

looking)  at  Leucippe,  and  he  was  quickly  captivated.' 

In  this  text,  the  author  manages  to  convey  a  double-focus:  'we'  is  contrasted 
with  'the  general',  and  'the  beast'  with  'Leucippe'.  This  is  effected  by  means  of 
the  rhetorical  trick  of  chiasmus,  (the  linear  sequence  of  a  b  b  a)  and  the 
presence  of  the  2P  correlative  particles  men  ...de.  The  chiastic  sequence  in 
(24)  is  'we,  at  the  beast...  at  Leucippe,  the  general',  i.e.,  subject,  PP  ...  PP, 
subject,  men  and  de  are  employed  to  reinforce  the  double-focus/chiasmus,  in 
that  men  highlights  hemeis  'we',  v.'hile  de  highlights  epi  t§n  Leukippen  'at 
Leucippe'.  Again,  de  occurs  after  the  entire  constituent,  rather  than  after  the  first 
word  of  the  constituent. 

(25)  [entautha]  de        ho       Armenios        gignoskon  ten 
then           emph.  the-N    Armenian-N    realizing-N  the^A 
ananken        katabainei,           kai    ho        Kuros  labon 
necessity-A  come-down-3s,  and    the-N  Cyrus-N  taking-N 
eis  to    meson    k'akeinon     kai     t'  alia          panta 

to    the  middle  both-him-A  and    the-rest-A  all-A 
periestratopedeusato,  homou     ede         pasan     ekhon 
pitched-camp-3s.  where       already  all-A        having-N 

ten      dunamin. 
the-A  force-A. 

[en  toutoi    de        toi        khronoi]    ho 
in    this-D     emph.  the-D  time-D       the-N 
presbuteros  pais    tou       Armeniou       Tigranes       ex 
elder-N         son-N  the-G  Armenian-G  Tigranes-N    from 
apodemias  =tin6s         proseiei,  hos        kai 

journey-G       some-G  was-coming-3s,    who-N  also 
suntheros  =pote     egegeneto  toi  Kuroi. 

fellow-hunter-N    once    was-3s         the-D    Cyrus-D. 
(XenCyr.  III.1.6-7). 
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Then  the  Armenian,  realizing  the  need,  came  down,  and  Cyrus 
took  both  him  and  all  the  rest  to  the  middle  of  his  (Cyrus')  camp, 
where  he  (Cyrus)  already  had  all  his  force.  At  that  time, 
Tigranes,  the  elder  son  of  the  Armenian,  who  at  one  time  used  to 
be  a  fellow  hunter  with  Cyrus,  was  coming  from  some  journey 
(abroad).' 

In  this  passage,  the  only  postpositive  word  used  is  d§.  The  first  occurence  of  d€ 
is  after  a  single  word  constituent,  but  the  second  occurence  is  of  greater 
interest,  since  d§ occurs  in  the  middle  of  the  constituent,  thereby  emphasizing 
toutoi  'that'. 

Examples  like  these  show  that  there  is  a  correlation  between  2P  and  the 
range  or  'scope'  of  focus  or  emphasis.  This  is  especially  clear  with  constituents 
consisting  of  more  than  two  words,  where  the  placement  of  particles  can  single 
out  any  one  of  the  words  as  being  emphasized  or  focussed. 

The  notion  of  second  position  in  Greek  can  thus  be  summed  up  as  follows: 

(26)  1.  Regarding  single  word  constituents,  2P  is  immediately  thereafter. 
2.  Regarding  constituents  of  more  than  one  word, 

a.  the  pragmatically  neutral  position  is  after  the  first  word; 

b.  the  pragmatically  marked  positions  are  anywhere  else  within 
the  first  constituent,  or  after  the  first  constituent,  such  that  the 
particle  follows  the  element  under  emphasis  or  focus. 

4.  Conclusion. 

The  notion  of  'second  position'  in  Ancient  Greek  is  not  a  mere  syntactic 
phenomenon,  but  has  pragmatic  functions  as  well.  Further,  the  elements  which 
go  into  2P  are  best  defined  as  particles.  While  some  of  these  are  clitics  (as 
Kaisse  and  Klavans  show),  others  are  always  accented  regardless  of  their 
phonological  environment,  and  are  presumably  not  clitics.  Moreover,  clitic 
pronouns  and  verbs  in  Greek  are  not  restricted  to  2P. 

Two  issues  were  not  addressed  because  their  exploration  would  have 
gone  beyond  the  scope  of  this  paper.  One  is  the  syntactic  origin  of  the  Greek 
postpositives.  There  are  several  possibilities:  They  could  be  base-generated  in 
their  surface  position;  they  could  be  generated  in  COMP  and  either  lowered  into 
the  initial  constituent,  or  elements  could  be  fronted  around  the  postpositives  to 
the  left;  they  could  be  located  under  a  different  node  altogether;  or  a 
combination  of  the  above.  The  second  issue  has  to  do  with  the  interaction  of 
the  process  of  fronting  with  the  placement  of  postpositives.  I  hope  to  address 
these  issues  in  future  publications. 
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NOTES 

^The  following  abbreviations  have  been  used  throughout  this  paper:  '=' 
indicates  clitic  attachment;  N  -  nominative;  V  -  vocative;  G  -  genitive;  D  -  dative;  A 
-  accusative;  s  -  singular;  p  -  plural;  subj  -  subjunctive;  imper  -  imperative. 

2As  Wanner  (1978)  and  Klavans  (1982:91-114)  have  shown,  some 
languages  permit  clitics  to  be  accented.  However,  such  accentuation  is  found 
only  in  certain  phonological  contexts.  Greek  particles  like  oun  are  accented  no 
matter  what  their  phonological  context. 

^Wackernagel  (1892:352)  cites  some  staticstics  which  he  culled  from  Book 
VII  of  Herodotus's  Histories:  Of  45  occurences  of  toi  'you-Ds',  18-20  are  in  2P; 
of  16  occurences  of  se  'you-As',  10  are  in  2P;  24  out  of  37  instances  of  moi  "me- 
Ds',  are  in  2P;  to  quote  a  few.  In  these  few  sets  of  statistics  one  can  see 
percentage  rates  of  40  -  65%  as  far  as  2P  placement  is  concerned.  I  myself 
tallied  the  occurences  of  the  pronoun  min  '3  pers.-A'  and  oi'3  pers.-D'  (these 
pronouns  refer  to  the  third  person  irrespective  of  gender),  throughout  Books  I 
and  II  of  Herodotus's  Histories,  and  my  findings  are  as  follows:  min  occurs  120 
times,  of  which  69  (57%)  are  in  2P,  and  oi  occurs  175  times  with  109  (62%)  in 
2P. 

"^d^  gar  is  frequent  in  Homer,  but  is  not  found  in  most  prose  authors  at  all. 
The  collocation  occurs  only  three  times  each  in  Plato  (Polit.  264e1 ;  Parm. 
138b8;  140e2)  and  Plutarch  (Moralia  30. E. 9;  32.C.11;  380. E. 8).  None  of  the 
following  authors/works  use  the  collocation:  Aristotle,  Demosthenes,  Epictetus, 
Herodotus,  Josephus,  Lysias,  New  Testament,  Philo,  Plotinus,  Polybius, 
Thucydides,  Xenophon. 

^The  position  of  kai  in  a  Greek  sentence  is  interesting  because  it  may  act 
as  a  host  for  clitics,  but  it  need  not  do  so.  In  the  following  pair  of  sentences,  the 
dative  pronoun  o/ cliticizes  to  kai  in  (a)  but  to  the  adjective  of  the  direct  object  in 
(b): 

(27)  a.      kai    =oi       egeneto     thugater...(Hdt  1.107.1). 
and  him-D    was-3s      daughter-N 
'And  he  had  a  daughter ..." 
b.      kai     megisten     =oi      ortin      anagousi. 
and  greatest-A  her-D  feast-A  celebrate-3p. 
(Hdt  11.40.1). 

'And  they  (Egyptians)  celebrate  her  (a  goddess)  with  (the) 
greatest  feast.' 

kai  may  also  combine  with  certain  postpositive  particles  with  idiomatic 
meanings,  e.g.,  kai  gar'ior  in  fact',  /ca/per 'although*,  kai  min  'and  yet'  (Smyth 
1920:640,  654,  659  respectively). 
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The  Greek  ka'i  behaves  very  much  i.ke  German  und.  Hock  (1985:5)  points 
out  that  und  is  extra-clausal  in  Standard  German,  i.e.,  it  is  not  counted  as 
occupying  any  position  at  all  in  the  sentence.  This  can  be  seen  from  the 
behaviour  of  the  verb  which  must  occupy  the  second  position: 

(28)  a.      und  er  kam  nach  hause. 

b.  *?  und  kam  er  nach  hause. 
'and  he  came  home.' 

c.  dann  kam  er  nach  hause. 

d.  *  dann  er  kam  nach  hause. 
'then  he  came  home.' 

(28b)  is  ungrammatical  in  Standard  German,  but  it  is  acceptable  in  some 
dialects.  The  behaviour  of  und  is  different  from  other  adverbs  such  as  dann  in 
that  und  is  extra-clausal,  whereas  dann  is  definitely  counted  as  the  first 
word/constituent  of  the  sentence.  (See  also  Hock  1982  for  a  similar  situation  in 
Vedic  Sanskrit,  where  certain  sentence-linking  adverbs  may  be  treated  by  the 
syntax  as  either  outside  of  the  clause  or  as  first  elements  of  the  clause.) 


TEXTUAL    ABBREVIATIONS 

A.T.  Achilles  Tatius,  Leucippe  and  Cleitophon  (in  Gaslee) 

Call  Callimachus,  Hymns  (in  Hopkinson) 

Dem         Demosthenes,  18:  On  the  Crown  (in  Butcher) 

Hdt  Herodotus'  Histories  (in  Hude) 

PI  Plato  (cited  from  Burnet) 

Crat:  Cratylus 

Parm:  Parmenides 

Polit:  Politicus 

Sym:  Symposium 
Sap  Sappho,  Fragments  (in  Page) 

Theoc      Theocritus,  Poems  (in  Gow) 
Tim  Timocreon,  Fragments  (in  Page) 

Xen  Xenophon,  Cyropaideia  (in  Gemoll) 

REFERENCES 

BURNET,  loannes.  1901.  Platonis  opera,  5  vols.  Oxford:  University  Press. 
BUTCHER,  S.  H.  1903.  Demosthenis  orationes,  3  vols.  Oxford:  University  Press. 
CHANDLER,  Henry  W.  1881.  A  practical  introduction  to  Greek  accentuation, 

2nd  ed.  Repr.  1983,  New  York:  Caratzas. 
DENNISTON,  J.D.  1954.  The  Greek  particles,  2nd  ed.  Oxford:  University  Press. 
GASLEE.  S.  1917.  Achilles  Tatius.  Rev.  1969.  Harvard,  MA:  Loeb  Classical 

Library. 
GEMOLL,  W.  1968.  Xenophontis  institutio  Cyri.  Leipzig:  Teubner. 


38 

GOODWIN,  William  Watson.  1894.  Greek  grammar,  2nd  ed.  Repr.  1977,  London: 

Macmillan. 
GOW,  A.  S.  F.  1952.  Bucolici  graeci.  Oxford:  University  Press. 
HOCK,  Hans  Henrich.  1982.  Clitic  verbs  in  PIE  or  discourse-based  fronting? 

Studies  in  the  Linguistic  Sciences  12:2.1-38. 
1985.  Pronoun  fronting  and  the  notion  of  "verb-second"  position  in  Beowulf. 

Germanic  Linguistics:  Papers  from  a  Symposium  at  the  University  of 

Chicago,  April  24,  ed.  by  J.T.  Faarlund,  70-86.  Bloomington:  Indiana 

University  Linguistics  Club. 
HOPKINSON,  Neil.  1988.  A  Hellenistic  anthology.  Cambridge:  University  Press. 
HUDE,  Carolus.  1927.  Herodoti  historiae.  2  vols.  3rd  ed.  Oxford:  Univ.  Press. 
KAISSE,  Ellen  M.  1985.  Connected  speech.  New  York:  Academic  Press. 
KLAVANS,  Judith  L  1982.  Some  problems  with  a  theory  of  clitics.    Bloomington: 

Indiana  University  Linguistics  Club. 
— .  1985.  The  independence  of  syntax  and  phonology  in  cliticization.  Language 

61.95-120. 
PAGE,  D.  L.  1968.  Lyrica  graeca  selecta.  Repr.  1973.  Oxford:  University  Press. 
RADANOVIC-KOCIC,  Vesna.  1988.  The  grammar  of  Serbo-Croatian  clitics:  A 

synchronic   and   diachronic   perspective.      University    of    Illinois   at 

Champaign-Urbana  Ph.D.   dissertation. 
SMYTH,  Herbert  Weir.  1920.  Greek  grammar.  (Revised  by  Gordon  M.   Messing, 

1956).  Cambridge,  MA:  Harvard  University  Press. 
SOMMERSTEIN,  Alan  H.  1973.  The  sound  pattern  of  Ancient  Greek.    Oxford: 

Basil  Blackwell. 
STEELE,  Susan.  1975.  On  some  factors  that  affect  and  effect  word  order.  Word 

order  and  word  order  change,  ed.  by  C.  N.  Li,  199-268.  Austin:  University 

of  Texas  Press. 
.1976.  On  the  count  of  one.  Linguistic  studies  offered  to  Joseph  Greenberg 

on  the  occasion  of  his  sixtieth  birthday,  ed.  by  A.  Juilland,  3:591-613. 

Saratoga,  CA:  Anma  Libri. 
TEGEY,  Habibullah.  1977.  The  grammar  of  clitics:  Evidence  from  Pashto 

(Afghani)  and  other  languages.  University  of  Illinois  at  Champaign-Urbana 

Ph.D.  dissertation. 
WANNER,  Dieter.  1978.  Stressed  clitics.  MS,  University  of  Illinois. 
WACKERNAGEL,  Jacob.   1892.   Uber  ein  Gesetz  der  indogermanischen 

Wortstellung.  Indogermanische  Forschungen  1 :333-436. 
ZWICKY,  Arnold  N.  1977.  On  clitics.  Bloomington:  Indiana  University  Linguistics 

Club. 


39 


APPENDIX 


Following  are  the  postpositive  particles  which  Smyth  lists  in  his  grammar 
(1920:631-671).  I  have  organized  this  list  in  terms  of  accented  and  unaccented 
particles.  The  unaccented  particles  are  all  enclitic. 

ACCENTED  PARTICLES 


ara 

inferential 

'consequently,  therefon 

ara 

confirmatory 

•indeed,  in  facf 

au 

adversative 

'on  the  other  hand' 

gar 

confirmatory 

•in  fact,  indeed^ 

causal  conjunction 

•for,  because^ 

de 

conjunction 

'and,  but,  yet' 

dl 

emphatic 

'indeed,  in  fact' 

min 

emphatic 

'indeed,  in  truth' 

men...de 

correlative  sequence 
indicating  contrast 

men 

asseverative 

'in  truth,  surely' 

adversative 

'however' 

confirmatory 

'in  fact,  indeed^ 

oun 

inferential 

'therefore,  accordingly' 

temporal  conjunction 

^then' 

CCENTEC 

)  PARTICLES 

ar/ra 

reduced  enclitic  form 
of  ara  above 

ge 

emphatic 

'certainly,  at  any  rate' 

nun/nu 

temporal 

'now,  at  the  present' 

per 

emphatic 

'very,  just,  even' 

te 

conjunction 

'and,  both' 

toi 

confirmatory 

'in  truth,  doubtless' 
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UNICORN:  A  unification  parser 
for  attribute-value  grammars* 
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UNICORN  is  a  unification-based  natural  language  system  that  is 
modelled  after  the  family  of  PAIR  systems.  This  paper  describes  the 
functionality  of  the  system  from  the  user's  perspective  and  provides 
details  of  its  design  and  implementation  in  PROLOG. 


1.   Introduction 

This  paper  describes  the  functionality  and  implementation  of  UNICORN,  a 
unification-based  natural  language  processing  system  which  is  modelled  after 
the  family  of  PATR  systems  described  in  Shieber  et  al.  (1983)  and  Karttunen 
(1986).''  The  system  described  in  this  paper  is  fully  implemented  in  Tl-Prolog"'"'^ 
on  Texas  Instruments  Explorer^M  workstations  used  by  the  Computational 
Linguistics  Laboratory  at  the  University  of  Illinois. 

The  development  of  UNICORN  has  been  guided  by  three  main 
considerations:  (1 .)  We  would  like  to  make  available  to  linguists  and  workers  In 
related  fields  a  set  of  computational  tools  that  enable  them  to  explore  the 
expressiveness  of  complex-feature  based  grammar  formalisms  without 
presupposing  any  knowledge  of  the  algorithms  used  to  embed  such  grammars 
into  natural  language  processing  systems,  (2.)  we  would  like  to  make  available 
to  instructors  and  students  in  computational  linguistics  a  set  of  instructional  tools 
that  contribute  to  study  the  algorithms  that  underlie  unification-based  natural 
language  processing  systems,  and  (3.)  we  would  like  to  explore  further 
extensions  to  known  algorithms  for  unification-based  systems  that  make  it 
possible  to  employ  such  systems  in  a  wide  variety  of  areas  in  computational 
linguistics  (including  in  particular  natural  language  generation,  machine 
translation,  and  semantic  interpretation). 

The  structure  of  this  paper  reflects  the  three  goals  driving  the  development 
of  UNICORN.  Section  2  contains  a  discussion  of  the  system  for  those  users  that 
are  primarily  interested  in  using  UNICORN  as  a  computational  workbench, 
without  becoming  involved  in  the  details  of  its  underlying  algorithms.    It  also 
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provides  relevant  discussion  for  those  users  that  want  to  view  the  system 
prinnarily  as  an  instructional  tool.  Section  3  contains  a  detailed  discussion  of 
the  algorithms  underlying  the  UNICORN  system  and  their  Prolog 
implementation.  The  paper  has  two  appendices:  Appendix  A  contains  a  sample 
grammar  for  a  fragment  of  English  that  illustrates  the  grammar  formalism 
employed  in  the  UNICORN  system.  Appendix  B  provides  a  listing  of  relevant 
parts  of  the  Prolog  implementation  of  UNICORN.  This  appendix  includes  a 
listing  of  the  unification  package  for  directed  acyclic  graph  structures,  of  the 
routine  for  checking  subsumption  among  DAG  structures,  and  of  the  procedure 
path_value  that  identifies  the  value  for  some  path  in  a  DAG.  Appendix  B  also 
contains  a  listing  of  our  Prolog  implementation  of  the  algorithm  for  restricting 
DAG  structures  in  the  context  of  the  generalized  version  of  Barley's  algorithm 
presented  in  Shieber  (1985).  The  rationale  for  including  a  partial  listing  of  the 
UNICORN  system  derives  from  the  practice  suggested  by  Hobbs  and  Shieber 
(1987),  namely  to  make  available  in  print  one's  own  solutions  to  research 
problems  in  the  field,  for  which  algorithms  and  implementations  are  known  to 
exist,  but  which,  for  one  reason  or  another,  tend  to  remain  inaccessable  to  many 
researchers  in  the  field. 


2.  A  User's  Guide  of  UNICORN 

UNICORN  consists  of  three  main  components:  (1.)  a  unification  package 
for  unifying  directed  acyclic  graph  structures  (DAG  structures),  (2.)  a  parsing 
component  that  uses  a  generalized  version  of  Barley's  algorithm  presented  by 
Shieber  (1985),  and  (3.)  a  grammar  interpreter  that  converts  input  grammars 
submitted  by  the  user  into  directed  acyclic  graph  structures  (DAG  structures) 
that  are  employed  by  the  parser.  As  illustrated  in  figure  1 ,  the  modular  structure 
of  the  UNICORN  system  derives  from  its  three  main  components,  with  the  DAG 
unifier  serving  both  the  grammar  interpreter  and  the  parsing  component. 


"Unification  Package  ] 
Figure  1 :  The  Architecture  of  the  UNICORN  system 

In  the  parsing  component,  the  DAG  unification  package  operates  on 
complex  non-terminal  symbols  in  order  to  unify  them  with  other  DAG  structures. 
In  the  grammar  interpreter  the  DAG  unifier  operates  on  partially  interpreted 
grammar  rules  which  themselves  are  represented  as  DAG  structures  in  order  to 
increment  the  information  encoded  by  such  rules. 
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2.1 .  The  Representation  of  Directed  Acyclic  Graphs  (DAG's). 

The  most  important  data  structure  used  in  the  UNICORN  system  is  that  of  a 
directed  acyclic  graph  (DAG).  Such  structures  have  been  employed  in  name 
and/or  spirit  by  a  wide  variety  of  grammar  formalisms,  including  Functional 
Unification  Grammar,  Lexical  Functional  Grammar,  PATR  grammars,  and  Head- 
Driven  Phrase  Structure  Grammars.  An  example  of  a  DAG  structure  to  encode 
grammatical  information  is  given  in  (1). 


(1) 


The  DAG  in  (1)  describes  the  grammatical  information  associated  with  the 
lexical  item  smiles  in  the  example  grammar  given  in  Appendix  A  of  this  paper. 
The  DAG  specifies  four  main  features:  maj,  head,  subcat,  and  trans.  The  values 
for  the  maj  and  head  features  (familiar  from  GPSG  and  HPSG  grammars) 
specify  which  major  syntactic  category  the  verb  smile  belongs  to  and  which 
features  will  be  inherited  by  its  mother  category.  The  maj  and  head  features  are 
specified  in  such  a  way  that  smile  is  characterized  as  a  finite  verb.  The  subcat 
feature  specifies  that  smile  subcategorizes  for  a  category  whose  head  feature 
is  marked  as  third  person  singular.  The  feature  trans  contains  information  about 
the  semantics  associated  with  smile.  It  specifies  that  the  translation  consists  of 
a  predicate  whose  value  is  smile  and  whose  argument  is  the  same  as  the  basic 
translation  associated  with  the  translation  of  the  category  that  smile 
subcategorizes  for. 

Alternatively,  DAG  structures  can  be  represented  by  attribute-value 
matrices  as  in  (2),  in  which  reentrant  values  are  marked  by  shared  numerical 
values. 
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(2) 


ma  j  :  v 

head:        [form: finite] 


tlperson:      third 
agreement : 
number:       singulai 

sem:    Fparameter:  [ifj 


pred: smile 
;m:  trans : 

|_  [_argl:[l]      J  J 


The  graph  representation  and  the  attribute-value  notation  are  often  used 
interchangeably  in  the  literature.  We  will  follow  this  practice  in  this  paper  and 
use  whatever  notation  seems  most  perspicuous  for  the  matter  at  hand. 

The  internal  representation  of  DAGs  in  UNICORN  is  that  of  an  ordered  pair 
<FS,R_List>  whose  first  element  is  a  feature  structure  (FS),  with  reentrant 
values  being  marked  by  Prolog  variables.  R_List,  the  second  element  of  the 
ordered  pair  specifies  any  information  associated  with  reentrant  values 
contained  in  the  FS. 

(3)  FS:     [[[maj,v], 

[head,   [ [form, finite] ]] , 

[subcat,  [  [first,  [  [head,  [ [agr,  [ [person, third] , 

[num,sing] ]]]]]], 
[sem,  [ [parameter, _161] ]]] , 
[rest, end] ] , 
[sem,    [  [trans,  [ [pred, smile] ,  [argl,_161] ] ] ] ] ] 

R_List:  [[_161,[]]]] 

If  a  reentrant  value  is  associated  with  the  empty  list  ([])  in  the  reentrancy 
list,  this  indicates  that  the  reentrant  value  is  completely  unspecified  with  respect 
to  its  value.  This  case  is  illustrated  by  the  reentrant  value  _161,  which 
simultaneously  marks  the  value  for  the  path  <sem  trans  arg1>  and  for  the  path 
<subcat  first  sem  parameter>  in  the  lexical  entry  for  smile. 

in  case  reentrant  values  are  themselves  associated  with  feature  structures, 
as  in  example  (4),  such  structures  are  associated  as  values  of  the  variable 
which  marks  the  reentrancy  (e.g.  the  variable  _105  has  the  value  [[form,  finite]]). 


45 

(4)   FS:     [[[xO,  [[maj,   s], 

[head,  _115], 

[sem,   [trans,   _127]], 

[gap,   _133]]], 
[xl,   _139], 
[x2,  [[maj,   vp], 

[head,   _115], 

[sem,   [trans,   _169] ] , 

[gap,   _133] 

[subcat,   [[first,   _139] 

[rest,   end]]]]]], 

R_List:  [[_139,   [[maj,    np] , 

[sem,   [trans,   _127]], 
[fun_arg,   [trans,   _169]]]], 

[_115,   [form,   finite]] 

[_133,   []] 

[_169,   []] 

[_127,   []]]] 

The  feature  structure  in  (4)  represents  the  feature  structures  associated 
with  the  three  categories  S,NP,  and  VP  in  the  S  ->  NP  VP  rule  of  the  sample 
grammar  given  in  Appendix  A.  in  that  grammar,  head  features  are  taken  to  be 
inherited  by  the  mother  category  from  their  head  daughter,  and  the  value  for 
head  in  the  feature  structure  for  S  has  to  be  reentrant  with  the  value  for  head  in 
the  feature  structure  for  VP  .  Moreover,  since  the  VP  of  S  is  taken  to  be  finite 
and  since  finite  is  the  value  of  a  head  feature:  form,  [form,  finite]  has  to  appear 
among  the  values  for  the  reentrant  value  of  the  paths  <xO  head>  and  <x1 
head>.  Notice  also  that  the  feature  structure  for  NP  ,  the  leftmost  daughter  of  S, 
is  taken  to  be  reentrant  with  the  first  element  on  the  subcategorization  list  of  the 
verb  phrase. 2  Hence,  [cat  np]  has  to  be  among  the  values  on  the  reentrancy  list 
for  the  reentrant  value  _139.  The  list  of  values  associated  with  the  variable 
_139  exemplifies  a  further  property  of  reentrant  values,  namely  that  reentrant 
values  can  be  reentrant  themselves.  For  example,  the  reentrant  value  _139  has 
among  its  values  a  FS  whose  values  for  <sem  trans>  and  <fun_arg  trans>  are 
reentrant  appropriate  values  in  the  feature  structures  for  the  categories  S  and 
VP  ,  respectively. 

One  condition  that  we  have  not  imposed  is  that  our  DAGs  have  to  be 
acyclic.  Thus,  it  is  easy  to  represent  e.g.  a  category  that  subcategorizes  for 
itself:  [[[xO  P]],[P, [[subcat, P]]].  While  technically  such  structures  are  directed 
graphs  rather  than  DAGs  we  have  continued  to  use  the  term  DAG  mainly 
because  we  have  not  yet  found  any  use  for  cyclic  structures  for  representing 
linguistic  information.  Still  these  cyclic  structures  are  handled  by  the  DAG 
unifier  embedded  in  UNICORN  without  modification.    Thus,  if  it  turns  out  that 
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there  is  linguistic  motivation  for  employing  cyclic  structures,  UNICORN  will  be 
able  to  accommodate  such  structures. 


2.2.  The  DAG  Unification  Package 

The  DAG  unifier  embedded  in  UNICORN  has  as  its  top-level  predicate  a 
three-place  relation  d_union{Dag_1,Dag_2,Dag_3)  which  holds  just  in  case 
Dag_3  is  the  most  general  DAG  structure  which  is  subsumed  by  both  Dag_1 
and  Dag_2,  i.e.  just  in  case  Dag_3  is  the  unification  of  Dag_1  and  Dag_2.  (For 
a  detailed  discussion  of  the  subsumption  ordering  among  DAG's  see  section 
3.4.3  below). 

Each  argument  of  d_union  is  of  the  form  [FS,R_List].  Figure  3  shows  the 
example  of  unifying  the  DAG  structure  associated  with  the  S  ->  NP  VP  rule 
in  (5)  with  the  DAG  structure  associated  with  the  noun  phrase  Tom  Sawyer  in 
the  grammar  of  Appendix  A. 

(5)       d_union( [ [ [xl,     [[maj,np],  %%%   FS_1 

[head, [ [agreement, [ [gender, masculine] , 
[person, third] , 
[num, singular] ] ] ] ] , 
[sem, [ [parameter, torn] , 
[trans, _359] , 
[fun_arg, [ [trans, _359] , 
[gap,_377]]]]], 
[gap,_377]]]], 
[ [_377, [] ] ,  %%%    RList_l 

[_359,  []]]], 

[ [ [xO, [ [maj,s],  %%%  FS_2 

[head,_636], 

[sem, [ [trans, _354] ]], 

[gap,_730]]], 
[xl,_767], 
[x2,  [  [maj,  vp]  , 

[head,_636], 

[sem,  [ [trans, _354] ] ]  , 

[gap,_730], 

[subcat, [ [first, _767] , 

[rest, end] ]]]]], 

[ [_767, [ [maj,np],  %%%RList_2 

[sem, [ [trans, _354] , 

[fun_arg, [ [trans, _354] ]]]]]], 
[_636, [ [form, finite] ] ], 
[  730,  []],  [  354,  []],  [  354,  []]]], 
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[ [  [xO,  [ [maj,s]  ,  %%%  FS_3 

[head,_63  6], 

[sem, [ [trans, _354] ] ] , 

[gap,_730]]], 
[xl,_767] , 
[x2,  [ [maj, vp] , 

[head,_636]  , 

[sem, [ [trans, _354] ] ] , 

[gap,_730], 

[subcat, [ [first, _767] , 

[rest,end]]] ]]], 

[ [_767, [ [maj,np] ,  %%%  RList_3 

[head, [ [agreement, [ [gender, masculine] , 
[person, third] , 
[num, singular] ] ] ] ] , 
[sem, [ [parameter, torn] , 
[trans, _354] , 
[fun_arg,  [ [trans, _354 ] , 

[gap,_450]]]]], 
[gap,_450]]], 
[_636, [ [form, finite] ]], [_730, [] ], [_450, []], 
[_354,  []]]]). 

As  mentioned  in  section  2.1,  the  DAG  unifier  is  used  in  the  UNICORN 
system  by  both  the  parser  and  the  grammar  interpreter.  In  the  parsing 
component  the  DAG  unifier  is  used  in  an  extended  version  of  Barley's  algorithm 
presented  by  Shieber  (1985).  However,  the  predicate  d_union  could  equally 
well  be  embedded  in  alternative  parsing  algorithms.  For  that  matter,  it  could  be 
used  quite  independently  of  the  application  considered  in  this  paper  in  any 
domain  which  requires  a  unification  algorithm  for  DAG  structures. 


2.3.  The  Grammar  Interpreter 

Even  though  the  parser  itself  operates  on  the  type  of  DAG  structures 
described  in  the  previous  section,  users  of  UNICORN  are  not  required  to 
translate  application  grammars  into  such  DAG  representations  themselves. 
Instead,  users  can  write  PATR-II  style  grammars  that  take  the  form  of  rewrite 
rules  that  are  annotated  by  path  equations.  The  task  of  translating  the 
annotated  rewrite  rule  into  the  representation  required  by  the  parser  is  one  of 
the  functions  performed  by  the  grammar  interpretation  component.  The 
grammar  interpreter  also  provides  necessary  initializations  required  by  the 
parser.  These  initializations  include  the  definition  of  a  default  restrictor  for  DAG 
structures  (see  section  3.4.2),  which  is  used  by  the  predictor  step  in  the  Earley 
algorithm,  and  the  definition  of  the  default  start  symbol  for  the  parser. 
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While  users  are  not  required  to  encode  grammars  in  the  syntax  of  DAG 
structures,  it  is  necessary  to  write  input  grammars  that  conform  to  the  syntax  of 
grammar  rules  expected  by  the  grammar  reader.  The  syntax  for  grammar  rules 
corresponds  very  closely  to  the  rule  format  used  in  PATR-II  style  grammars.  A 
sample  grammar  that  conforms  to  this  format  is  given  in  Appendix  A  of  this 
paper.  For  further  examples  of  rewrite  ruled  annotated  by  path  equations  see 
Karttunen  1986  and  Shieber  1986. 

Each  syntax  rule  consists  of  two  parts:  (1.)  a  rule  skeleton  that  specifies 
which  terminal  and  non-terminal  symbols  are  involved  in  the  rule,  and  (2.)  a  set 
of  equations  which  provide  further  specifications  of  the  symbols  appearing  in 
the  rule  skeleton.  We  can  further  distinguish  rules  that  expand  pre-terminal 
nodes  to  terminal  strings  from  rules  that  expand  non-terminals  to  one  or  more 
non-terminals.  We  will  subsequently  refer  to  the  former  type  as  terminal  rules 
and  to  the  latter  as  non-terminal  mles. 

The  two  types  of  rules  are  primarily  distinguishable  by  the  different  types  of 
rule  skeletons  they  employ.  In  the  case  of  terminal  rules  which  specify  the 
lexical  information  associated  with  a  particular  lexical  item,  the  rule  skeleton  is 
of  the  form: 

Word  <String>: 

where  <String>  is  identical  to  the  orthographic  representation  of  the  word(s)  in 
question.  In  the  case  of  simple  lexical  items,  <String>  will  consist  of  a  single 
word.  However,  as  will  be  discussed  shortly,  <String>  can  also  stand  for  the 
empty  string  (in  the  case  of  traces),  or  consist  of  two  or  more  words  (in  the  case 
of  noun  compounds,  multi-word  names,  idioms,  or  any  group  of  words  that  are 
considered  a  lexical  unit).  (4)  gives  an  excerpt  of  the  terminal  rule  given  for  the 
word  Tom  Sawyer  in  the  grammar  of  Appendix  A.  The  path  equations  specify 
Tom  Sawyer  as  an  NP  whose  agreement  features,  which  are  taken  to  be  head 
features,  include  the  values  masculine,  third,  and  singular  for  the  features 
gender,  person,  and  number. 

(6)   Word  torn  sawyer: 
<maj>  =  np 

<head  agreement  gender>  =  masculine 
<head  agreement  person>  =  third 
<head  agreement  number>  =  singular 

in  the  case  of  non-terminal  rules,  the  skeleton  is  of  the  form 
<Maj_0>  ->  <Maj_1>  ...  <Mai_n>: 
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where  each  symbol  stands  for  one  non-terminal  symbol  .  The  S  ->  NP  ,  VP 
rule  of  the  sample  grammar  in  Appendix  A,  whose  DAG  structure  we  introduced 
in  the  previous  section  can  thus  be  specified  as  in  (7). 

(7)  s  ->  np  vp: 

<xO  head>  =  <x2  head> 

<xO  head  form>  =  finite 

<xO  sem  trans>  =  <xl  sem  trans> 

<xl  sem  fun_arg  trans>  =  <x2  sem  trans> 

<xO  gap>  =  <x2  gap> 

<x2  subcat  first>  =  <xl> 

<x2  subcat  rest>  =  end. 

The  syntax  rules  in  (6)  and  (7)  also  serve  to  illustrate  the  syntax  of  the  set 
of  path  equations  that  can  accompany  rule  skeletons  in  input  grammars 
accepted  by  the  UNICORN  grammar  interpreter.  Notice  that  terminal  and  non- 
terminal rules  differ  in  the  syntax  used  for  specifying  paths.  In  non-terminal 
rules,  each  path  originates  with  a  label  xO,  x1 ,  x2,  etc.  The  number  on  the  label 
corresponds  to  the  left-to-right  order  of  categories  used  in  the  rule  skeleton. 
Thus,  xO  always  references  the  category  on  the  left-hand  side  of  the  rule 
skeleton.  x1 ,  x2,  ...  xn  reference  categories  on  the  right-hand  side  in  left-to-right 
order.  For  terminal  rules,  however,  such  referencing  is  not  necessary,  since 
they  involve  a  single  (maj)  category.  Thus,  paths  can  originate  with  any 
attribute  label  relevant  to  the  lexical  item  in  question. 

Notice  also  that  in  the  terminal  rule  in  (6)  it  happens  that  the  right-hand 
side  of  each  equation  is  marked  by  an  atomic  feature  value,  while  the  left-hand 
side  is  marked  by  a  path  description  which  is  delineated  by  angled  brackets.  By 
contrast,  in  (7)  each  equation  happens  to  involve  the  identity  of  two  path 
descriptions.  Schematically,  we  can  state  the  two  possibilities  for  specifying 
equations  as  in  (8a)  and  (8b). 

(8)  a.   <PATH>  =  VALUE.     b.   <PATH1>  =  <PATH2> 

The  second  type  of  equation  always  gives  rise  to  reentrant  DAG  structures. 
In  this  case  the  grammar  interpreter  will  create  a  FS  (feature  structure)  in  which 
each  of  the  two  paths  receives  an  index  (indicated  by  a  Prolog  variable)  as  a 
value.  This  index  also  appears  as  a  member  in  the  reentrancy  list  that 
accompanies  the  FS.  Thus,  all  indices  that  appear  in  the  internal 
representation  of  terminal  or  non-terminal  rules  are  ultimately  derived  from  path 
equations  of  the  second  type. 

The  rule  skeleton  of  the  non-terminal  rule  that  we  have  discussed  so  far 
contained  only  names  of  individual  non-terminal  symbols.  However,  this  is  not 
the  only  option  for  specifying  non-terminal  rules.    A  second  possibility  that  is 
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supported  by  the  UNICORN  grammar  reader  is  the  use  of  schematic  categories 
as  in  (9). 

(9)  vp  ->  vp  x2  : 

<xO  head>  =  <xl  head> 

<xO  sem  trans>  =  <x2  sem  trans> 

<xl  sem  trans>  =  <x2  sem  fun_arg  trans> 

<x2  sem  fun_arg  gap>  =  <xl  gap> 

<xO  gap>  =  <x2  gap> 

<xl  subcat  first>  =  <x2> 

<xO  subcat>  =  <xl  subcat  rest>. 

(9)  specifies  the  right-most  daughter  of  VP  as  any  category  x2  which 
matches  the  first  element  on  the  subcategorization  list  of  its  sister  VP  .  Thus  x2 
can  be  taken  to  be  a  variable  over  any  category  that  satisfies  this  constraint. 

Note  that  even  non-schematic  categories  such  as  S,  NP  or  VP  can  be 
expressed  as  schematic  categories,  namely  as  schematic  categories  whose 
maj  feature  is  specified.  The  rule  in  (9),  for  example,  could  also  be  specified  as 
in  (10). 

(10)  xO  ->  xl  x2  : 
<xO  maj>  =  vp 
<xl  maj>  =  vp 

<xO  head>  =  <xl  head> 

<xO  sem  trans>  =  <x2  sem  trans> 

<xl  sem  trans>  =  <x2  sem  fun_arg  trans> 

<x2  sem  fun_arg  gap>  =  <xl  gap> 

<xO  gap>  =  <x2  gap> 

<xl  subcat  first>  =  <x2> 

<xO  subcat>  =  <xl  subcat  rest>. 

In  fact,  the  grammar  interpreter  always  translates  the  rules  with  non- 
schematic  category  names  into  completely  schematic  rule  types,  thereby 
achieving  a  uniform  representation  for  non-schematic  and  schematic 
categories. 

In  the  grammar  of  Appendix  A,  rule  (9)  applies  twice  in  order  to  combine  a 
ditransitive  verb  such  as  give  with  its  two  non-subject  NPs  in  order  to  form  a 
verb  phrase  such  as  give  Tom  Sawyer  a  toad.  For  transitive  verbs  such  as 
paint,  rule  (9)  applies  once  in  order  to  produce  verb  phrases  such  as  paint 
every  fence.  Due  to  rule  (9),  the  grammar  of  Appendix  A,  therefore,  generates 
left-recursive  verb  phrases. 

Let  us  conclude  the  discussion  of  the  grammar  interpreter  by  mentioning 
some  useful  predicates  that  users  can  invoke.   The  most  important  one  is  the 
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predicate  process_grammar  which  takes  as  its  only  argument  the  file  name 
containing  an  input  grammar  such  as  the  one  specified  in  Appendix  A.  When 
the  grammar  is  processed,  all  terminal  and  non-terminal  rules  are  added  to  the 
Prolog  database.  The  internal  syntax  of  these  rules  takes  the  form  of  an 
ordered  pair  in  the  case  of  a  terminal  rule  and  the  form  of  a  triple  for  non- 
terminal rules. 

(11)  Internal  Rule  Syntax: 

a.  Terminal  Rules:       <String_Length, String, DAG> 

b.  Non-terminal  Rules:   <RHS_of_Rule, DAG> 

For  non-terminal  rules  the  first  element  specifies  the  right-hand  side  of  the 
rule  skeleton,  and  the  second  element  represents  a  DAG  structure  for  the 
information  specified  by  the  path  equations  that  accompany  the  rule  skeleton. 
For  terminal  rules,  the  string  expanded  by  the  preterminal  category  is  specified 
together  with  the  length  of  the  string  (number  of  words),  and  the  DAG  structure 

Users  can  query  which  rules  have  been  added  to  the  database  by  invoking 
the  predicates  showJex_rule/1  and  show_rule/1  for  terminal  and  non-terminal 
rules,  respectively.  showJex_rule  and  show_rule  provide  pretty-printed  copies 
of  the  internal  representation  of  the  two  types  of  rules.  (12)  and  (13)  exemplify 
the  use  of  these  two  predicates  and  show  the  internal  representations  for  the 
lexical  item  Tom  Sawyer  and  for  the  syntactic  rule  S  ->  NP    VP  . 

(12)  I  ?-   show_rule {Right_Hand_Side) . 
< 

[xO  :   [ma  j  :   s] 

[head:   _143] 

[sem:   [trans:   _155]] 

[gap:   _161]] 
[xl:   _167] 
[x2  :   [ma  j  :   vp] 

[head:   _143] 

[sem:   [trans:   _197]] 

[gap:   _161] 

[subcat:   [first:   _167] 
[rest :   end] ] ] 


[_167 :        [ma j :       np] 

[sem:        [trans:      _155] 

[fun_arg:        [trans:      _197]]]] 
[_143:        [form:       finite] ] 
[_161:        []]     [_197:        []]     [_155:        []] 

Right    Hand    Side    =     [xl,x2] 
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(13)  I  ?-   show_lex_rule(X)  . 

< 

[maj:   np] 

[head:   [agreement:   [gender:   masculine] 
[person:   third] 
[number:   singular]] 
[sem:   [parameter:   tom] 
[trans:   _165] 
[fun_arg:   [trans:   _165] 
[gap:   _183]]] 
[gap:   _183] 


[_183:   [] 
[  165:   [] 


X   =    [tom, sawyer  I     110]; 


Notice  that  the  string  Tom  Sawyer  in  (13)  is  not  simply  encoded  as  an 
atom,  but  rather  as  a  list  whose  tail  is  marked  by  a  variable.  The  reasons  for  this 
list  representation  are  mostly  internal  to  the  mechanism  in  the  parsing 
component  that  is  responsible  for  scanning  the  input  string.  The  upshot  is  that 
we  allow  for  terminal  symbols  with  variable  string  length.  This  is  what  makes  it 
possible  to  treat  noun  compounding  and  traces  along  with  simple  lexical  items 
in  UNICORN.  We  allow  for  the  possibility  that  a  preterminal  can  either  expand 
the  empty  string  (i.e.  a  trace)  or  that  it  can  expand  a  string  that  can  consists  of 
one  or  more  words.  The  terminal  rule  for  the  two-word  noun  a  Tom  Sawyer 
and  the  terminal  rule  for  NP  trace  that  are  included  in  the  grammar  in  Appendix 
A  are  typical  examples  of  pre-terminals  that  expand  to  something  other  than  a 
single  string.  To  achieve  maximal  generality  we  place  no  upper  bound  on  the 
number  of  words  that  can  appear  in  a  compound  lexical  item. 


2.4    The  Parser 

The  parser  used  in  the  UNICORN  system  is  based  on  Shieber's  extension 
of  Barley's  algorithm  for  context-free  grammars  to  grammars  whose  categories 
are  not  atomic  symbols,  but  are  made  up  of  complex  feature  structures.  For  a 
detailed  technical  discussion  of  Barley's  algorithm  and  Shieber's  extension  we 
refer  the  reader  to  Barley  (1970)  and  Shieber  (1985).  Technical  details  of  our 
implementation  of  Shieber's  algorithm  will  be  presented  in  section  3.3  of  this 
paper.  In  this  section  we  will  concentrate  on  those  aspects  of  the  parsing 
component  that  will  allow  users  to  parse  sentences  and  obtain  logical  forms  for 
the  sentences  parsed  without  having  to  concern  themselves  with  details  of  the 
algorithm  itself. 
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The  top-level  predicate  for  the  parser  is  parse(StringList),  which  returns 
for  any  grammatical  string  a  pretty-printed  copy  of  the  category  that  generates 
the  string  in  question.  (14)  shows  the  successful  parse  of  Tom  Sawyer  painted 
every  fence,  using  the  grammar  in  Appendix  A. 

(14)  ?-   parse ( [torn, sawyer, painted, every, fence] ) . 

< 

[maj:   s] 

[head:   [form:   finite]] 

[sem:   [trans:   [quant:   every] 

[range:   [pred:   fence] 

[argl:   _23170]] 
[scope:   [pred:   paint] 
[argl:   torn] 
[arg2:   _23170]]]] 
[gap:   minus] 
> 
< 

[_23170:   []] 


(14)  shows  a  pretty-printed  version  of  the  sentence  node  that  is 
instantiated  for  the  input  string  in  question.  Notice,  however,  that  UNICORN 
does  not  return  a  complete  parse  tree.  We  made  this  move  intentionally  and  do 
not  consider  it  an  undesirable  omission.  The  point  is  that  users  can  easily  add 
such  a  facility  by  adding  a  feature  called  parse  to  the  category  on  the  left-hand 
side  of  each  rule.  This  feature  would  take  as  a  value  the  DAG  structures  for 
each  of  its  daughter  constituents.  Such  a  technique  would  be  similar  to  the  way 
parse  trees  can  be  constructed  during  execution  in  definite  clause  grammars 
(see  Pereira  and  Shieber  1987  for  more  details). 

As  an  aid  for  those  users  who  would  like  to  familiarize  themselves  with  the 
internal  processing  of  the  parser  and  to  those  users  who  want  to  debug  their 
own  input  grammars,  the  UNICORN  parser  can  be  run  in  two  modes  which  we 
call  "on"  and  "off"  mode.  In  "off"  mode  the  parser  will  simply  return  the 
instantiated  category  that  generates  the  input  string  in  question.  "On"  mode,  on 
the  other  hand,  is  an  interactive  mode  which  informs  the  user  at  each  point 
which  action  has  been  performed  by  the  parser.  Since  we  will  explain  much  of 
our  implementation  of  the  Earley/Shieber  parsing  algorithm,  we  will  only  give  a 
brief  preview  at  this  point. 

The  parser  performs  the  three  main  operations  of  Earley's  algorithm: 
prediction,  scanning,  and  completion.  In  the  "on"  mode  for  stepping  through  the 
parse  for  a  given  input  string,  the  user  is  informed  at  each  point  what  part  of  the 
input  string  is  being  processed  and  which  of  the  two  operations  is  being 
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performed.    See  section  3.3  for  more  discussion  of  our  implementation  of  the 
parser  and  on/off  mode  facility. 

Another  feature  that  the  UNICORN  parsing  package  makes  available  is  the 
option  of  modifying  the  start  symbol  that  the  parser  uses  as  its  initial  state.  As  a 
default,  the  parser  will  take  the  start  symbol  to  be  a  category  whose  only 
attribute  is  the  feature  maj  and  which  has  S  as  the  value  for  that  attribute.  Thus, 
as  a  default  the  parser  will  parse  input  strings  that  are  sentences.  However,  by 
invoking  the  predicate  change_root_category,  the  user  is  free  to  override  this 
default  and  invoke  some  other  DAG  as  the  start  symbol.  For  example,  to  parse 
noun  phrases,  users  can  invoke  'change_root_category([[[maj,np]], []])'.  To  get 
all  parses  for  a  string  of  words,  the  user  can  specify  the  start  DAG  as  the 
completely  unspecified  DAG  (i.e.  [[],[]]). 

However,  in  certain  circumstances  users  may  find  it  useful  to  specify  start 
symbols  that  are  more  specific  than  merely  S  or  NP  .  For  example,  in  a 
grammar  that  allows  for  NP  gaps,  as  the  grammar  of  Appendix  A  does,  the 
strings  in  (15a)  and  (15b)  would  both  be  parsed  as  sentence  nodes. 

(14)  a.  Tom  Sawyer  gave  every  boy  a  brush, 
b.  Tom  Sawyer  gave  every  boy. 

However,  sentence  (15b)  should  be  classified  as  a  sentence  which 
contains  on  NP  gap.  In  order  to  exclude  sentences  such  as  (15b),  whose  gap 
remains  unfilled,  from  being  parsed  successfully,  the  user  can  specify  the  start 
symbol  as  in  (15). 

(15)  change_root_category ( [ [ [maj, s] ,  [sem,  [ [index, minus] ] ] ] ,  [] ] ) 

Specifying  the  value  minus  for  the  index  attribute  that  is  embedded  under 
the  semantics  feature  is  the  way  the  grammar  of  Appendix  A  encodes  the 
absence  of  a  gap. 


3.  Implementation  Details  of  Unicorn 

The  Unicorn  system  has  three  major  parts:  A  DAG  unifier,  a  grammar 
interpreter,  and  an  Earley  style  parser.  The  DAG  unifier  plays  a  central  role 
since  it  is  used  by  both  the  grammar  reader  and  the  parser.  We  will  accordingly 
begin  our  discussion  of  implementation  details  with  the  DAG  unifier  in  section 
3.1 .  A  full  listing  of  this  DAG  unifier  is  provided  in  Appendix  B.  In  section  3.2  we 
will  discuss  the  grammar  interpreter,  including  how  the  DAG  unifier  is  used  in 
translating  path  equations  into  DAGs.  Then  in  section  3.3  we  will  discuss  how 
the  Earley  parser  is  extended  to  work  with  complex  feature  structures.  Since 
this  extension  is  already  discussed  in  Shieber  (1985),  we  will  not  say  much 
about  the  parser  itself,  rather  we  will  concentrate  on  describing  the  additional 
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Operations  on  DAGs  that  are  required  for  this  extension.  In  section  3.3.1  we 
describe  the  procedure  'path_var  which  takes  a  path  and  a  DAG  as  arguments 
and  returns  the  value  in  the  DAG  at  the  end  of  the  path.  While  this  function  is  not 
conceptually  difficult,  certain  problems  arise  with  reentrancies  that  make  it 
trickier  than  expected.  Next,  in  section  3.3.2  we  will  discuss  the  concept  of 
restriction  and  describe  our  own  procedure  which  does  a  modified  form  of 
restriction.  Then,  in  section  3.3.3  we  will  explain  our  procedure  for  checking  the 
subsumption  relation  between  DAGs.  Full  listings  for  all  of  these  procedures 
can  be  found  in  Appendix  B. 


3.1   The  DAG  Unifier 

The  DAG  unifier  (Appendix  B)  takes  two  DAG  structures  as  arguments  and 
returns  their  unification  as  output  (i.e.  d_union{DAG1,DAG2,OutputDAG).3  The 
definition  for  unification  is  as  in  Shieber  (1986).  The  reader  familiar  with  term 
unification  as  in  Prolog  and  Definite  Clause  Grammars  should  be  aware  that 
graph  unification  is  rather  different  kind  of  unification  than  term  unification. 
Term  unification  basically  involves  substitution  for  variables  in  two  terms  with 
identical  arity  (i.e.  identical  number  of  arguments).  In  graph  unification,  on  the 
other  hand,  the  idea  is  to  find  the  most  general  graph  structure  that  is  subsumed 
by  the  two  input  graphs.  The  arity  of  the  unified  graph  may  be  greater  than  the 
arity  of  either  input  graph.  For  example,  in  (16)  both  input  graphs  are  of  arity  two 
but  the  output  graph  is  of  arity  three.  A  second  difference  is  that  reentrancies 
can  have  values  in  graph  unification  so  when  two  variables  marking 
reentrancies  are  unified  the  unifier  must  find  their  values  on  the  reentrancy  list 
and  then  unify  these  values. 

(16)  d_union ([[ [maj,np] , [form, finite] ],[]] ,  %    input 

[[ [form, finite] , [subcat, end] ],[]] ,  %    input 

[[ [ma j,np] , [form, finite] , [subcat,end] ],[]]) .  %   output 

The  first  step  in  unifying  two  DAGs  is  to  append  their  reentrancy  lists 
together.  Unless  some  of  the  feature  stnjctures  on  this  combined  reentrancy  list 
are  unified  with  other  feature  structures,  this  combined  reentrancy  list  will 
become  the  reentrancy  list  of  the  output  DAG.  In  general, however,  this 
combined  reentrancy  list  will  be  successively  modified  as  the  parts  of  DAG1  are 
unified  with  the  parts  of  DAG2. 

After  appending  the  reentrancy  lists,  the  auxiliary  function,  'aux_d_union', 
is  called.  Aux_d_union  has  three  base  cases  and  two  recursive  clauses.  The 
base  cases  unify  []  with  anything  and  unify  any  atom  with  an  identical  atom. 

When  the  base  cases  fail  to  apply,  it  must  be  the  case  that  DAG1  is  a  list 
beginning  with  some  feature  pair  [f,v],  where  v  is  a  feature  structure  or  Prolog 
variable.    Two  cases  need  to  be  considered  here.  Either  some  [f,v']  does  or 
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does  not  occur  in  DAG2.  If  it  does  not,  we  simply  add  [f,v]  to  the  output  DAG  and 
recursively  unify  the  rest  of  DAG1  with  DAG2  to  get  the  rest  of  the  output  DAG. 

If  some  [f,v']  does  occur  in  DAG2  we  remove  this  pair  from  DAG2  and,  as 
above,  recursively  unify  the  rests  of  DAG1  and  DAG2.  Then  we  will  need  to  find 
the  values  of  v  and  v'  and  unify  these  values.  The  complication  of  finding 
values  occurs  because  of  the  fact  that  values  can  be  reentrant,  and  these 
reentrancies  can  themselves  have  values  as  the  example  in  (18)  illustrates. 

(17)  [[  [form,  finite]  ],  []] 

(18)  [ [ [form,X], [head, [ [form,X] ] ] ], [X, finite] ] 

In  (17)  the  feature  'form'  has  the  value  'finite'.  In  (18),  'form'  is  followed  by 
the  variable  'X',  and  the  value  'finite'  is  found  on  the  reentrancy  list.  For  ease  of 
reference,  then,  we  define  a  function  'value'  such  that  value(X)  =  finite  in  (18) 
and,  for  consistency,  value(finite)  =  finite  in  (17).  In  general,  value(x)  =  x  when  x 
is  a  feature  structure  and,  when  x  is  a  Prolog  variable,  value(x)  =  the  (unique) 
feature  structure  fs  such  that  [x.fs]  occurs  in  the  reentrancy  list. 

The  procedure  responsible  for  finding  and  then  unifying  the  values  of  v 
and  V'  is  called  'match_values'.  The  first  clause  of  match_values  catches  the 
case  where  value(v)  and  value(v')  have  already  been  unified.  If  it  is  the  case 
that  V  and  v'  are  identical  variables  (i.e.  v  ==  v')  then  it  must  be  the  case  that 
their  values  have  already  been  unified,  so  we  do  nothing  in  this  case.  The 
second  clause  handles  the  case  where  v  and  v'  are  different  variables.  In  this 
case,  we  remove  their  values  from  the  reentrancy  list  and  unify  these  values.  In 
addition  we  (destructively--see  below)  unify  the  variables  v  and  v'.  Then  [f,v]  is 
added  to  the  output  feature  structure  and  [v,value(v)  U  value(v')]  is  added  to  the 
reentrancy  list  (where  U  is  the  operator  used  to  indicate  unification). 

The  third  clause  handles  the  case  where  v  is  a  variable  and  v*  is  a  feature 
structure.  In  this  case  the  value  for  v  is  removed  from  the  reentrancy  list  and 
unified  with  v'.  Then  [v,val(v)  U  v']  is  put  back  on  the  reentrancy  list  and  [f,v]  is 
added  to  the  output  feature  structure.  The  fourth  clause  is  the  mirror  image  of 
the  third  clause.  In  this  case  v  is  a  feature  structure  and  v"  is  a  variable.  The 
fifth,  and  final,  clause  of  match  values  handles  the  case  where  both  v  and  v'  are 
feature  structures.  In  this  case,  v  and  v'  are  unified  and  then  [f,v  U  v']  is  added 
to  the  output  feature  structure. 

Like  other  graph  unification  algorithms  mentioned  in  the  literature,  this 
algorithm  has  the  property  that  the  input  DAGs  can  be  destructively  modified  in 
the  process  of  unifying  them.  Specifically,  clause  two  of  'match_values' 
destructively  unifies  variables  marking  reentrancies.  Consequently,  the  use  of 
'd_union'  requires  a  certain  amount  of  caution.  Previous  discussions  of  this 
problem  have  focused  on  efficient  copying  procedures  and  methods  of  structure 
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sharing  (see  Shieber,  Pereira,  Karttunen,  and  Kay  1986).  While  we  have 
nothing  to  contribute  to  this  rather  technical  literature,  we  have  found  some 
practical  methods  of  efficiently  copying  in  Prolog.  The  most  straight-forward 
way  is  to  use  a  procedure  such  as  that  in  (19)  to  explicitly  copy  DAGs  before 
using  them  as  input  in  'd_union'.  Somewhat  more  efficiently,  one  can  embed 
calls  of  'd_union'  within  'findall',  which  has  the  property  of  suppressing 
destructive  changes."^  For  example  one  could  use  a  procedure  like  the 
simplified  procedure  in  (20)  to  do  prediction  in  the  Barley  parser.  One  further 
way  of  using  these  copying  procedures  would  be  to  embed  the  copying  into  the 
'd_union'  procedure  itself.  The  easiest  way  to  do  this  would  be  to  delay  the 
destructive  variable  binding  in  clause  two  of  match_values  by  keeping  a 
binding  list.  Then  one  could  get  as  output  a  DAG  and  a  binding  list  which  could 
be  copied  at  the  last  minute  just  before  the  variables  in  the  binding  list  are 
destructively  unified.  Then  in  the  case  of  unification  failure,  no  copying  would 
be  done  at  all. 

(19)  copy (X, Y) :-  f indall(Z, Z=X, [Y] ) . 

(20)  findall (PredictedDag, 

(right_of_dot (DAGl)  , 

root_dag_from_rule_in_database (DAG2) , 
d_union (DAGl, DAG2, PredictedDag) ) , 

Predictions) . 

One  more  subtle  method  of  copying  is  to  use  the  built-in  copying  implicit  in 
Prolog.  It  is  an  important  design  feature  of  Prolog  that  clauses  in  the  database 
cannot  be  modified  except  by  explicit  uses  of  'assert'  or  'retract'.  In  particular 
one  cannot  query  the  database  for  some  lexical  entry  and  then  destructively 
modify  the  database  by  unifying  that  lexical  entry  with  some  other  DAG.  In 
effect,  every  time  the  database  is  queried  a  fresh  copy  is  produced,  leaving  the 
original  clauses  for  lexical  entries  and  phrasal  rules  unaffected. 


3.2  The  Grammar  Interpreter 

The  grammar  interpreter  reads  rules  in  the  form  shown  in  (21)  and 
translates  them  into  the  form  shown  in  (22).  The  form  in  (22)  is  a  two  member 
list.  The  first  member  is  [x1  ,x2]  which  represents  the  right-hand  side  of  the  rule 
and  the  second  member  is  a  DAG  in  list  representation  (using  Prolog  internal 
variables). 

(21)  vp  ->  vp  x2 :  %%  phrase  structure  rule 

<xl  subcat  first>  =  <x2>     %%  path  equations 
<xO  head>  =  <xl  head> 
<xl  subcat  rest>  =  <xO  subcat>. 
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(22)  [[xl,x2],  %%  right-hand  side  of  PS  rule 

[  [ [xO,  [ [maj,vp] ,  %%  DAG  for  PS  rule 

[head,_105], 
[subcat,_lll] ] ] , 
[xl,  [  [ma  j,  vp]  , 

[subcat, [ [first, _135] ,  [rest,_lll] ] ] , 
[head,_105] ] ], 
[x2,_135] ], 
[[_111,  []],  [_105,  []],  [_135,  []]]]] 

The  grammar  interpreter  consists  of  three  main  parts.  The  first  part 
processes  the  body  of  the  rule  (i.e.  vp  ->  vp  x2),  the  second  part  processes  the 
path  equations  and  the  third  part  creates  the  DAG  which  is  asserted  into  the 
database.  We  will  discuss  these  three  parts  in  order. 

The  first  part  (which  we  call  •process_body')  does  two  things.  First  it 
changes  the  form  of  the  right-hand  side  of  the  rule.  In  (21)  it  would  change  '  vp 
x2'  to  [x1,x2].  The  left-hand  side  is  thrown  away  since  it  is  always  known  to  be 
'xO'.  Then  •process_body'  creates  a  DAG  for  the  known  values  for  maj,  in  this 
case  [[xO,[[maj,vp]]],  [x1,[[maj,vp]]]].  The  maj  value  for  •x2'  is  unknown  but  its 
value  is  constrained  by  the  first  path  equation  '<x1  subcat  first>  =  <x2>'.  The 
DAG  obtained  from  the  rule  body  is  saved  in  order  to  be  unified  later  with  the 
DAGs  obtained  from  the  path  equations. 

The  second  part  (which  we  call  'process_constraints')  creates  a  list  of 
feature  structures  (one  for  each  path  equation)  and  it  creates  a  single 
reentrancy  list  for  all  of  the  variables  used  in  these  feature  structures.  There  are 
two  separate  kinds  of  path  equations  that  need  to  be  handled:  those  like  (23) 
with  a  path  on  either  side  of  the  equal  sign,  and  those  like  (24)  with  a  path  on 
the  left  side  and  a  value  on  the  right  side.^ 

(23)  <xO  head>  =  <x2  head> 

(24)  <xO  head  form>  =  finite 

In  processing  the  left  side  of  the  equation  in  (23),  'process_constraints'  will 
create  a  feature  structure  of  the  form  [[xO,[[head,X]]]].  The  variable  X  will  be 
recorded  and  used  again  in  the  feature  structure  structure  for  the  right  side  of 
the  equation.  Then  the  reentrancy  [X,[]]  will  be  added  to  the  reentrancy  list.  At 
this  point  none  of  the  reentrancies  will  have  values  other  than  []  since  it  is 
impossible  to  represent  any  more  specific  reentrant  value  in  a  single  path 
equation. 

In  processing  the  equation  in  (24)  the  left  side  is  processed  as  in  (23)  (i.e. 
forming  the  feature  structure  [[xO,[[head,[[form,Y]]]]]]).  Then  on  encountering  the 
atom  'finite',  the  variable  'Y'  will  be  unified  with  this  atom.  Nothing  will  be 
added  to  the  reentrancy  list  in  this  case. 
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The  third  part  of  'process_grammar'  (shown  in  (25))  takes  all  of  the  feature 
structures  obtained  from  •process_body'  and  'process_constraints'  along  with 
the  reentrancy  list  from  'process_constraints'  and  unifies  them  together  to  form 
one  DAG.  Since  'd_union'  will  only  unify  two  DAGs  at  a  time,  this  procedure 
recursively  goes  through  the  list  of  feature  structures  and  unifies  in  just  one  at  a 
time. 

(25)  optimize_dag(  []  ,R_List,  [],R_List)  . 

optimize_dag ( [FirstFS IRestFSs] , R_List, OptFS, OptR_List)  :- 
optimize_dag (RestFSs, R_List , OptRestFSs, OptR_Listl) , 
d_union ( [FirstFS, [ ] ] , 

[OptRestFSs, OptR_Listl] , 

[OptFS, OptR_List ] ) . 

While  this  procedure  looks  straightforward,  it  actually  uses  'd_union'  in  a 
somewhat  unusual  manner.  Normally  the  syntax  of  'd_union'  is 
d_union(lnputDAGl,lnputDAG2,  OutputDAG),  but  in  this  case  the  first  and 
second  arguments  are  not  proper  DAGs,  since  all  of  the  reentrancies  are 
recorded  in  the  second  DAG.  Thus  even  though  "FirstFS"  is  a  feature  structure 
that  may  contain  variables  which  should  indicate  reentrancies,  it  is  used  here 
with  a  null  reentrancy  list.  This  trick  works  anyway  since  the  first  step  in 
'd_union'  is  to  append  together  the  reentrancy  lists  of  the  two  input  DAGs. 

The  processing  of  lexical  entries  is  quite  similar.  The  lexical  entry  in  (26)  is 
processed  into  the  form  in  (27)  and  asserted  into  the  database  as  a  value  of 
•|ex_rule(X)'. 

(26)  Word   torn   sawyer: 

<cat>  =  np 

<head  agreement  gender>  =  masculine 
<head  agreement  person>  =  third 
<head  agreement  number>  =  singular. 

(27)  [2, 

[torn, sawyer |_92 ] , 
[[[cat,np], 

[head, [ [agreement, [ [gender, masculine] , 
[person, third] , 
[number, singular] ]]]]], 
[  ]  ]  ]  ; 

The  main  difference  is  that  'process_body'  creates  the  incomplete  list 
[tom,sawyer|_92]  and  it  also  returns  a  number  (in  this  case,  1)  which  indicates 
the  number  of  words  in  the  list.  In  this  way  one  can  have  lexical  items  of 
variable    length.        A    simple    common    noun    like     fence    would    be 
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[1  ,[fence|_X],DAG_for_fence]  and  a  trace  would  be  [O.X.DAGJorJrace].    The 
reason  for  creating  lexical  items  in  this  form  is  explained  in  the  next  section. 


3.3  The  Earley  Parser 

The  parser  is  pretty  much  as  described  in  Earley  (1970),  with  the 
modifications  indicated  in  Shieber  (1985).  Therefore,  in  this  paper  we  will 
describe  just  one  innovation  we  have  made  and  then  in  the  following  sections 
we  will  discuss  the  operations  on  DAGs  that  are  needed  to  make  Shieber's 
extension  of  Barley's  algorithm  work.  For  those  users  who  are  interested  on 
following  a  trace  of  the  parser's  operation  we  have  provided  a  trace  mode 
which  we  will  also  describe  below. 

The  main  innovation  we  have  made  is  in  the  handling  of  lexical  entries.  As 
described  in  Earley  (1970),  there  are  three  steps  in  processing  a  lexical  item. 
The  following  example  illustrates  these  three  steps.  Imagine  that  the  parser  has 
predicted  the  structure  shown  in  (28).  Furthermore  assume  that  the  rule  'A  ->  a' 
exists  in  the  grammar  and  that  the  next  item  in  the  input  is  'a'.  First,  the  predictor 
makes  the  prediction  in  (29).  Second,  the  scanner  notices  that  the  prediction 
matches  the  input  and  scans  the  'a'.  Finally  the  completer  updates  the 
prediction  as  shown  in  (30).6 


(28)  S  (29)         S  (30) 


While  this  method  works  fine  with  the  types  of  grammars  Earley  discusses, 
it  would  fail  miserably  with  a  natural  language  grammar.  To  see  this,  imagine 
that  the  preterminal  'A'  is  replaced  by  'N'  for  noun.  In  this  case  the  predictor 
would  have  to  predict  every  noun  in  the  lexicon  and  then  have  the  scanner  try  to 
scan  that  noun.  Obviously  this  would  take  too  much  time.  The  solution  to  this 
problem  is  apparent.  The  predictor  should  look  at  the  input  first  and  only 
consider  predicting  lexical  entries  that  match  the  input.  If  the  prediction  matches 
the  input  then  we  know  immediately  that  it  will  be  scannable  and  completable, 
so  all  three  steps  can  be  combined  into  one. 

The  question  that  remains  is  what  do  we  mean  by  'match  the  input.'  In 
natural  language,  terminals  can  be  of  variable  length.  Compounds  are  of 
length  greater  than  one  and  traces  are  of  length  zero.  It  is  clear  that  in  all  these 
cases  we  want  to  predict  a  lexical  item  when  that  lexical  item  is  a  prefix  of  the 
input.  Rather  than  define  a  Prolog  predicate  for  prefix,  however,  we  use  the 
built  in  pattern  matching  of  Prolog.    This  explains  why  in  the  preceding  section 
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we  created  a  lexical  entry  with  [tom,sawyer|_X]  rather  than  [torn, sawyer].  Thus 
rather  than  making  a  check  like:  prefix([tom, sawyer], Input),  we  can  directly  find 
the  lexical  entries  that  match  the  input  with  the  query:  lex_rule(Num, Input, DAG). 
In  order  to  make  this  system  work,  we  need  the  first  argument  of  lex_rule/3  to  tell 
us  the  length  of  the  lexical  item.  In  Barley's  presentation  of  the  algorithm,  the 
result  of  a  scan  is  always  put  into  the  next  state  set.  In  this  case,  however,  we 
need  the  variable  'Num'  to  tell  us  in  which  state  set  to  put  the  result  of  the  scan. 
If  'Num'  is  1 ,  we  put  the  result  in  the  next  state  set.  In  the  case  of  'tom  sawyer' 
'Num'  will  be  2  so  the  result  will  be  put  into  the  next  but  one  state  set.  If  'Num'  is 
0  (i.e.  for  a  trace)  then  the  result  is  put  into  the  current  state  set.  Thus  the  parser 
handles  traces  in  a  completely  general  manner.  Still,  of  course,  there  is  some 
cost  involved.  The  parser  will  attempt  to  predict  traces  everywhere  (presuming 
there  are  traces  in  the  grammar).  Furthermore,  the  parser  must  continue  to 
make  predictions  even  when  the  input  is  exhausted.  For  this  reason  it  may  be 
preferable  to  develop  attribute-value  grammars  that  don't  use  traces,  such  as 
the  version  of  HPSG  described  in  Pollard  (1988)  or  Cooper's  "Wholewheat 
Syntax"   1982.7 

Given  an  understanding  of  Barley's  algorithm  and  an  understanding  of  our 
modification  for  lexical  entries,  the  user  should  be  able  to  follow  the  trace  mode 
we  have  provided.  To  activate  the  trace  mode,  the  user  types  "on"  before 
parsing  a  sentence.  The  parser  then  prints  out  a  record  of  each  prediction, 
scan,  or  completion  that  is  made.  As  an  example,  suppose  we're  parsing  the 
sentence  '  Tom  Sawyer  smiles'  with  the  grammar  in  (31). 

(31)  s  ->  np  vp: 

<xO  head>  =  <x2  head> 

<x2  head  subject>  =  <xl  head>. 

vp  ->  v: 

<xO  head>  =  <xl  head>. 

Word  tom  sawyer: 

<maj>  =np 

<head  agreement  gender>  =  masculine 

<head  agreement  person>  =  third 

<head  agreement  number>  =  singular. 

Word  smiles : 

<maj>  =  V 

<head  form>  =  finite 

<head  subject  agreement  person>  =  third 

<head  subject  agreement  number>  =  singular. 

Starting  form  the  's  ->  np  vp'  prediction,  we  get  the  trace  shown  below  in 
(32).    As  shown,  the  tracer  first  writes  out  the  state  currently  being  processed 
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along  with  the  current  input.  The  state  contains  four  items  of  information.  First 
comes  the  'state  type'.  This  indicates  whether  the  state  is  the  result  of  prediction 
scan  or  completion.  It  is  important  to  have  this  piece  of  information  since  when 
new  states  are  added  to  the  queue,  new  predictions  (scans, completions)  must 
not  be  subsumed  by  previous  predictions  (scans,completions).  The  second 
piece  of  information  is  the  rule  which  licensed  the  prediction  being  worked  on. 
The  third  item  indicates  in  which  state  set  the  prediction  being  worked  on  was 
made.  And  the  fourth  item  shows  the  DAG  for  the  rule. 

After  showing  the  state  being  processed,  the  tracer  writes  out  what  scans, 
predictions  or  completions  are  made  from  that  state.  In  this  case,  a  scan  is 
made,  since  the  lexical  entry  for  Tom  Sawyer  matched  the  sub-DAG  for  'x1',  the 
item  "to  the  right  of  the  dot"  in  Barley's  sense.  Following  the  scan  state,  the 
tracer  writes  out  "enqueue"  to  indicate  that  the  new  state  was  put  onto  the 
queue.  Since  this  was  a  scan,  of  course,  this  means  that  it  was  enqueued  into 
the  next  state  set.  A  state  will  fail  to  be  enqueued  if  it  is  subsumed  by  a  state 
already  on  the  queue.  In  this  case,  nothing  had  already  been  put  onto  the  next 
state  set,  so  the  scan  was  necessarily  enqueued.  Next  the  tracer  writes  out 
"predict  none,"  which  indicates  that  no  predictions  were  made.  No  predictions 
could  be  made  with  the  grammar  in  (31)  because  there  are  no  phrasal 
expansions  for  NP  in  this  small  grammar.  Since  no  predictions  were  made,  the 
parser  is  now  done  with  the  first  state  set.  The  tracer,  therefore,  writes  out  "go  to 
next  state  set"  and  then  writes  the  new  state  to  be  processed,  which  completes 
the  cycle. 

(32)  looking  at : 

state  type:  prediction 
rule  =  [xO, [xl,x2]] 
predicted  in  state  set  #0 
< 

[xO:   [maj:   s] 

[head:   _394]] 
[xl :   [maj :   np] 

[head:   _412]] 
[x2  :   [ma  j  :   vp] 

[head:    394]] 


[_394:   [subject:   _412]] 

[_412:   []] 
> 

input=  [torn, sawyer, smiles] 
I  : 

scan 

state  type  scan 
rule  =  [xO,  [x2]  ] 


63 


predicted  in  state  set  #0 
< 

[xl :   [ma j :   np] 

[head:   _2709]] 
[xO  :   [ma  j  :   s] 

[head:   _2727]] 
[x2  :   [ma  j  :   vp] 

[head:   _2727]] 
> 
< 

[_2709:   [agreement:   [gender:   masculine] 
[person:   third] 
[number:   singular]]] 
[_2727:   [subject:   _2709] ] 
> 
enqueue 

predict   none 

go  to  next  state  set 

looking  at : 
state  type  scan 
rule  =  [xO,  [x2]] 
predicted  in  state  set  #0 
< 

[xl :   [ma j :   np] 

[head:   _2709]] 
[xO :   [ma j :   s] 

[head:   _2727]] 
[x2  :   [ma  j  :   vp] 

[head:   _2727]] 
> 
< 

[_2709:   [agreement:   [gender:   masculine] 
[person:   third] 
[number:   singular]]] 
[_2727:   [subject:   _2709] ] 
> 
input=  [smiles] 


3.4.1   Values  at  Ends  Of  Paths 

In  this  and  the  next  two  sections  we  discuss  some  operations  on  DAGs  that 
are  necessary  for  the  operation  of  the  parser.    We  include  these  procedures 


64 

here  primarily  because  they  illustrate  how  to  handle  some  of  the  intricacies  of 
the  DAG  representation  in  UNICORN.  The  first  procedure  which  we  call 
'path_var  finds  the  value  at  the  end  of  a  path.  This  procedure  is  used  in  both  the 
prediction  and  completion  steps  in  the  parser.  For  example,  if  the  parser  is 
looking  at  the  following  first  state  in  the  parse  trace  given  above  and  repeated 
here  as  (33),  the  parser  must  look  at  the  sub-DAG  that  occurs  at  the  end  of  the 
path  <x1>  in  order  to  make  a  prediction.  Here  the  path  is  only  of  length  one,  but 
it  will  be  useful  to  have  a  more  general  procedure  for  path_value,  primarily  for 
the  purpose  of  displaying  subparts  of  complex  DAGs. 

(33)  state  type:  prediction 
rule  =  [xO, [xl,x2] ] 
predicted  in  state  set  #0 
< 

[xO:   [maj:   s] 

[head:   _394]] 
[xl :   [ma  j  :   np] 

[head:   _412]] 
[x2 :   [maj  :   vp] 

[head:   _394]] 
> 
< 

[_394:   [subject:   _412]] 
[_412:   []] 
> 
input=  [tom, sawyer, smiles] 

We  start  off  by  describing  a  procedure  called  'naive jDath_var  that  simply 
follows  a  path  and  returns  the  feature  structure  found  at  the  end  of  the  path.  For 
example,  'naive_path_var  gives  the  result  found  in  (34). 

(34)  naive_path_val ( [head, subject] ,  %  Path 

[  [  [agr,X] ,  %  Input  DAG 

[form, Y] , 

[head, [ [subject, [ [agr,X] ] ] , 
[form,Y]]]], 
[ [X,third_sg] , [Y, finite] ] ] , 
[ [agr,X]  ] )  .  %  Output  FS 

This  much  will  be  easy  to  define.  However,  the  output  here  is  not  quite 
what  we  want.  Instead  of  the  output  feature  structure  we  would  like  an  output 
DAG,  which  means  that  we  need  to  find  a  reentrancy  list  to  go  with  this  feature 
structure.  If  we  use  the  reentrancy  list  from  the  input  DAG,  however,  we  get  the 
non-well  formed  DAG:  [[[agr,X]],[[X,third_sg],[Y,finite]]].  There  are  two  obvious 
problems  with  this  "DAG".  First  the  variable  'X'  marks  a  "reentrancy"  that  only 
occurs  once  in  the  feature  structure  and,  worse,  the  variable  'Y'  marks  a 
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"reentrancy"  that  doesn't  even  occur  at  all  in  the  feature  structure.  In  order  to 
define  the  fully  correct  'path_val',  it  will  be  necessary  to  filter  out  these  false 
reentrancies. 

We  begin,  now,  with  the  simple  'naive_path_var.  This  can  be  defined  as  a 
simple  recursive  procedure.  The  base  case  will  take  care  of  paths  of  length  one 
and  the  recursive  part  will  give  the  transitive  closure  of  this  base  case.  The 
base  case  is  given  in  (35).  For  a  path  of  length  one  what  we  want  is  a 
procedure  that  gives  the  value  of  any  given  feature.  The  procedure  'value' 
looks  for  the  value  either  in  the  feature  structure  or  in  the  reentrancy  list  (see 
Appendix  B  for  'feat_val_member'  which  is  used  for  finding  values  on 
reentrancy  lists).  If  no  value  for  the  feature  is  found  then  []  is  returned. 8 

(35)  value (Feat, [FS, R_List ] , Val) :- 

member ( [Feat, VI]  ,FS)  ,  !, 
if  (nonvar(Vl) ,Val  = 
VI, f eat_val_member ( [VI, Val] ,R_List) ) . 
value (_F,_Dag, []) . 

Once  this  is  defined,  then  'naive_path_val'  (in  (36))  is  simply  its  transitive 
closure. 

(36)  naive_path_val ( [Feature] , DAG, OutputFS) :-    %  base  case 

value (Feature, [FS, R_List ], OutputFS) . 
naive_path_val ( [Feature  I RestPath] ,  [FS, R_List, OutputFS)  :- 
value (Feature, DAG, Val) , 
naive_path_val (RestPath, [Val, R_List] , OutputFS) . 

Now  to  extend  this  to  the  fully  correct  'path_val'  requires  some 
complication.  The  basic  idea  is  to  search  through  the  output  feature  structure  to 
find  which  variables  occur  at  least  twice  (either  in  the  feature  structure  itself  or  in 
its  reentrancies).  These  variables  will  be  the  only  ones  marking  true 
reentrancies  so  the  output  reentrancy  list  must  contain  just  those  reentrancies 
on  the  input  reentrancy  list  which  are  marked  by  these  variables.  For  the  case 
of  reentrancy  variables  which  don't  occur  at  all  in  the  feature  structure  (such  as 
'Y'  in  [[[agr,X]],[[X,third_sg],[Y,finite]]]),  these  can  simply  be  thrown  away  (giving 
[[[agr,X]],[[X,third_sg]]]).  Then  for  the  case  of  variables  that  occur  exactly  once  in 
the  feature  structure  (such  as  'X'  in  ([agr,X]]),  these  reentrancies  need  somehow 
to  be  undone.  Undoing  reentrancies  is  not  hard:  if  we  have  a  list  of  variables 
occurring  exactly  once  in  the  feature  structure,  we  can  first  look  through  the 
input  reentrancy  list  to  find  just  those  reentrancies  marked  by  these  variables, 
(for  example,  in  [[[agr,X]],[[X,third_sg]]]  we  find  the  reentrancy  [X,third_sg]),  then 
we  simply  unify  the  reentrancy  variable  with  the  reentrant  feature  structure  (i.e. 
in  the  above  example,  we  use  'X  =  third_sg',  then  both  instances  of  'X'  will  be 
bound  to  'third_sg',  which  has  the  effect  of  putting  'third_sg'  in  as  the  value  of 
'agr'  --  see  the  third  clause  of  'undo_reentrancies  in  Appendix  B.   Unfortunately, 
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this  way  of  undoing  reentrancies  is  destructive  so  one  must  be  a  little  more 
careful.  Since  it  is  necessary  to  search)  all  the  way  through  the  output  feature 
structure  anyway  to  find  the  false  reentrancies,  it  takes  little  more  work  to 
replace  the  variables  in  the  process.  Then  the  destructive  binding  will  have  no 
bad  consequences.  The  complete  code  incorporating  this  solution  is  listed  in 
Appendix  B.  The  good  news  is  that  the  fully  correct  'path_value'  is  not  always 
needed.  Often  the  simpler  'naive_path_var  is  good  enough,  as  we  will  see  in 
the  next  section. 


3.4.2  Restriction 

Restriction  is  the  major  innovation  of  Shieber  (1985)  which  makes  it 
possible  to  extend  Earley  parsing  to  complex  feature  structures.  Unfortunately, 
Shieber  illustrates  this  concept  with  his  so-called  "counting  grammar"  which 
appears  to  have  dubious  relevance  to  natural  language  parsing.  The  same 
point  can  be  illustrated,  however,  with  list  valued  subcat  features  as  used  in 
HPSG.  To  show  this  we  first  define  the  partial  grammar  in  (37).  The  grammar 
would  need  rules  for  NPs  and  Vs  to  actually  yield  any  strings,  but  this  is  enough 
of  the  grammar  to  illustrate  the  point. 


(37) 


.  xO    — >   xl      x2 
xO  :  I    ma  j  :     si 

xl  :     [1]   I    ma  j  :    npj 

ma  j :    vp 

subcat 


b.  xO 


■>   xl      x2 


Ffirst:    [ifl 
rest :     end 


^^     fmaj:  vp  "j 

I    subcat :    [2M 

Ffirst:    [afl 
|_rest:      [2]J 


ma] :    vp 
subcat : 


[3] 


Assume  now  that  we  start  with  the  first  rule  and  parse  a  generic  label  NP 
(i.e.  an  NP  with  the  DAG  [[[maj,np]],[]]  and  no  other  more  specific  syntactic  or 
semantic  features).  We  then  have  the  rule  xO  ->  x2  with  the  same  DAG  as  in 
rule  1 ,  so  we  are  now  ready  to  parse  a  VP.  In  a  standard  Earley  parser  we 
would  search  through  the  grammar  for  any  rules  that  have  VP  on  the  left-hand 
side.  In  attribute  value  grammars,  on  the  other  hand,  it  is  not  always  possible  to 
do  this  since  'maj'  has  no  special  status  as  a  feature.  In  some  cases  it  may 
happen  that  left-hand  sides  are  unmarked  for  this  feature.  Shieber's  "naive 
solution"  (which  he  considers  and  correctly  dismisses)  would  be  to  take  the 
entire  sub-DAG  which  is  the  value  of  the  path  [x2]  and  unify  this  with  the  left- 
hand  side  of  some  rule  (i.e.  rule  2).  If  we  do  this  we  get  the  prediction  in  (38), 
and  if  we  do  it  again  we  get  the  prediction  in  (39).    Clearly,  we  could  go  on  ad 
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infinitum  making  new  predictions  which  crucially  are  not  subsumed  by  old 
predictions. 


(38)     xO   — >  xl      x2 

maj :     vp 

xC: 

subcat:    [2] 


ma] :    vp 


rfirst:[]maj:    np^l 
I    rest :     end 


(39) 


subcat:  [2] 
_x2:  [3] 

xO  — >  xl   x2 

ma j :  vp 

subcat:   [2 

ma  j  :  vp 
subcat:  [2] 
x2:  [3] 


Ffirst:       [3 
[_rest:         [2 


0, 


xl: 


,   Test  :P""^[I"""^  "311 

I    rest :     end 

Ffirst:    [3]"! 
|_rest:      [2]J_ 


Shieber's  solution  is  to  limit  the  amount  of  structure  the  parser  can 
consider  in  making  predictions.  The  structure  can  be  limited  by  considering 
only  a  designated  set  of  paths  which  Shieber  calls  a  "restrictor".  For  each  path, 
if  the  path  has  an  atomic  value  in  the  DAG  to  be  restricted,  that  path  will  have 
the  same  atomic  value  in  the  in  the  restricted  DAG.  If  the  path  has  a  complex 
value  or  no  value  in  the  DAG  to  be  restricted,  it  will  have  no  value  in  the 
restricted  DAG.  For  example,  if  we  apply  the  restrictor  [[maj],[head,form],[agr]]  to 
the  DAG 

[[[maj, np],[head. [[form, normal]]],[agr,[[person,third],[num,sing]]]],Q], 

we  get  the  DAG  [[[maj.np],  [head, [[form, normal]]]], []].  Shieber  also  stipulates  that 
if  any  of  the  paths  in  the  restrictor  have  reentrant  values,  these  values  will  also 
be  reentrant  in  the  restricted  DAG.  We  have  found  it  convenient,  however,  not 
to  follow  him  on  this  point.  It  appears  that  when  reentrancies  are  thrown  away, 
only  very  few  spurious  predictions  are  made,  but  the  restriction  algorithm  is 
greatly  simplified.  Of  course  the  parser  is  still  correct  when  spurious  predictions 
are  made,  since  these  predictions  will  not  be  completed.   In  order  to  explain  our 
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restriction  algorithm,  we  start  with  a  hard  coded  restrictor  (in  40)  just  for  the 
restrictor  [[maj],[head,form]]. 

(40)  restrict (X_j, Dag, RestrictedDag) :- 

naive_path_val ( [X_j,ma j] , Dag,Ma jVall) , 
naive_path_val ( [X_j,head, form] ,Dag,HFVall)  , 
if (list (MajVall) ,MajVal  =  [],  MajVal  =  MajVall),  ^ 
if (list (HFVall) ,HFVal  =  [],  HFVal  =  HFVall) , 
inake_restrictor  (  [MajVal,  HFVal]  ,  RestrictedDag)  . 

make_restrictor ([[],[]],[[],[]]). 
make_restrictor ( [ [] , HFVal] , 

[ [ [xO, [ [head, [ [form, HFVal] ]]]]],[]]). 
make_restrictor ( [MajVal, [] ] , [ [ [xO, [ [maj, MajVal] ] ] ] , [] ] ) . 
make_restrictor ( [Ma jVal, HFVal]  , 

[ [ [xO, [ [maj, MajVal] , 

[head, [ [form, HFVal] ]]]]], 
[]])  . 

The  general  version  of  restriction  (in  Appendix  B)  is  just  an  extension  of 
this  method.  The  procedure  'change_restrictor'  first  retracts  the  clauses  in  (40) 
and  then  asserts  a  new  clause  for  'restrict'  and  a  set  of  clauses  for 
'make_restrictor'.  The  new  'restrict'  clause  uses  a  generalized  version  of 
'naive_path_var  called  'get_vals'  which  finds  the  values  for  a  set  of  paths.  The 
procedure  for  asserting  the  'make_restrictor'  clauses  is  somewhat  more 
complicated.  Since  each  path  either  may  or  may  not  end  in  some  atomic  value, 
we  need  to  construct  a  template  restricted  DAG  for  each  possible  combination. 
It  would  appear,  then,  that  the  number  of  'make_restrictor'  clauses  would  be  the 
square  of  the  number  of  paths  in  the  restrictor.  In  fact,  this  is  not  the  case  since 
there  are  some  pairs  of  paths  that  cannot  both  have  values.  For  example  if 
[head.trans]  has  an  atomic  value  then  [head,trans,arg1]  could  not  possibly  also 
have  a  value,  and  if  [head,trans,arg1]  has  any  value  then  [head,trans]  would  not 
be  able  to  have  an  atomic  value.  The  way  we  have  worked  this  out  is  as 
follows:  First  non-deterministically  pick  a  subset  of  paths  that  might  have  atomic 
values  ('make_selection').  Then  try  to  combine  this  set  of  paths  into  a  coherent 
DAG  template.  If  successful,  assert  a  clause  of  'make_restrictor'  with  the  set  of 
paths  as  the  first  argument  and  the  DAG  template  as  the  second  argument  and 
then  backtrack  to  make  a  new  selection  of  paths  and  continue  the  cycle.  If  a  set 
of  paths  cannot  be  combined  into  a  coherent  DAG  template,  then  we  throw 
away  that  set  and  backtrack  to  'make_selection'  as  above. 


3.4.3  Subsumption 

Subsumption  is  a  very  important  test  for  the  parser.    Whenever  a  new 
prediction  (scan/completion)  is  made  the  parser  must  check  that  the  new  DAG  is 
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not  subsumed  by  the  DAG  of  previous  predictions  (scans/completions)  in  the 
queue;  otherwise  the  parser  can  fall  into  the  left  recursion  trap.  For  example,  a 
rule  like  'vp  ->  vp  x2'  in  Appendix  A  could  cause  the  parser  to  keep  predicting 
VPs  forever  unless  there  were  a  check  to  see  that  the  new  predictions  were 
subsumed  by  previous  predictions.  In  this  case,  of  course,  the  new  predictions 
would  not  only  be  subsumed  by  the  old  predictions,  but  they  would  actually  be 
identical  to  the  old  predictions.  Nevertheless,  we  don't  want  to  check  identity 
simply  because  identity  is  a  more  difficult  test:  DAG1  and  DAG2  are  identical  if 
DAG1  subsumes  DAG2  and  DAG2  subsumes  DAG1.  Subsumption  will  always 
be  a  sufficient  test  since  if  we  keep  only  the  most  general  predictions,  these 
predictions  will  allow  any  completions  that  a  more  specific  prediction  would 
allow. 


Informally  DAGI  subsumes  DAG2  iff  DAG2  contains  all  of  the  information 
in  DAG1  and  possibly  more.  If  DAG2  does  contain  more  information  than  DAGI 
then  we  say  that  DAG2  extends  DAG1 .  There  are  two  ways  in  which  DAG2  can 
extend  DAGI .  First  DAG2  may  have  a  value  for  some  path  which  does  not  exist 
in  DAGI .  The  idea  here  is  that  DAGs  contain  partial  information  so  even  though 
DAGI  does  not  have  a  value  for  this  path,  it  could  be  extended  to  have  a  value. 
The  other  way  DAG2  can  extend  DAGI  is  to  have  reentrant  values  (token 
identity)  for  two  paths  where  in  DAGI  the  same  two  paths  have  values  that  just 
happen  to  be  identical  (type  identity).  Thus  the  DAG  in  (41)  subsumes  the  DAG 
in  (42). 


(41) 


subj:l  agr:  I  num:third 
pred:l  agr:  I  num:thirdn J 


(42) 


subj:    I    agr:[l]   I    num:third~] J 
_pred:     [^agr:[ir] 


The  subsumption  procedure  (Appendix  B)  has  four  clauses.  The  first  two 
clauses  are  the  base  cases.  The  first  says  that  the  null  DAG  ([[],[]])  subsumes 
anything,  and  the  second  says  that  atoms  subsume  identical  atoms.  The  other 
two  clauses  are  recursive  conditions.  The  first  of  these  handles  the  case  where 
a  feature  in  DAGI  ends  in  a  non-reentrant  value.  In  this  case  this  value  must 
subsume  the  value  of  the  same  feature  in  DAG2.  It  doesn't  matter  whether  or 
not  that  value  is  reentrant  in  DAG2.  In  the  fourth  case  a  feature  in  DAGI  ends  in 
a  reentrant  value.  In  this  case  again  this  value  must  subsume  the  value  of  the 
same  feature  in  DAG2.  Moreover,  in  this  case  the  value  in  DAG2  must  be 
reentrant  since,  otherwise,  a  DAG  like  (42)  above,  would  subsume  (41). 
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Given  this  definition  for  subsumption  between  DAGs,  a  slight 
generalization  gives  a  definition  for  subsumption  between  states  as  described 
in  3.4.0  (in  the  discussion  of  the  "trace"  mode.  State  1  will  subsume  state  2  iff 
the  first  three  members  of  state  1  are  identical  to  the  first  three  members  of  state 
2  and  the  fourth  member  of  state  1  (i.e.  the  DAG)  subsumes  the  fourth  member 
of  state  2. 


4.   Conclusion 

We  would  like  to  conclude  by  briefly  discussing  some  future  directions  we 
plan  to  pursue  in  our  research  on  UNICORN.  One  such  direction  involves  the 
use  of  attribute  value  grammars  for  the  purposes  of  semantic  interpretation. 
Even  though  such  work  is  already  under  way  in  various  instantiations  of 
attribute  value  grammars  (cf.  Halvorsen  1983  for  LFG,  Pollard  and  Sag  1988 
for  HPSG,  and  Shieber  1986  and  the  present  paper  for  PATR-grammars),  some 
of  the  more  intricate  aspects  of  semantic  interpretation  remain  to  yet  be 
explored  in  this  formalism.  In  Gerdemann  and  Hinrichs  (ms)  we  discuss  how 
one  particularly  interesting  area  of  current  research  in  formal  semantics,  the 
semantics  of  generalized  quantifiers,  can  be  treated  in  attribute-value  grammars 
accepted  by  UNICORN,  including  the  non-deterministic  scoping  of  quantifiers. 

A  second  line  of  research  that  we  would  like  to  pursue  is  the  question  of 
how  to  use  UNICORN  for  the  purposes  of  natural  language  generation.  We 
plan  to  study  extensively  how  to  run  UNICORN  in  reverse  so  that,  given  a 
logical  form,  it  will  produce  a  syntactic  string  as  output.  It  will  be  interesting  to 
see  what  modifications,  if  any,  are  needed  in  order  to  turn  UNICORN  into  a 
reversible  system  that  can  be  used  for  both  parsing  and  generation.  Our  initial 
explorations  in  this  area  have  yielded  promising  results  and  will  be  followed  up 
in  detail  by  Gerdemann  (in  preperation). 

A  third  topic  that  will  be  important  for  future  extensions  of  UNICORN 
concerns  the  representation  of  feature  structures  as  data  types,  an  approach 
that  has  been  argued  for  convincingly  by  Pollard  and  Sag  1988.  The  UNICORN 
system,  as  it  stands,  provides  no  facility  to  state  generalizations  across 
categories.  Such  a  facility  would,  of  course,  greatly  simplify  the  grammar  writing 
process,  but  would  also  be  of  potential  use  in  the  parser  itself.  We  plan  to 
explore  these  possibilities  by  encoding  feature  structures  in  inheritance 
networks  of  KL-ONE  (Brachman  and  Schmoize  1985).  We  are  grateful  to  ISI  for 
making  the  KL-ONE  software  available  which  will  greatly  facilitate  our  efforts  on 
this  aspect  of  future  extensions  of  UNICORN. 
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*  We  would  like  to  thank  C.C.  Cheng  and  the  members  of  the  Unification 
Grammar  workshop  at  the  University  of  Illinois  for  helpful  feedback  on 
UNICORN.  We  are  especially  indebted  to  Jerry  Morgan  for  providing  us  with  his 
pretty  print  routine  for  DAG  structures.  The  alphabetically  first  author  was 
partially  funded  by  a  CogSci/AI  fellowship  from  the  University  of  Illinois  and  the 
alphabetically  second  author  was  supported  by  grant  #1-2-69171-1200  from 
the  Research  Board  of  the  University  of  Illinois. 

1  UNICORN  stands  for  UNIfication-based  natural  language  processing 
system  from  the  CORNfields. 

2  The  grammar  in  Appendix  A,  thus,  assumes  a  treatment  of 
subcategorization  a  la  HPSG  (Pollard  and  Sag  1987)  in  which  subject  NPs  are 
subcategorized  for.  The  list  character  of  subcategorization  is  handled  by 
treating  the  value  of  subcat  in  terms  of  two  features  first  and  rest  which  play  a 
role  similar  to  the  head  and  tail  of  a  list  in  LISP  or  Prolog. 

3  The  DAG  unifier  runs  only  in  the  mode  d_union(lnput, Input, Output).  We 
are  not  aware  of  applications  requiring  the  reverse  mode: 
d_union(Output, Output, Input).  It  might  be  thought  that  the  reverse  mode  would 
be  useful  for  generation  but  our  initial  results  using  UNICORN  for  generation 
continue  to  use  d_union  in  the  forward  mode. 

4  Both  (19)  and  (20)  rely  on  this  property  of  'findall'.  As  pointed  out  to  us  by 
Bob  Wengert,  this  property  is  due  to  the  implicit  existential  quantification  of 
variables  inside  the  body  of  'findall'. 

5  In  addition  to  these,  we  have  experimented  with  equations  of  the  form 
<path>  =  [],  meaning  that  the  path  has  some  unknown  value.  Such  an  equation 
could  be  used,  for  example  in  the  's  ->  comp  s'  rule  in  Appendix  A,  which  we 
repeat  here  for  convenience: 

i.         s    ->   comp      s : 

<x2  rel>  =  minus 

<xO  rel>  =  plus 

<xO  sem>  =  <x2  sem> 

<xO  gap  index>  =  <x2  gap  index>. 

If  we  had  put  the  feature  'gap'  on  the  <xO  sem>  path,  then  we  could  have 
left  out  the  equation  <xO  sem  gap  index>  =  <x2  sem  gap  index>  since  this  is 
included  in  the  equation  <xO  sem>  =  <x2  sem>.  But  omitting  this  equation 
would  allow  the  possibility  of  forming  a  relative  clause  even  when  the  lower 
sentence  has  the  value  'minus'  for  the  path  <sem  gap>;  i.e.  one  could  form  a 
relative  clause  like  'that  Tom  Sawyer  smiled'  with  no  gap.   One  could  solve  this 
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problem  with  the  equation  <x2  sem  gap  index>  =  [],  which  rules  out  the 
possibility  of  'gap'  having  the  value  'minus'.  As  for  the  grammar  interpreter, 
equations  like  <path>  =  []  can  be  handled  in  the  same  manner  as  equation  (24). 

6  Actually  Earley  speaks  of  moving  the  "dot"  to  the  right  of  the  'A'.  The 
effect  of  this  is  identical  to  removing  'A'  from  the  prediction. 

7  One  other  difference  between  our  parser  and  the  standard  Earley  parser 
about  which  we  have  little  to  say  is  that  the  Unicorn  system  lacks  any  kind  of 
lookahead. 

8  Here  we  are  not  distinguishing  between  features  that  happen  not  to  have 
values  and  features  that  couldn't  possibly  have  values.  For  example  the  atomic 
DAG  [finite, []]  could  not  be  further  instantiated  to  have  a  value  for  'subcat',  yet 
according  to  (35)  the  value  of  the  feature  is  [].  If  one  needed  to  make  such  a 
distinction,  it  would  be  easy  to  modify  this  procedure. 

9  In  practice  these  two  'if  clauses  are  not  needed  since  '<maj>  and  <head 
form>  are  always  atomic  valued  paths. 
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Appendix  A 

The  grammar  in  this  appendix  includes  the  following  features:  list-valued 
subcategorization  features,  logical  form  construction,  quantifiers,  traces, 
compound  nouns,  and  relative  clauses  with  object  gaps.  All  of  these  features 
are  illustrated  in  the  following  parse: 
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parse ( [every, fence, that, torn, sawyer,  painted, shines] ) 


[maj  : 

s] 

[head: 

[form: 

finite] ] 

[sem: 

[trans: 

[quant : 

every] 

[range: 

[pred: 
[conjl: 

[conj2: 

and] 
[pred:   fence] 

[argl:   _39497]] 
[pred:   paint] 
[argl:   torn] 
[arg2:   _39497]]] 

[scope: 

[pred: 
[argl: 

shine] 
_39497]]]] 

[gap:      minus] ] 


[_39497:       []] 
> 

Most  of  the  features  in  this  parse  are  self-explanatory.  Only  the  feature 
'gap'  requires  explanation.  The  feature  'gap'  can  take  either  a  complex  or  an 
atomic  value.  It  takes  the  atomic  value  'minus'  to  indicate  that  there  is  no  gap, 
as  is  the  case  in  the  above  parse.  If  there  is  a  gap  it  takes  the  complex  value 
[[index.Var]]  where  the  value  of  'Var'  will  be  the  translation  of  the  constituent  that 
is  gapped.  The  following  parse  shows  the  case  of  a  direct  object  gap: 


1  ?-   parse ( [torn, 
< 

, sawyer, painted]  ) 

[maj:   s] 

[head:  [form.: 

finite]] 

[sem:   [trans: 

[pred:   paint] 

[argl:   tom] 

[arg2:   _12192] 

[gap: 


[index: 


12192] ] 


[  12192 


Since  the  path  <gap  index>  has  a  value,  this  phrase  can  be  used  by  the 
relative  clause  rule  's  ->  comp  s',  which  includes  the  path  equation  <xO  gap 
index>  =  <x2  gap  index>.  The  idea  here  is  that  if  'gap'  has  the  value  'minus' 
then  the  values  of  these  two  paths  will  be  undefined  and  hence  not  equal  to 
each  other  (in  this  respect  we  follow  the  logic  of  attribute  value  grammars  in 
Johnson  1987).  Given  this  equation,  then,  we  can  parse  that  Tom  Sawyer 
painted  as  a  relative  clause  but  correctly  fail  to  parse  that  Tom  Sawyer  painted 
a  fence  as  a  relative  clause.  The  feature  'rel'  in  this  rule  has  the  effect  of  ruling 
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out  iterated  complementizers.    With  or  without  a  complementizer  this  clause 
could  combine  with  a  noun  given  the  rule  N  ->  N  S: 

I  ?-   parse  (  [fence,  torn,  sawyer,  painted]  )  . 
< 

[maj :   n] 

[sem:   [trans:   [pred:   and] 

[conjl:   [pred:   fence] 
[argl:   _31446]] 
[conj2:   [pred:   paint] 
[argl:   torn] 
[arg2:   _31446]]]] 
[gap:      [index:   _31446]] 
> 
< 

[_31446:   []] 
> 

This,  in  turn,  combines  with  the  determiner  'every'  to  give  the  following: 

I  ?-   parse ( [every, fence, torn, sawyer, painted] ) . 


[maj:   np] 

[head:   [agreement:   [person:   third] 

[number:   singular]]] 
[sem:   [trans:     [quant:   every] 

[range:   [pred:   and] 

[conjl:   [pred:   fence] 

[argl:   _30752]] 
[conj2:   [pred:   paint] 
[argl:   torn] 
[arg2:   _30752]]] 
[scope  :   _30794] ] 
[fun_arg:   [trans:   _30794] 

[index:   _30824]] 
[parameter:   _30752]] 
[gap:   [index:    30824]] 


[_30752:  [] 
[_30794:  [] 
[  30824:   [] 
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The  new  features  in  this  DAG,  'fun_arg'  and  'parameter'  are  used  to 
handle  the  type  raising  necessary  to  give  a  Montague  style  analysis  of 
quantified  and  referential  noun  phrases.  Before  explaining  how  these  features 
work,  let  us  compare  the  above  parse  with  the  parse  for  a  referential  NP: 


1  ?-   parse ( [torn, sawyer] ) . 

[maj: 

np] 

[head: 

[agreement:   [gender 

:   masculine] 

[person 

:   third] 

[number 

:   singular] ] 

[sem: 

[parameter:   torn] 
[trans:   _3013] 

[fun  arg:   [trans:    ; 

3013] 

[index:   _3043]]] 

[gap: 
> 

[index:   _3043]] 

< 

[  3043; 

:   []] 

[_3013 

:   []] 

Consider  first  the  path  <sem  parameter>.  In  the  case  of  the  referential  NP, 
the  value  of  this  path  is  'torn'.  When  '  torn  sawyer'  combines  with  a  VP,  this  is 
the  value  that  will  be  fed  into  the  argument  position  in  the  translation.  In  the 
case  of  a  quantified  NP,  what  gets  fed  into  the  argument  position  is  a  variable 
(as  in  the  first  parse  in  this  appendix).  As  in  Montague  Grammar,  our  grammar 
treats  NPs  as  functors.  Thus,  the  VP  semantics  has  to  be  fed  back  into  the  NP 
semantics  before  the  translation  is  passed  up  to  the  mother.  We  do  this  using 
the  feature  'fun_arg'.  The  name  is  mnemonic  for  functor  and  argument, 
reflecting  the  Janus-head  nature  of  NPs  in  our  grammar:  Semantically  the  VP  is 
an  argument  to  the  NP  semantics,  while  syntactically  the  VP  is  treated  as  a 
functor  since  it  subcategorizes  for  the  NP.  With  these  rules  we  can  now  parse 
the  more  complex  example  below: 
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?-   parse  (  [torn,  sawyer,  gave,  every,  boy,  a,  brush]  ) 

[maj:   s] 

[head:   [form:   finite]] 

[sem:   [trans:   [quant:   exists] 


[range: 

[pred: 

brush] 

[argl: 

_44801]] 

[scope: 

[quant : 

every] 

[range: 

[pred: 

boy] 

[argl: 

_44837]] 

[scope: 

[pred: 

gave] 

[argl: 

torn] 

[arg2: 

44801] 

[arg3: 

44837]] 

gap:   minus] 


[_44801:   []] 
[  44837:   []] 


%%  The  Grammar 

s  ->  np  vp: 

<xO  head>  =  <x2  head> 

<xO  head  form>  =  finite 

<xO  sem  trans>  =  <xl  sem  trans> 

<xl  sem  fun_arg  trans>  =  <x2  sem  trans> 

<xO  gap>  =  <x2  gap> 

<x2  subcat  first>  =  <xl> 

<x2  subcat  rest>  =  end. 

s  ->  comp   s: 

<x2  rel>  =  minus 

<xO  rel>  =  plus 

<xO  sem>  =  <x2  sem> 

<xO  gap  index>  =  <x2  gap  index>. 


vp  ->  v: 

<xO  head>  =  <xl  head> 

<xO  sem  trans>  =  <xl  sem  trans> 

<xO  gap>  =  minus 

<xO  subcat>  =  <xl  subcat>. 
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vp  ->  vp  x2:  %  Note  left  recursion 

<xO  head>  =  <xl  head> 

<xO  sem  trans>  =  <x2  sem  trans> 

<xl  sem  trans>  =  <x2  sem  fun_arg  trans> 

<x2  sem  fun_arg  gap>  =  <xl  gap> 

<xO  gap>  =  <x2  gap> 

<xl  subcat  first>  =  <x2> 

<xO  subcat>  =  <xl  subcat  rest>. 

np  ->  det  n: 

<xO  head>  =  <x2  head> 

<xO  sem>  =  <xl  sem> 

<xO  sem  fun_arg  gap>  =  <xO  gap>  %  NP  doesn't  introduce  a  gap 

<xl  subcat  first>  =  <x2> 

<xl  subcat  rest>  =  end. 

n  ->  n  s : 

<xO  sem  trans  pred>  =  and 

<xO  sem  trans  conjl>  =  <xl  sem  trans> 

<xO  sem  trans  conj2>  =  <x2  sem  trans> 

<xl  sem  parameter>  =  <x2  gap  index> 

<xO  sem  parameter>  =  <x2  gap  index>. 

Word  that: 
<maj>  =  comp. 

Word  :  %  This  is  a  trace 

<maj>  =  np 

<sem  parameter>  =  <gap  index> 

<sem  fun_arg  gap>  =  minus         %  Don't  combine  with  a  vp 


%that  already  has  a  gap 


<sem  fun_arg  trans>  =  <sem  trans>. 


Word  shines: 

<maj>  =  V 

<head  form>  =  finite 

<sem  trans  pred>  =  shine 

<sem  trans  argl>  =  <subcat  first  sem  parameter> 

<subcat  first  maj>  =  np 

<subcat  first  head  agreement  person>  =  third 

<subcat  first  head  agreement  number>  =  singular 

<subcat  rest>  =  end. 
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Word  smiles: 

<maj>  =  V 

<head  form>  =  finite 

<sem  trans  pred>  =  smiles 

<sem  trans  argl>  =  <subcat  first  sem  parameter> 

<subcat  first  maj>  =  np 

<subcat  first  head  agreement  person>  =  third 

<subcat  first  head  agreement  number>  =  singular 

<subcat  rest>  =  end. 


Word  painted: 

<maj>  =  V 

<head  form>  =  finite 


<sem  trans  pred> 
<sem  trans  argl> 
<sem  trans  arg2> 
<subcat  first  maj>  =  np 
<subcat  rest  first  maj>  = 
<subcat  rest  rest>  =  end, 


paint 

<subcat  rest  first  sem  parameter> 

<subcat  first  sem  parameter> 


np 


Word  gave: 

<maj>  =  V 

<head  form>  =  finite 

<sem  trans  pred>  =  gave 

<sem  trans  argl>  = 

<sem  trans  arg2>  = 

<sem  trans  arg3>  = 

<subcat  first  maj> 

<subcat  rest  first 

<subcat  rest  rest  first  maj> 

<subcat  rest  rest  rest>  =  end 


<subcat  rest  rest  first  sem  parameter> 
<subcat  rest  first  sem  parameter> 
<subcat  first  sem  parameter> 
=  np 
maj>  =  np 

=  np 


Word  every: 
<maj>  =  det 
<sem  trans  quant> 
<sem  trans  range> 
<sem  trans  scope> 
<sem  parameter>  = 
<subcat  first  maj> 


=  every 

=  <subcat  first  sem  trans> 
=  <sem  fun_arg  trans> 
<subcat  first  sem  parameter> 
=  n 


<subcat  rest>  =  end 

<subcat  first  head  agreement  person>  =  third 

<subcat  first  head  agreement  number>  =  singular. 
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Word  a : 

<maj>  =  det 

<sem  trans  quant>  =  exists 

<sem  trans  range>  =  <subcat  first  sem  trans> 

<sem  trans  scope>  =  <sem  fun_arg  trans> 

<sem  paraineter>  =  <siibcat  first  sem  parameter> 

<subcat  first  maj>  =  n 

<subcat  rest>  =  end 

<subcat  first  head  agreement  person>  =  third 

<subcat  first  head  agreement  number>  =  singular, 

Word  tom  sawyer: 

<maj>  =  np 

<head  agreement  gender>  =  masculine 

<head  agreement  person>  =  third 

<head  agreement  number>  =  singular 

<sem  parameter>  =  tom 

<sem  trans>  =  <sem  fun_arg  trans> 

<sem  fun_arg  gap>  =  <gap>. 

Word  fence : 

<maj>  =  n 

<head  agreement  gender>  =  neuter 

<head  agreement  person>  =  third 

<head  agreement  number>  =  sing^jlar 

<sem  trans  pred>  =  fence 

<sem  parameter>  =  <sem  trans  argl>. 

Word  brush: 

<maj>  =  n 

<head  agreement  gender>  =  neuter 

<head  agreement  person>  =  third 

<head  agreement  number>  =  singular 

<sem  trans  pred>  =  brush 

<sem  parameter>  =  <sem  trans  argl>. 

Word  boy : 

<maj>  =  n 

<head  agreement  gender>  =  neuter 

<head  agreement  person>  =  third 

<head  agreement  number>  =  singular 

<sem  trans  pred>  =  boy 

<sem  paraineter>  =  <sem  trans  argl> 


Appendix  B 

%%%%%%%%%%%%      DAG  Unifier      %%%%%%%%%%%%%%%%%% 

d_union( [FSl,R_Listl] , [FS2,R_List2] , [OutFS, OutR_List ] ) :- 
append (R_Listl, R_List2, R_List) ,  %  make  joint  reentrancy 
aux_d_union (FSl,FS2,R_List, OutFS, OutR_List) ,! .    %  list 

/*  Three  termination  conditons  */ 
aux_d_union( [] ,FS2,R_List,FS2,R_List) :-  !. 
aux_d_union (FSl, [ ] , R_List,FSl,R_List ) :-  !. 
aux_d_union (FS,FS,R_List,FS,R_List) :-  !, 
atom(FS) 

/*  Pathl's  feature  exists  in  Dag2  */ 
aux_d_union( [ [FE,FSlVal] [RestFSl] ,FS2, InR_List, 
[OutFSIRestFS] ,OutR_List) :- 
remove ( [FE,FS2Val] ,FS2,RestFS2 ) , ! , 

aux_d_union (RestFSl, RestFS2, InR_List,RestFS,R_Listl) , 
match_values (FE,FSlVal, FS2Val, R_Listl, OutFS, OutR_List) . 

/*  Pathl's  feature  does  not  exist  in  Dag2  */ 
aux_d_union ( [FSl I RestFSl] , [FirstFS2 |RestFS2] , 
R_List, [FSl lOutFS] ,OutR_List) :- 
aux_d_union (RestFSl, [FirstFS2 |RestFS2] ,R_List, 
OutFS, OutR_List) . 

/*  Pathl  and  Path2  both  end  in  reentrant  values  that  have 
already  been  unified  */ 
match_values(FE,FSlVar,FS2Var,R_List, [FE, FSlVar] , R_List ) :- 

var (FSlVar) , 

var (FS2Var) , 

FSlVar  ==  FS2Var, ! .  %  variables  have  already  been  unified 

/*  Pathl  and  Path2  both  end  in  reentrant  values  */ 
match_values (FE, FSlVar, FS2Var,R_List, [FE, FSlVar] , 
[ [FSlVar, ReentrantFS] |RestR_List ] ) :- 

var (FSlVar) , 

var (FS2Var) , ! , 

/*  Find  the  reentrant  values  on  R_List  */ 

feat_val_remove( [FSlVar, FSlVal ] , R_List, R_Listl) , 

feat_val_remove( [FS2Var, FS2Val] , R_Listl, R_List2) , 

FSlVar  =  FS2Var, 

aux_d_union(FSlVal,FS2Val,R_List2, 

ReentrantFS, RestR  List).       %  Unify  the  reentrant  values 
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/*  Pathl  ends  in  a  reentrant  value,  but  Path2  does  not  */ 
match_values (FE, FSlVar, FS2Val, R_List, [FE,FSlVar] , 
[ [FSlVar,ReentrantFS] I RestR_List ] )  :- 
var (FSlVar) , ! , 

feat_val_remove ( [FSlVar, FSlVal] ,R_List,R_Listl) , 
aux_d_union(FSlVal,FS2Val,R_Listl,ReentrantFS,RestR_List) 

/*  Path2  ends  in  a  reentrant  value  but  Pathl  does  not  */ 
match_values (FE,  FSlVal, FS2Var, R_List ,  [FE, FS2Var] , 
[ [FS2Var,ReentrantFS] I RestR_List] )  :- 
var (FS2Var) , ! , 

feat_val_remove ( [FS2Var,FS2Val]  ,  R_List,  R_Listl)  , 
aux_d_union (FSlVal, FS2Val,R_Listl,ReentrantFS,RestR_List) 

/*  Pathl  and  Path2  both  end  in  non-reentrant  values  */ 
match_values (FE, FSlVal, FS2Val,R_List,  [FE,  OutFS] , OutR_List)  : 
aux  d  union (FSlVal, FS2Val,R  List ,  OutFS,  OutR  List). 


%%%%%%%%%%%%%   Path  Value   %%%%%%%%%%%%%%%%%% 

path_val (Path, [FS,R_List], [Copy_of_OutFS,NCopy_of_R_List ] ) :- 
naive_path_val (Path,  [FS,R_List] , OutFS)  ,  %  See  section  3.4.1 
find_true_reentrancies ( [OutFS, R_List]  , 

[Copy_of_OutFS,Copy_of_R_List] , 
TrueReentrancies) , 
undo_reentrancies (Copy_of_R_List, NCopy_of_R_List, 
TrueReentrancies) . 

/*  undo_reentrancies  TR  =  true  reentrancies  list.  If  a 
reentrancy  variable  (RVar)  is  not  on  this  list,  the 
reentrancy  is  'undone'  and  then  thrown  away  */ 

undo_reentrancies ([],[] ,_TR) .  %  Termination 

/*  In  this  case  we  retain  the  true  reentrancy  list  */ 
undo_reentrancies ( [ [RVar,FS] IRestReentrancies] , 

[[RVar,FS] INRestReentrancies] ,TR) :- 

var_meiTLber  (RVar,  TR)  ,  !  , 

undo_reentrancies (RestReentrancies,NRestReentrancies,  TR)  . 

/*  This  (destructively)  undoes  the  reentrancy.  It's  a  good 
thing  we're  working  with  a  copy!  */ 
undo_reentrancies ( [ [FS,FS]  IRestReentrancies] , 
NRestReentrancies, TR) :- 
undo  reentrancies (RestReentrancies, NRestReentrancies,  TR)  . 
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/*  find_true_reentrancies (Dag, Copy_of_Dag, 

List_of_variables_marking_true_reentrancies) 
This  finds  all  reentrancy  variables  that  occur  at  least  twice 
and  in  the  process  copies  the  DAG. —  CFS  =  Copy  of  FS  */ 
find_true_reentrancies ( [FS,R_List]  ,  [CFS, CR_List ] , TR)  :- 
true_reentrancies_in_FS (FS, CFS, Binding,  TRl)  , 
true_reentrancies_in_R_List (R_List,  CR_List,  Binding,  TRl, TR)  . 

/*  Initialize  accumulator  variables  */ 
true_reentrancies_in_FS (FS, CFS, Binding, TR) :- 

true_reentrancies_in_FS (FS, CFS, [], Binding, [ ] , TR) . 
true_reentrancies_in_FS (A, A, Binding, Binding, TR, TR) :- 

atom (A) , ! . 

/*  When  a  variable  V  is  encountered,  a  new  arbitrarily  chosen 
variable,  VI,  is  introduced  for  the  copy,  and  [V,V1]  is  added 
to  Binding  to  indicate  that  V's  counterpart  in  the  new  copy 
is  VI.  */ 
true_reentrancies_in_FS (V, VI, Binding, Binding, TR, TRl) :- 

var  (V)  , 

feat_val_member ( [V,V1] , Binding)  ,  ! , 

add_to (VI , TR, TRl) .  %  adds  VI  iff  VI  is  not  already  a  member 

/*  Here  we  encounter  a  reentrancy  variable  for  the  first 
time.  We  don't  know  yet  if  it  marks  a  true  reentrancy.  */ 

true_reentrancies_in_FS (V, VI, Binding, [ [V,V1] | Binding] , 
TR,TR)  :- 
var (V) , ! . 

/*  This  is  the  recursive  clause.  The  feature  structure  is 
complex  so  we  first  break  it  down  */ 
true_reentrancies_in_FS ( [ [F,V]  |Rest],  [[F,CV]  ICRest], 
Binding, Binding2, TR,  TR2)  :- 
true_reentrancies_in_FS (V, CV, Binding, Bindingl ,  TR, TRl ) , 
t rue_reentrancies_in_FS (Rest , CRest , Bindingl , Binding2 , 
TR1,TR2) . 

/*  Find  true  reentrancies  in  the  part  of  the  reentrancy  list 

that  is  used  */ 

true_reentrancies_in_R_List (R_List,  [[NVar,CFS]  I RestCR_List]  , 

Binding, TR, TR2) :- 
remove ( [RVar,FS] , R_List,NR_List)  , 

feat_val_member ( [RVar,NVar] , Binding) ,! ,  %  RVar  is  used 
true_reentrancies_in_FS (FS, CFS,  Binding,  Bindingl,  TR,  TRl)  , 
true_reentrancies_in_R_List (NR_List,RestCR_List, Bindingl, 

TR1,TR2) . 

true  reentrancies  in  R  List (  R, [ ] ,  Binding, TR, TR) . 
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%%  add  item  to  list  if  the  item  is  not  already  a  member, 
add_to  (X,L,L)  :- 

var_member (X,  L)  ,  !  . 
add  to(X,L,  [X|L]  )  . 


%%%%%%%%%%%%%   Restriction   %%%%%%%%%%%%%%%%% 

change_restrictor (Paths)  :- 

retractall  (restrict (_,_) ) , 

retractall (make_restrictor (_, _) ) , 

assert ( (restrict (Dag, RestrictedDag) :- 
get_vals (Paths, Dag, Vals)  , 
make_restrictor (Vals, RestrictedDag) ) ) , 

assert_restrictions (Paths) . 

/*  Find  all  the  values  at  the  ends  of  the  paths  */ 
get_vals (X_j, [FirstPath | RestPaths] , Dag, [Val IRestVals] ) :- 

naive_path_val ( [X_j [FirstPath] , Dag, Vail) , 

if (list (Vail) , Val  =  [],Val  =  Vail), 

get_vals (X_j, RestPaths, Dag, RestVals) . 
get_vals(_X_j, [],_Dag, []) . 

/*  Make  the  clauses  for  'Make_restrictor '  */ 

assert_restrictions (Paths) :- 

make_selection (Paths, SelectedPaths) , 
successively_combine (SelectedPaths, FS, Vars) , 
assert ( (make_restrictor (Vars,  [  [  [xO,FS] ] ,  []]))), 
fail. 

assert_restrictions (_) . 

/*  Nondeterministically  pick  a  subset  of  paths  that  have 
values  other  than  []  at  their  ends  */ 
make_selection ( [_F | R] , [ [ ] I Rl ] ) : - 

make_selection (R,  Rl)  . 
make_selection( [F|R] , [F|R1] ) :- 

make_selection (R,  Rl)  . 
make_selection ( [  ]  ,  [  ]  )  . 

successively_combine ([],[],[]). 

successively_combine ( [ [ ] [RestPaths] , CombinedPaths, 
[ [] IRestVs]) :-!, 
successively  combine (RestPaths, CombinedPaths, RestVs) . 
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successively_coinbine  (  [FirstPathlRestPaths]  , 

CombinedPaths, [Var | RestVars] ) :- 
successively_combine (RestPaths, InitialCombination, 

RestVars) , 
mak;e_FS(FirstPath,Var,FS)  , 
combine (FS, InitialCombination, CombinedPaths) . 

/*  'make_FS'  makes  a  feature  structure  from  a  path  and  a 
value, e.g.  make_FS  ( [xO, head, form] , finite, 

[ [xO, [ [head, [ [form, finite] ]]]]])  */ 
make_FS ( [FirstFeature [RestFeatures] , Val, 

[ [FirstFeature, FFeatureValue] ] ) :- 
make_FS (RestFeatures, Val, FFeatureValue) . 
make_FS ( [] , Val, Val) . 

/*  This  is  a  very  simply  version  of  d_union  that  only  unifies 

feature   structures  with  no  reentrancies  */ 

combine (X,_,_)  :-  var (X)  ,!, fail . 

combine  (X,  []  ,X)  :-  !  ,  . 

combine  (  [ [X,FS] ] , CombinedPaths, 

[[X,OutFS] IRestCombinedPaths] ) :- 

remove ( [X,FS2] , CombinedPaths, Rest CombinedPaths] ) , ! , 

nonvar (FS2) , 

combine (FSl,FS2,0utFS) . 
combine ( [FS] , CombinedPaths, [FS I CombinedPaths] ) . 

combine (X, [] ,X) . 

combine ([[X,FS1]], [[X,FS2]], [ [X,OutFS]]) :-  !, 

combine (FSl,FS2,0utFS) . 
combine ([ [X,FS1] ] , [ [Y,FS2] ] , [ [X,FS1] , [Y,FS2] ]) . 


%%%%%%%%%%%%%  Subsumption   %%%%%%%%%%%%%%%%%% 

/*  Null  feature  structure  with  any  reentrancy  subsumes 
anything  */ 
subsumes ([[],_],_) . 

subsumes ( [A, _], [A, _]) :-  %  An  atom  subsumes  an  identical  atom, 
atom (A) . 

subsumes ([ [ [FE,FSlVal] I RestFSl] , R_Listl ] , [FS2, R_List2] ) :- 
nonvar (FSlVal) ,! ,  %  It  doesn't  matter  whether  or  not  Dag2 
value  (FE,  [FS2,R_List2] ,FS2Val) ,  %  has  a  reentrancy  here 
subsumes ( [FSlVal,R_Listl] , [FS2Val, R_List2] ) , 
subsumes ( [RestFSl, R  Listl] , [FS2,R  List2]). 
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subsumes ( [ [ [FE,FSlVar]  IRestFSl] ,R_Listl]  ,  [FS2, R_List2] )  :- 
member ( [FE,FS2Var] ,FS2) , 

var(FS2Var),  %  If  Dagl  has  a  reentrancy  then  Dag2  must  have 
feat_val_member ( [FSlVar,FSlVal] ,R_Listl) ,  %  one  in  the  same 
feat_val_member ( [FS2Var,FS2Val] ,R_List2) ,  %  place, 
subsumes ( [FSlVal, R_Listl ] , [FS2Val, R_List2 ] ) , 
subsumes ( [RestFSl,R  Listl] , [FS2, R  List2]).\ 


%%%%%%%%%%%     Auxiliary  Predicates   %%%%%%%%%%%%%%%% 

/*  var_member (Member, List_of_Variables) .Used  in  Path  Value  */ 
var_member (X, [F|_]) :- 

X  ==  F,  !  . 
var_member (X, [_|R]) :- 

var_member(X,R) . 

/*  feat_val_member  for  checking  membership  on  reentrancy 
lists   Used  in  Path  Value  and  Subsumption  */ 
feat_val_member ( [F,V] , [ [F1,V] |_] ) :-  F  ==  Fl. 
feat_val_member (X, [_H|T]) :-  feat_val_member (X, T) . 

/*  f eat_val__remove  for  removing  members  from  reentrancy 
lists.  Used  in  DAG  Unifier  */ 
feat_val_remove { [F,V] , [ [F1,V] |Xs] ,Xs) :- 

F  ==  Fl . 
feat_val_remove (X, [Y|Ys], [Y|Zs]) :-  f eat_val_remove (X, Ys, Zs) . 

/*  Remove  from  any  kind  of  list.  Used  by  DAG  Unifier  */ 

remove (X, [X|Xs] ,Xs) . 

remove (X,  [Y|Ys],  [Y|Zs])  :-  remove (X, Ys, Zs)  . 

/*  Used  by  Path  Value  and  Restriction   */ 

list([])  . 

list  (  [  First  I  Rest]).  %  assume  that   Rest  is  a  list. 
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This  paper  deals  with  the  interaction  of  phonology  and  syntax 
in  relative  clauses  and  wh-constructions  in  Ciluba'.  Unlike  other 
Bantu  languages  Ciluba  does  not  have  an  overt  relative  clause 
marker.  This  raises  the  question  of  how  relative  clauses  are  formed 
in  Ciluba,  and  how  such  clauses  can  be  distinguished  from  non- 
relative  clauses,  especially  where  both  relative  clauses  and  non- 
relative  clauses  have  the  same  surface  stnjcture.  The  paper 
addresses  these  questions  within  the  Government  and  Binding 
framework  (Chomsky  1981).  It  is  observed  that  although  they  do 
not  contain  overt  relative  prounouns  or  markers,  relative  structures 
are  similar  to  structures  with  fronted  interrogative  pronouns,  just  as 
in  many  other  languages:  Both  in  relative  structures  and  in 
structures  with  moved  interrogatives,  move-alpha  brings  about  the 
loss  of  mutual  c-command  between  the  verb  and  the  preceding 
constituent,  along  with  consequent  verbal  tone  changes.  In  the 
case  of  relative  structures,  the  resulting  tone  changes  may  be 
viewed  as  an  indirect  marker  of  relative  clause  formation.  However, 
since  the  same  changes  take  place  in  structures  with  fronted  wh- 
words,  the  tonal  changes  cannot  be  considered  as  functioning 
primarily  as  relative  markers.  At  best,  it  could  be  argued  that  tone 
change  secondarily  functions  as  something  like  a  signal  of  relative 
clause  formation  in  Ciluba. 


1.    Introduction 

The  question  of  how  phonology  and  syntax  interface  has  received 
considerable  attention  in  the  past  twenty  years  or  so,  as  can  be  seen  from  an 
increasing  number  of  studies  devoted  to  this  topic,  such  as  Hetzron  1972, 
Clements  1978,  Kaisse  1981,  Pullum  and  Zwicky  1983,  Sadock  1983,  Plank 
1984,  Selkirk  1984,  Klavans  1985,  Zwicky  1985,  Kenstowicz  1987. 


One  of  the  concerns  in  most  such  studies  has  been  to  determine 
whether  the  application  of  certain  syntactic  rules  is  conditioned  by 
phonological  rules,  or  whether  what  goes  on  in  phonology  has  no  relation 
whatsoever  to  syntax  and  vice  versa.  The  view  that  syntax  is  unable  to 
influence  phonology  and  vice  versa  is  sometimes  referred  to  as  the  principle 
of  phonology-free  syntax  (Pullum  &  Zwicky  1983),  or  as  the  principle  of 
syntax-free  phonology  in  the  sense  of  Selkirk  1984. 

Assuming  that  the  principle  of  phonology-free  syntax  is  correct,  the 
question  is  how  does  one  account  for  facts  such  as  those  presented  in  the 
Ciluba  structures  in  (1)  and  (2). 

(1)  a.  Muntu  uvwa  mushipa  ntambwe    'The  man  killed  the  lion' 

b.  Muntu  uvwa  mushipa  ntambwe    'The  man  who  killed  the  lion..." 

(2)  a.  Ngandu  uvwa  mushipa  mwana     'The  alligator  killed  the  child' 
b.  Ngandu  uvwa  mushipa  mwana     'The  alligator  that  killed  the 

child...' 

In  terms  of  the  order  of  words  the  structures  in  (1)  are  identical,  and  so 
are  the  structures  in  (2).  The  only  thing  that  distinguishes  the  a-structures 
from  the  b-structures  is  the  tonal  pattern  on  the  auxiliary  verb  L/-v/iva.  The 
auxiliary  verb  stem,  -\/^a,  has  a  high  tone  in  both  (1)  and  (2).  However,  its 
prefix  u-  has  a  low  tone  in  the  a-structures,  while  in  the  b-structures  it  has  a 
high  tone. 

The  question  that  is  worth  raising  here  is  whether  tones  or  phonology  for 
that  matter  condition  the  application  of  the  rule  of  relativization  in  Ciluba;  and 
if  not,  why  it  is  that  the  b-structures  of  (1)-(2)  are  interpreted  as  relative 
structures. 

In  this  paper  I  shall  argue  that  although  in  surface  order  the  a-structures 
in  (1)-(2)  are  identical  to  the  corresponding  b-structures,  their  respective 
underlying  hierarchical  syntactic  structures  are  different.  The  hypothesis  of  this 
paper  is  that  the  verbal  tone  changes  in  the  b-stnjctures  are  syntactically 
conditioned.  It  is  important  to  note  that  similar  verbal  tone  changes  obtain  in 
interrogative  wh-structures  in  which  a  wh-word  has  been  fronted.  I  shall 
present  two  cases  from  Ciluba  which  provide  support  for  the  hypothesis  made 
here.  The  first  has  to  do  with  the  formation  of  Ciluba  relatives,  and  the 
second  with  the  fronting  of  wh-words.  In  order  to  understand  the  discussion  of 
the  two  cases,  a  background  on  Ciluba  tone  sandhi  is  needed.  The 
discussion  will  be  limited  to  verbal  tones,  since  I  will  be  dealing  with  verbal 
tone  changes  in  relatives  and  interrogatives. 
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2.    The  tones  of  the  verb 


Ciluba  verbs  can  be  grouped  in  two  main  categories:  (a)  verbs  whose 
stems  (mono  or  polysyllabic)  have  a  high  tone,  and  (b)  verbs  whose  stems 
have  a  low  tone.  The  first  category  of  verbs  will  be  referred  to  as  high  tone 
stem  (HTS)  verbs,  and  the  second  as  low  tone  stem  (LTS)  verbs.  The 
examples  in  (3)  illustrate  the  first  category,  and  those  in  (4)  the  second.  Note 
that  in  both  categories,  if  the  verb  is  in  the  infinitive  form,  the  prefix  and  the 
final  vowel  suffix  always  bear  a  high  tone.  In  the  data  presented  below  and 
throughout  this  paper  /  [acute  accent]  stands  for  a  high  tone,  low  tones  are 
unmarked. 


(3)  High-Tone  Stems  (HTS) 
Prefix    Stem  Gloss 


(4)  Low-Tone  Stems  (LTS) 
Prefix   Stem       Gloss 


ku-        ship  -a        'to  kill'  ku-       tum-    a    'to  send' 

ku-         long  -a        'to  learn'  ku-        mon-    a  'to  see' 

ku-        akaj  -a         'to  fix'  ku-       peesh-a  'to  give' 

The  verbs  in  (3)  and  (4)  can  be  followed  by  suffixes.  However,  affixation 
has  no  effect  on  the  tone  of  the  stem.  That  is,  if  the  stem  is  H/L,  it  will 
remain  H/L  regardless  of  the  number  of  suffixes  affixed  to  it,  as  the  HTS 
verbs  in  (5)  and  the  LTS  verbs  in  (6)  illustrate.  In  both  (5)  and  (6),  -N-  'for'  is 
an  applied  suffix;  -ish-  (or  -esh-)  'cause  to'  is  a  causative  suffix;  and  -angan- 
'each  other'  is  a  reciprocal  suffix. 


(5)  Extended  HTS 

Ku  -ship-  a       'to  kill' 
Ku  -long-  a      'to  learn' 
Ku  -akaj-  a       'to  fix' 

(6)  Extended  LTS 


Ku-shfp-il-a 
Ku-l6ng-el-a 
Ku-akaj-il-a 


Ku-ship-il-angan-a 
Ku-l6ng-el-angan-a 
Ku-akaj-fl-angan-a 


Ku-tum-fsh-angan-a 

Ku-mon-esh-angan-a 

Ku-peesh-ish-angan-a 


Ku  -  tum  -a      'to  send'         Ku-tum-ish-a 
Ku  -  mon  -a     'to  see'  Ku-mon-esh-a 

Ku  -peesh-a     'to  give'  Ku-peesh-ish-a 

I  shall  assume  that  in  (5)  and  (6)  each  extension  suffix  has  an 
underlying  HT.  This  is  because  whether  or  not  the  material  that  precedes  the 
suffix  (the  verb  stem)  or  follows  it  (the  final  vowel)  is  present,  the  suffix 
always  bears  a  high  tone.  Note  that  the  stems  in  both  (5)  and  (6)  keep  their 
initial  tone,  HT  for  the  former,  and  low  tone  (LT)  for  the  latter. 

However,  in  both  cases  the  tone  patterns  look  different  when  the  verb  is 
used  in  a  given  tense/aspect  or  a  phrasal  context.  For  example,  consider  the 
tonal  contrast  in  the  case  of  past  perfect  and  relative  stmctures,  as  shown  in 
(7)  for  HTS  verbs,  and  in  (8)  for  LTS  verbs.  In  these  paradigms  -ywa  is  a 
past  form  of  the  auxiliary  verb  Kwikala  'to  be'. 
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(7)  (LTS)       Infinitive      Past  Perfect  Relative  structures 

Ku-tum-a  'to  send'  ba-vwa  ba-tum-a  ba-vwa  ba-tum-a 
Ku-mon-a  '  "  see'  ba-vwa  ba-mon-a  ba-vwa  ba-mon-a 
Ku-peesh-a  '  "give'        ba-vwa  ba-peesh-a  ba-vwa  ba-peesh-a 

(8)  (HTS)       Infinitive      Past  Perfect  Relative  structures 

Ku-shfp-a  'to  kill'  ba-vwa  ba-ship-a  ba-vwa  ba-shipa 
Ku-l6ng-a  '"  learn'  ba-vwa  ba-l6ng-a  ba-vwa  ba-l6nga 
Ku-akaj-a      '"  fix'  ba-vwa  ba-akaj-a     ba-vwa  ba-akaja 

In  (7),  the  stems  remain  low.  Similarly,  the  HT  stems  of  (8)  remain  high. 
The  main  concern  here  is  explaining  the  alternation  of  HT  and  LT  on  the 
auxiliary  -vwa  in  the  past  perfect  and  relative  stmctures.  In  the  past  perfect  of 
either  HTS  or  LTS  verbs  in  both  (7)  and  (8)  the  auxiliary  -vwa  is  high-toned. 
However,  in  the  relative  structures,  -vwa  is  low-toned.  Since  in  Ciluba  the 
auxiliary  -vwa  is  usually  high-toned,  the  fact  that  it  bears  a  low  tone  in  relative 
structures  can  be  accounted  for  as  follows: 

(9)  In  a  relative  structure 

(i)      the  HT  of  the  auxiliary  lowers  after  LT,  except  in  the  third 
person  singular.^ 

(ii)      In  the  third  person  singular  the  auxiliary  prefix  changes  its 
original  low  tone  to  high  tone. 

(9i)  would  be  applicable  to  the  data  of  (7)  and  (8),  while  (9ii)  would 
account  for  data  such  as  those  presented  earlier  in  (1)  and  (2).  Whether  one 
is  dealing  with  the  data  in  (7)  and  (8)  or  with  the  data  in  (1)  and  (2),  it  seems 
that  the  generalizations  of  (9)  apply  under  certain  syntactic  conditions.  In  the 
remainder  of  this  paper  I  shall  examine  those  conditions. 

3.0   Ciluba  Relative  Structures 

Unlike  languages  such  as  English,  French,  or  Swahili,  Ciluba  does  not 
have  an  'overt'  relative  clause  marker,  a  fact  which  makes  relativization  in 
Ciluba  a  very  interesting  topic. 

To  make  the  point  clearer,  let  me  compare  the  English  structures  in 
(10),  the  Swahili  structures  in  (11),  and  their  corresponding  Ciluba 
counterparts  in  (1),  repeated  here  as  (12). 

(10)  a.      The  man  killed  the  lion. 

b.      The  man  who  killed  the  lion... 


(11)  a.       Mtu  a-me-mu-u-a  simba 

man  Ag-PAST-OP-kill  lion 
'The  man  killed  the  lion.' 

b.       Mtu  amba-ye  a-me-mu-u-a  simba 
man  RM-Pron.  Ag-PAST-OP-kill  lion 
'The  man  who  killed  the  lion...' 

(12)  a.       Muntu  u-vwa  mu-ship-a  ntambwe 

man     Ag-PAST-kill    lion 
'The  man  killed  the  lion.' 

b.       Muntu  u-vwa  mu-ship-a  ntambwe 
man     Ag-PST-kill     lion 
'The  man  who  killed  the  lion...' 

In  (10b),  the  English  Wh-word  who  signals  that  the  subject  man  is 
relativized,  and  so  does  the  Swahili  relative  structure  marker  amba  in  (lib). 
As  for  Ciluba,  the  only  thing  that  distinguishes  (12a)  from  (12b)  is  the  tone 
pattern  on  the  auxiliary  u-vwa:  In  (12a)  the  verb  u-vwa  displays  a  low  tone 
followed  by  a  high  tone,  while  in  (12b)  it  displays  a  sequence  of  high  tones. 
Given  that  both  (12a)  and  (12b)  have  the  same  surface  constituent  structure, 
the  question  is  why  is  it  that  (12b),  but  not  (12a),  is  interpreted  as  a  relative 
structure?  I  shall  discuss  this  question  in  the  following  section  by  examining 
various  cases  of  relativization  in  Ciluba,  including  subject  relativization,  direct 
object  relativization,  and  indirect  object  relativization.  I  shall  show  that  what 
we  are  dealing  with  is  the  syntactic  conditioning  (in  terms  of  c-command)  of  a 
phonological  mle.  Since  in  this  particular  case  the  auxiliary  form  involved  is 
the  third  person  singular,  u-vwa,  I  shall  argue  that  the  rule  which  applies  in 
(12b)  is  (9ii),  whereby  the  auxiliary  verb  prefix  u-  changes  its  original  low  tone 
to  a  high  tone. 

3.1    Subject  (SU)  Relativization 

In  an  attempt  to  address  the  question  raised  in  the  preceding  section 
with  regard  to  the  structures  of  (12a)  and  (12b),  I  shall  argue  that  those 
structures  differ  from  one  another  not  simply  because  of  the  verbal  tone 
difference.  Rather,  I  suggest  that  the  tone  difference  is  a  result  of  the 
difference  between  the  underlying  syntactic  structures  of  (12a)  and  (12b).  It  is 
this  syntactic  structural  difference  between  (12a)  and  (12b)  that  creates  an 
environment  for  the  application  of  (9ii)  in  the  surface  syntactic  structure  of 
(12b).  For  reasons  that  will  become  clearer  later  I  shall  assume  that  the  tones 
of  the  auxiliary  u-vwa  in  (12a)  are  basic,  while  those  in  (12b)  are  derived.  I 
propose  the  S-structure  diagrammed  in  (13a)  for  (12a),  and  the  S-structure  in 
(13b)  for  (12b). 
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(13)  a.  Muntu  uvwa  mushipa  ntambwe 
'The  man  killed  the  lion.' 


Muntu 
(man) 


-vwa      -shipa 
(Aux)     (killed) 


b.  Muntu  uvwa  mushipa  ntambw§ 
'The  man  who  killed  the  lion...' 


-shfpa        ntambwe 
(killed)  (lion) 

In  order  to  explain  the  difference  between  (13a)  and  (13b)  I  shall,  for  the 
moment,  use  the  concept  of  'c-command'  loosely,  reserving  its  definition  for 
later.  A  closer  look  at  the  two  tree-diagrams  in  (13)  reveals  two  things:  In 
(13a)  the  NP  Muntu  c-commands  and  is  sister  of  INFL  u-vwa.  In  (13b), 
however,  muntu  'man'  does  c-command  INFL  u-vwa  but  is  not  sister  of  the 
latter.  In  other  words,  in  (13a)  muntu  and  INFL  u-vwa  c-command  each  other 
because  they  are  dominated  by  the  same  node,  S.  However,  in  (13b)  the 
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same   NP  and  INFL  do  not  c-command  each  other  due  to  the  intervening 
maximal  projection,  S". 

The  fact  that  in  (13b)  muntu  and  INFL  u-vwa  are  not  sisters  creates  an 
environment  for  the  application  of  (9ii)  in  the  surface  syntactic  structure  of 
(12b).  What  is  important  here  is  that  the  only  syntactic  information  that  the 
generalization  of  (9ii)  is  sensitive  to  is  the  hierarchical  constituent  structure  of 
(12b)  as  opposed  to  that  of  (12a).  In  this  particular  case  the  syntactic 
information  is  the  c-command  relation  which  does  not  hold  mutually  between 
muntu  and  INFL  uvwa  in  (12b),  but  holds  mutually  between  the  two  in  (12a), 
as  the  tree  diagrams  in  (13b)  and  (13a)  show.  It  seems  that  move-alpha  and 
loss  of  mutual  c-command  relation  between  the  subject  NP  and  the  VP  node 
in  (13b)  creates  an  environment  for  the  application  of  the  Ciluba  sandhi  rules 
given  in  (9ii),  whereby  the  auxiliary  verb  prefix  changes  its  original  low  tone  to 
high  tone. 

The  facts  discussed  here  with  respect  to  subject  (SU)  relativization  in 
(12b)  are  also  attested  to  in  stmctures  where  a  direct  object  (DO)  or  an 
indirect  object  (10)  is  relativized,  as    will  be  shown  below. 

3.2    DO  and  10  relativization 

Consider  the  structures  in  (14)  below.  (14b)  is  a  case  of  DO 
relativization,  whereas  (14c)  is  a  case  of  10  relativization.  In  either  case  the 
relativized  NP  is  put  in  boldface.  (14a)  is  the  corresponding  non-relative 
structure.  Note  that  in  (14a)  the  auxiliary  -vwa  follows  the  logical  subject  of 
the  clause,  baana  'children',  while  in  (14b)  and  (14c)  it  precedes  it.  It  seems 
that  Ciluba  has  a  rule  whereby  the  auxiliary  reverses  its  position  with  the 
logical  subject  in  all  relatives,  except  in  subject  relativization.  When  the  rule 
applies,  the  auxiliary  agrees  not  with  the  logical  subject  but  rather  with  the 
relativized  noun.  A  case  in  point  is  (14c),  where  the  logical  subject,  baana 
belongs  to  noun  class  2,  while  the  relativized  noun,  bilamba  'clothes',  belongs 
to  class  8.  And  since  in  this  case  the  auxiliary  must  agree  with  the  relativized 
noun,  -vwa  in  (14c)  receives  the  class  8  prefix  bi^  for  agreement  with  bilamba. 

(14)  a.       Baana     ba-vwa  ba-peeshe  balongi  bilamba. 
children  Ag-Aux  Ag-give  students  clothes 
'The  children  gave  the  students  clothes' 

b.  B^lbngi'i    ba-vwa  baana    ba-peeshe  t,  bilamba 
students    Ag-Aux  children  Ag-give     clothes 

'The  students  whom  the  children  gave  clothes  to...' 

c.  Bilamba,    bi-vwa  baana    ba-peeshe  balongi  t, 
clothes     Ag-Aux  children  Ag-give  students 

'The  clothes  that  the  children  gave  the  students...' 
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B^lbngi,   ba-vwa  baana   ba-peeshe  t,  bilamba 
The  students  whom  the  children  gave  clothes 


-vwa    -peeshe 
(Aux)     (gave) 


bilamba 
(clothes) 


baiongi,         OP,      baana 
(students)  (children) 

While  in  (15a)  both  baiongi  and  -peesha  c-command  each  other,  in 
(15b)  only  baiongi  c-commands  -peesha  but  the  latter  does  not  c-command 
the  former.  This  lack  of  mutual  c-command  is  the  triggering  syntactic 
conditioning  for  the  phrasal  phonological  rule  given  in  (9i),  which  applies  to 
the  surface  syntactic  stnjture  of  (14b).  The  application  of  a  rule  such  as  this 
results  in  the  lowering  of  the  underlying  high  tone  of  the  auxiliary  -vwa  in 
(14b).  As  a  result,  (14b)  but  not  (14a)  is  interpreted  as  a  relative  stmcture. 

Similar  facts  obtain  for  the  stmcture  of  (14c),  where  the  DO  bilamba 
'clothes'  is  relativized.  The  syntactic  tree  of  (16)  below  is  illustrative.  It  shows 
that  the  NP  bilamba  has  moved  to  the  subject  position  in  the  higher  clause, 
and  like  baiongi  in  (15b),  bilamba  in  (16)  has  left  a  trace  behind,  which 
appears  in  the  position  from  which  it  has  been  moved,  bilamba  c-commands  z 
peesha,  but  the  latter  does  not  c-command  bilamba.  The  remainder  of  the 
syntactic  tree  is  the  same  as  in  (15b). 

In  the  surface  stmcture  of  (14c)  the  auxiliary  -vwa  receives  the  prefix  bii 
for  agreement  with  the  subject  NP  bi-lamba.  while  in  (14b)  -vwa  receives  ba- 
for  agreement  with  the  subject  NP  ba-longi.  However,  as  can  be  seen  in  the 
surface  structures  of  (14c)  and  (14b),  the  tone  of  the  auxiliary  -vwa  in  both 
cases  is  low  in  accordance  with  rule  (9i). 
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The  relevant  question  regarding  the  data  in  (14)  is  why  does  the 
auxiliary  verb  -vwa,  which  is  usually  high-toned  as  in  (14a),  display  a  low  tone 
in  the  structures  of  (14b)  and  (14c)?  In  an  attempt  to  answer  this  question,  I 
propose  to  consider,  first,  the  syntactic  trees  of  (15a)  and  (15b)  below  for  the 
structures  in  (14a)  and  (14b),  respectively. 

In  (15a)  both  the  10  balonqi  'student'  and  the  DO  bilamba  'clothes'  are 
dominated  by  the  same  phrasal  node,  the  VP  bapeeshe  balonqi  bilamba 
'gave  the  student  clothes'.  In  other  words,  the  NP  balonqi  and  the  NP 
bilamba  c-command  each  other  in  (15a).  In  addition,  c-commands  holds  also 
between  both  NPs  and  the  verb  -peesha.  since  the  latter  is  dominated  by  the 
same  phrasal  node  that  dominates  the  two  NPs,  the  VP  bapeeshe  balongi 
bilamba.  Similarly,  in  (15a)  the  NP  baana  'children,  INFL  ba-vwa.  and  the  VP 
bapeeshe  balonqi  bilamba  c-command  each  other  because  they  are 
dominated  by  the  same  node,  S.  Note  that  in  the  surface  structure  of  (15a) 
ba-vwa  has  a  low  tone  followed  by  a  high  tone. 

In  (15b),  however,  ba-vwa  displays  a  sequence  of  low  tones.  Also,  in  the 
tree  of  (15b),  the  verb  -peesha  c-commands  only  the  NP  bilamba.  but  does 
not  c-command  balongi.  Instead,  balonqi  has  moved  to  the  subject  position, 
and  in  the  process  has  left  a  trace  coindexed  with  it  in  the  position  from 
which  it  has  been  extracted,  as  the  tree  in  (15b)  shows.  The  fact  that  balonqi 
appears  in  the  subject  position  in  (15b)  while  in  (15a)  it  is  in  the  indirect 
object  position  shows  that  the  syntactic  stnjcture  of  (15b)  must  be  different 
from  that  of  (15a). 


(15) 


Baana     ba-vwa  ba-peeshe  balongi  bilamba. 
'The  children  gave  the  students  clothes.' 


baana 
(children) 


-vwa    -peeshe     balongi 
(Aux)     (gave)    (students) 


bilamba 
(clothes) 
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(16)  Bilambai  bi-vwa  baana  bapeeshe  balongi  t,  (same  as  (14c)) 
The  clothes  that  the  children  gave  the  students.' 


biiambaj 
(clothes) 


OP;    baana 
(children) 


-vwa  -peeshe  balongi 
(Aux)  (gave)  (students) 


To  summarize:  I  have  used  the  concept  c-command  (cf.  Chomsky  1981) 
in  order  to  account  for  the  difference  between  the  verbal  tone  patterns  in 
Ciluba  relatives  and  non-relatives.  I  have  shown  that  whenever  move-alpha 
applies  in  SU,  DO,  or  10  relativization,  it  results  in  loss  of  mutual  c-command 
between  the  relativized  NP  and  the  verb  of  the  structure  involved.  It  is  under 
this  syntactic  condition  that  the  phonological  phrasal  rules  in  (9)  apply. 

The  syntactic  conditioning  (in  terms  of  c-command)  of  the  rules  in  (9)  is 
similar  to  the  situation  described  in  Manzini  1983  regarding  liaison  in  French. 
Manzini's  hypothesis  about  liaison  in  French  is  presented  in  the  following 
section.  It  is  observed  that  Manzini's  hypothesis  provides  the  basis  for  a 
proper  analysis  of  Ciluba  relativization,  and  that  the  latter  in  turn  furnishes 
further  support  for  Manzini's  hypothesis. 


4.    Liaison  in  French 


Manzini  (1983:8)  observes  that  in  French,  liaison  is  possible  with  certain 
words.  Manzini  suggests  the  rule  presented  in  (17)  below,  where  L  stands  for 
"belonging  to  some  class  L",  and  V  for  "vowel".  The  rule  in  (17)  says  that  an 
obstruent  is  deleted  at  the  end  of  a  word  a^  unless  a  word  j^  follows  which 
begins  with  a  vowel  and  <<  itself  is  an  L  word  (Manzini  1983:8). 
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(17)  [-son]  -— >0/   o(        unless 


The  data  that  Manzini  refers  to  in  connection  with  rule  (17)  include 
cases  such  as  (18)  on  the  one  hand,  and  (19)  on  the  other.  In  the  examples 
given  below  ^  stands  for  liaison;  and  /  indicates  that  liaison  is  not  possible. 

Let  me  observe  that  prior  to  liaison,  words  such  as  donnait  and  un  in 
(18a)  can  be  produced  as  two  separate  words,  and  so  can  ecrivait  and  a  in 
(18b).  However,  when  liaison  applies  each  pair  is  produced  as  a  single  word, 
as  shown  by  the  liaison  sign,  ^.  In  cases  where  liaison  is  impossible,  as  in 
(19),  words  such  as  lunettes  and  a  cannot  be  produced  as  one  single  unit, 
neither  can  lunettes  and  etaient  in  (19b).  The  question  is  why  is  it  that  liaison 
obtains  in  the  structures  of  (18)  but  not  in  (19)? 

(18)  a.       donnaif^un  cours  'was  teaching" 

VP 


V 


NP 


Det    N 
donnait      un     cours 


b.       ecrivaif^a  Paul  'was  writing  to  Paul' 
VP 
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(19)  a.  Donner  les  lunettes/a  Marcel  'give  glasses  to  Marcel' 


D'et    N 
donner    les     lunettes 


a     Marcel 


Les  lunettes  etaient  la 
The  glasses  were  there' 


les  lunettes 


etaient  la 


Manzini  (1983:3)  suggests  that  a  possible  hypothesis  is  a  relation  of  c- 
command  which  is  satisfied  in  (18)  but  not  in  (19).  In  other  words,  for  any 
two  words  involved  in  the  'unless'  provision  of  (17)  a  relation  of  c-command 
holds,  as  in  (20),  with  c-command  defined  as  in  (21)  (Manzini  1983:3). 

(20)  In    ^][^ 

c<   and  /^  c-command  each  other  (obligatory  application) 
oi   c-commands  ^  (optional  application  of  liaison) 

(21)  e<  c-commands  /S>  iff  neither  one  dominates  the  other  and 
the  first  phrasal  node  which  dominates  a(  dominates  ^  also. 

Manzini  explains  that  liaison  obtains  in  (18a)  because  of  the  c-command 
relation  that  holds  between  donnait  and  un.  That  is,  the  first  phrasal  node 
which  dominates  donnait.  namely  VP,  also  dominates  un,  hence  donnait  c- 
commands  un.  However,  in  (19a)  the  NP  lunettes  does  not  c-command  the 
preposition  h,  since  the  first  phrasal  node  which  dominates  lunettes,  viz.  NP, 
does  not  dominate  i  (Manzini  1983:3). 

As  observed,  Manzini's  hypothesis  is  corroborated  by  the  facts  discussed 
with  respect  to  Ciluba  SU,  DO,  and  10  relativization.  Note,  however,  that  the 
rules  that  Manzini  proposes  in  (20)  and  (21)  for  liaison  in  French  work  in  the 
reverse  order  for  Ciluba  relativization.  That  is,  while  in  French,  liaison  obtains 
when  c-command  holds  between  the  elements  involved,  the  Ciluba  facts 
suggest  that  phrasal  sandhi   rules  here  apply  only  in  structures  where  no 
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'mutual  c-command'  holds  between  the  verb  and  the  subject  NP  on  the  one 
hand,  or  between  the  verb  and  its  object  or  one  of  its  object  NPs,  on  the 
other.  Thus  for  Ciluba  phrasal  sandhi  rules  such  as  (9)  to  apply  in  the  surface 
syntactic  structure  of  (12b),  for  example,  the  statements  of  (20)  and  (21)  need 
to  be  modified  as  in  (22)  and  (23),  respectively,  with  'mutual  c-command' 
defined  as  In  (23). 

(22)  In    o(    ]  [   A     :  where  o^  stands  for  lexical  category  V,  and  yS 

for  lexical  category  N, 

o(     does  not  c-command  ft  ,  but 
/3    c-commands  c<  ,  hence  there  is  'no 
mutual  c-command'  between  o(  and  ^ 

(23)  o(    andy^    mutually  c-command  each  other  iff 

i)       neither  one  dominates  the  other; 

ii)      the  first  phrasal  node  which  dominates  oC 

dominates  /§  also; 
iii)(<  c-commands  yg   and  yS   c-commands  «?<{    ; 

c-command:^  c-commands /3    and /5    c-commands  o< 
iff  every  maximal  projection 
dominating  o<,  also  dominates  S 
(Sells  1985:39).  ^ 

Stated  informally,  the  conditions  in  (22)  and  (23)  simply  mean  that  a 
lexical  category,  V,  does  not  c-command  another  lexical  category,  N,  if  N  and 
V  are  not  dominated  by  the  same  first  phrasal  node.  In  the  remainder  of  this 
paper  I  shall  use  the  term  'no  mutual  c-command'  to  refer  to  every  case 
where  an  NP  c-commands  a  verb  but  the  verb  does  not  c-command  the  NP. 

The  statements  of  (22)  and  (23)  are  borne  out  by  the  facts  discussed 
earlier  concerning  Ciluba  SU  relativization,  10  relativization,  and  DO 
relativization.  Further  evidence  for  these  statements  comes  from  data  on 
Ciluba  wh-structures,  which  I  will  discuss  in  the  next  section. 


4.    WH-Constructions 

Ciluba  has  many  ways  of  expressing  interrogation.  These  include  the 
use  of  intonation,  body  language,  and  wh-words.  Here  I  shall  be  concerned 
with  wh-structures.  Ciluba  wh-structures  are  sometimes  marked  by  the 
question  particle  [-epi].  This  particle  is  used  with  subject/verb  agreement 
prefixes  corresponding  to  all  Ciluba  noun  class  prefixes.  Expressions  in  which 
-epi  occurs  include  u-epi  'where  is  he/it?',  and  ba/i-epi  'where  are  they  (e.g. 
men/houses)?'. 
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The  question  marker  relevant  for  the  subsequent  discussion  is  zlA,  a 
particle  whose  meaning  likewise  depends  on  the  prefix  with  which  it 
combines.  For  example,  Nqani/banqani  'who'  is  used  for  questions  that  refer 
to  human  beings,  Cini/binqani  'what'  for  questions  that  refer  to  non-human 
beings,  and  Kuni/kwepi  'where'  for  questions  that  refer  to  location.  The 
structures  in  (24)-(26)  are  illustrative.  The  a-structures  are  affirmative 
statements;  the  corresponding  questions  are  given  in  the  b-structures  and  the 
c-structures. 

(24)  a.       Baana  ba-vwa  ba-mona  bivi 

children  Ag-Aux  Ag-see  thieves 
'The  children  saw  the  thieves.' 

b.  Baana  ba-vwa  ba-mona  BANGANI? 
'The  children  saw  WHO?" 

c.  Bangani,  bavwa  baana  bamona  t, 
'WHO  did  the  children  see?" 

(25)  a.       Baana      ba-vwa  ba-mona  ntambwe 

Children  Ag-Aux  Ag-see    lion 
'The  children  saw  the  lion' 
/ 

b.  Baana  ba-vwa  ba-mona  CINI? 
'The  children  saw  WHAT?' 

/ 

c.  CINI;  ci-vwa  baana  bamona  t, 
'WHAT        did  the  children  see?' 

(26)  a.       Baana  ba-vwa  ba-l6nga  ku  Chicago 

children  Ag-Aux  Ag-go  to  school  to  Chicago 
'The  children  went  to  school  in  Chicago' 

b.  Baana  ba-vwa  ba-l6nga  K^NI/KWEPI? 
'The  children  went  to  school    WHERE?' 

c.  KUNI/KWEPI,  ku-vwa  baana  ba-l6nga  t^ 
'WHERE  did  the  chidren  go  to  school?' 

Note  that  in  Ciluba  a  WH-word  can  be  left  in  situ  as  in  the  b-structures, 
or  it  can  be  fronted  via  move-alpha  to  the  subject  position,  as  can  be  seen  in 
the  c-structures.  However,  if  movement  occurs,  the  WH-word  not  only  leaves 
behind  a  trace  that  is  syntactically  bound  with  it,  but  it  also  triggers  verbal 
tone  changes  in  the  structures  involved,  as  the  c-structures  of  (24)-(26)  show. 
Note  further  that  the  rule  of  auxiliary  verb/logical  subject  inversion  referred  to 
earlier  with  respect  to  the  data  in  (14)  applies  also  in  the  c-structures  of  (24)- 
(26).  By  this  rule,  the  auxiliary  reverses  its  position  with  the  logical  subject  of 
the  structure  such  that  the  auxiliary  agrees  with  the  derived  subject  rather 
than  with  the  logical  subject.  This  explains  why  in  (26c),  for  example,  the 
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auxiliary  agrees  with  the  moved  wh-word,  ku-ni/ku-epi  'where'  rather  than  with 
the  logical  subject,  baana  'children'. 

in  the  structures  in  the  a  and  b  versions  of  (24)-(26),  the  auxiliary  -ywa 
has  a  high  tone.  In  the  c-stmctures,  however,  it  has  a  low  tone.  The  tonal 
alternation  on  the  auxiliary  -ywa  in  (24)-(26)  is  of  the  same  nature  as  the  one 
in  the  relative  structures.  Therefore,  I  will  again  propose  that  in  the  c- 
structures  of  (24)-(26)  the  auxiliary  -vwa  displays  a  low  tone  as  a  result  of 
move-alpha  and  loss  of  mutual  c-command  relation  between  the  verb  and  the 
moved  Wh-word. 

In  order  to  account  for  the  occurrence  of  the  low  tone  on  -vwa  in  the  c- 
structures  of  (24)-(26),  I  propose  in  (27a,  b)  below  the  S-structures  for  (24b, 
c).  Note  that  the  remaining  structures,  viz.  (25)  and  (26)  can  be  accounted  for 
in  the  same  fashion  by  replacing  the  lexical  items  of  (27)  with  relevant  lexical 
items  from  the  corresponding  stnjctures  in  (25)  and  (26). 


(27)  a. 


Baana  bavwa  bamona  BANGANI 
'The  children  saw  who?" 


baana 
(children) 


-vwa  -mona 

(Aux)  (see) 

BANGANI,  bavwa  baana  bamona  t, 
'Who  did  the  children  see?' 


bangani 
(who) 


baana 
(children) 


-vwa 
(Aux) 


-mona 
(saw) 
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The  concept  that  I  have  been  referring  to  thus  far,  mutual  c-command, 
can  be  used  also  here  to  explain  the  difference  between  (27a)  and  (27b).  In 
(27a)  the  verb  -mona  and  the  wh-word  bangani  c-command  each  other 
because  they  are  dominated  by  the  same  node,  the  VP  bamona  bangani 
'saw  who'.  However,  there  is  no  mutual  c-command  relation  between  the  two 
in  (27b),  since  the  first  phrasal  node  which  dominates  the  verb  -mona,  VP, 
does  not  also  dominate  the  wh-word  bangani.  Instead,  in  (27b)  the  wh-word 
bangani  has  moved  to  Comp.  In  the  process  the  moved  wh-word  has  left 
under  the  VP  node  a  trace,  t„  which  is  coindexed  with  it.  The  facts  adduced 
here  show  that  the  hierarchical  syntactic  structure  of  (27b)  is  not  identical  to 
the  hierarchical  syntactic  structure  of  (27a).  In  (27b),  where  wh-movement  has 
occurred,  that  difference  is  reflected  by  the  tone  change  on  the  auxiliary  -vwa, 
as  can  be  seen  in  (24b).  And  the  tone  change  must  be  accounted  for  by  a 
sandhi  rule  similar  to  (9i),  which  is  sensitive  to  the  absence  of  mutual  c- 
command  between  the  verb  and  the  preceeding  NP. 

We  are  now  in  a  position  where  we  can  provide  a  more  systematic 
account  of  Ciluba  relative  structures.  Although  they  do  not  contain  overt 
relative  pronouns  or  markers,  syntactically  they  are  similar  to  structures  with 
fronted  interrogative  pronouns,  just  as  in  many  other  languages:  Both  in 
relative  structures  and  in  stmctures  with  fronted  interrogatives,  move-alpha 
brings  about  the  loss  of  mutual  c-command  between  the  verb  and  the 
preceding  constituent.  A  syntactic  consequence  is  that  the  verb  agrees  with 
the  initial  constituent  which  may  be  different  from  the  logical  subject.  On  the 
phonological  level,  the  lack  of  mutual  c-command  triggers  tonal  sandhi  rules. 
In  the  case  of  relative  structures,  the  resulting  tone  changes  may  be  viewed 
as  an  indirect  marker  of  relative  clause  formation.  However,  since  the  same 
changes  take  place  in  stmtures  with  fronted  wh-words,  the  tonal  changes 
cannot  be  considered  as  functioning  primarily  as  relative  markers.  At  best,  it 
could  be  argued  that  tone  change  functions  secondarily  as  something  like  a 
signal  of  relative  clause  formation. 


5.0  Alternative  approaches 

5.1  Relative  clauses  and  feature  marking 

One  way  to  approach  the  data  on  Ciluba  relatives  and  moved 
interrogatives  would  be  to  assume  that  there  exist  in  Ciluba  bundles  of 
syntactic  features  such  as  [+/-Relative],  which  mark  relative  structures  as 
[+Relative]  and  non-relative  as  [-Relative].  So,  if  a  structure  is  marked  with 
the  feature  [4-Relative],  then  one  would  expect  a  tone  change  on  the  verb  as 
opposed  to  the  tone  pattern  that  the  same  verb  would  have  in  non-relative 
structures.  In  this  approach,  the  grammar  of  Ciluba  would  have  to  have  many 
bundles  of  such  features  in  order  to  account  for  each  individual  structure, 
whether  relative,  interrogative,  or  others.  The  problem  with  this  approach  is 
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that  it  does  account  for  all  the  facts.  For  as  the  discussion  of  interrogatives 
has  shown,  it  is  not  sufficient  for  a  structure  to  be  marked  as  [+  interrogative] 
to  undergo  verbal  tone  sandhi.  Rather,  it  is  the  application  of  move-alpha  and 
the  resulting  absence  of  mutual  c-command  that  triggers  the  tone  sandhi. 

5.2    Ciluba  tone  reversal 

Another  approach  that  may  at  first  appear  consistent  with  the  data 
discussed  in  this  paper  is  the  'tone  reversal  approach'  proposed  in  Maddieson 
1976.  Tone  reversal  is  a  phenomenon  by  which  tones  in  Ciluba  have  been 
reversed  from  their  initial  value  in  Proto-Bantu.  That  is,  Proto-Bantu  high  tone 
has  become  Ciluba  low  tone,  and  Proto-Bantu  low  tone  has  become  Ciluba 
high  tone  (Meeussen  1967,  1971;  Van  Spaandonck  1971,  Maddieson  1976). 

In  an  earlier  publication  on  tone  reversal  in  Ciluba,  Maddieson  (1976) 
suggests  that  the  verbal  tone  change  in  Ciluba  relative  structures  can  be 
explained  as  originating  from  the  loss  of  an  augment  that  Ciluba  had  in  its 
initial  stage  (e.g.  Du  Bois  1970).  Under  this  approach  it  is  assumed  that  in  its 
initial  or  pre-Ciluba  stage,  Ciluba  must  have  had  an  augment,  i.e.  a  pre-prefix 
of  some  sort,  generally  a  vowel  used  with  both  nouns  and  verbs  (Maddieson 
1976:157).  The  augment  must  have  been  high-toned,  as  can  be  inferred  from 
the  derivations  given  in  (30)  and  (31)  below.  However,  the  high  tone  of  the 
augment  changed  to  a  low  tone  as  a  result  of  the  aforementioned  tone 
reversal. 

Maddieson  explains  that  since  Ciluba  no  longer  has  an  augment,  the 
augment  must  have  been  deleted  by  a  contraction  rule  in  pre-Ciluba.  In  the 
process,  the  augment  left  a  floating  low  tone  on  the  tonal  tier  of  Ciluba  verbs 
and  nouns.  Maddieson  concludes  that  after  tone  reversal  and  augment 
deletion  (in  that  order),  the  resulting  floating  low  tone  must  have  spread 
rightward  onto  the  adjacent  tone  bearing  unit,  usually  a  vowel.  Maddieson 
(1976:161)  gives  the  rules  in  (29)  which  he  says  can  account  for  the 
sequences  in  (28).  He  postulates  the  sequences  in  (28)  as  the  underlying 
tonal  stnjcture  of  verbs  in  Ciluba  relative  structures.  Maddieson  does  not 
define  the  contraction  rule  formally  because,  in  his  words,  doing  so  would 
have  created  problems  that  were  beyond  the  scope  of  his  paper.  As  a 
consequence,  in  the  examples  below,  I  will  have  to  fudge  a  little  as  far  as  the 
application  of  this  rule  is  concerned. 

(28)        augment        prefix  vert  stem  final  vowel 

L  H  {H}  L 

{L} 
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(29)  a.  HL    {L} >  HH    {L} 

{#}  {#} 

b.  LH    {H}  --->  LL     {H} 

{#}  {#} 

c.  Contraction  (e.g.  H  L  L  — ->  H  L  ) 

VCVCV       CVCV 

Some  of  the  few  cases  that  Maddieson  (1976:161)  gives  in  which  the  rules 
stated  in  (29)  apply  include  the  examples  given  in  (30)  and  (31),  below. 

Pre-Clluba  Modern  Ciluba 

(30)  *abatoma     >  batuma  'they  who  sent' 

Derivation  (Maddieson  1976:161): 

abatoma     >  batoma  (augment  deletion) 

L  H  H  L      >  L  L  H  L 

1    I    I   I 

L  L  H  H    (tone-spreading) 

I    I    1/ 

L  L  H  (contraction) 


(31)       *abadema     >  badima 'they  who  cultivate' 

Derivation  (Maddieson,  ibid.) 


abadema     >       badima  (augment  deletion) 

L  H  L  L     >       L  H  L  L 

I  I  11 

L  H  H  L    (tone-Spreading) 

I  I  M 

L  H  H  H    (tone-spreading) 

L  H  H  (contraction) 

Maddieson's  approach  may  appear  to  account  nicely  for  data  such  as 
(30)  and  (31).  However,  the  data  under  consideration  present  only  part  of  the 
problem  that  Ciluba  relative  structures  raise.  First,  let  us  consider  again  the 
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examples  discussed  in  section  2,  which  are  similar  to  Maddieson's  examples 
in  (30)  and  (31)  above.  Recall  that  for  the  njles  in  (29)  to  provide  correct 
derivations  for  the  data  in  (30)  and  (31),  Maddieson  assumes  that  (30)  and 

(31)  each  originate  with  an  augment  in  their  respective  underlying  form. 
Based  on  this  assumption  one  might  conclude  also  that  the  lexical  items  in 

(32)  each  include  a  pre-prefixed  augment  underlyingly. 

(32)  a.      Bavwa  batuma  mwana     'They  sent  the  child' 

b.      Bavwa  batuma  mwana     'They  who  sent  the  child  ...* 

Here,  I  shall  focus  on  the  auxiliary  ba-vwa.  The  problem  is  explaining 
the  alternation  of  HT  and  LT  on  the  verb  stem,  -vwa.  Note  that  in  (32a)  -vwa 
has  a  high  tone,  while  in  (32b)  it  has  a  low  tone. 

Accepting  Maddieson's  approach,  the  following  analysis  can  be  proposed 
for  the  tonal  patterns  of  the  auxiliary  verb  ba-vwa  in  (32b):  ba-vwa  is  pre- 
prefixed  with  an  augment  underlyingly,  and  the  augment  is  high-toned. 
However,  the  augment  changes  its  original  high  tone  to  a  low  tone  as  a  result 
of  tone  reversal.  Next,  the  augment  is  deleted.  After  augment  deletion  the 
resulting  floating  low  tone  on  the  tonal  tier  of  ba-vwa  spreads  onto  the  next 
tone  bearing  unit,  in  this  particular  case  on  the  vowel  a  of  the  prefix  ba-. 
Finally,  the  derivation  is  completed  by  applying  other  tone-spreading  rules  and 
a  contraction  rule,  as  defined  in  (29).  This  explains  why  the  prefix  ba-  of  the 
auxiliary  ba-vwa  has  a  low  tone  in  (32b).  A  derivation  similar  to  (30)  and  (31) 
is  given  in  (33),  for  ba-vwa  in  (32). 

(33)  *abavwa        >  bavwa     (augment  deletion) 

L    L     H        >  L  L    H 

^   I 

L    H     (contraction  njle) 

Note,  however,  that  what  distinguishes  the  non-relative  surface  structure 
of  (32a)  from  the  relative  structure  in  (32b)  is  the  tone  of  the  auxiliary  stem,  - 
vwa.  not  the  tone  of  the  auxiliary  prefix,  ba-  has  a  low  tone  in  both  (32a)  and 
(32b),  whereas  the  stem,  -vwa.  has  high  tone  in  (32a),  but  low  tone  in  (32b). 
As  can  be  seen  from  the  derivation  in  (33),  Maddieson's  approach  would 
wrongly  predict  that  -vwa  bears  a  high  tone  in  (32b),  whereas  the  rules  I 
proposed  in  (9)  correctly  predict  that  it  has  a  low  tone. 

Furthermore,  let  us  look  at  structures  similar  to  the  ones  in  (1),  but  not 
containing  an  auxiliary;  cf.  (34). 

(34)  a.      Muntu  u-aka-shfp-a  ntambwe     'the  man  killed  the  lion' 

b.       Muntu  u-aka-ship-a  ntambwe     'the  man  who  killed  the  lion...' 

In  (34),  u-  is  a  subject-verb  agreement  prefix,  -aka-  is  a  tense  marker,  : 
ship-  is  a  verb  stem,  and  :§  is  a  final  vowel  suffix.  The  examples  in  (34) 
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show  that  also  in  relative  structures  without  auxiliary  the  verb  displays  a  tonal 
pattern  different  from  the  pattern  in  non-relative  stmctures.  Thus,  u-aka-ship-a 
in  the  non-relative  structure  in  (34a)  displays  a  sequence  of  low  tones 
followed  by  high  tones,  while  in  the  relative  structure  in  (34b)  it  shows  a 
sequence  of  high  tones  followed  by  low  tones.  Examples  like  these  show  that 
what  is  going  on  in  terms  of  tone  alternation  is  not  tone  lowering,  which 
Maddieson's  account  would  predict,  but  a  tone  reversal. 

Finally,  there  is  no  evidence  internal  to  Ciluba  which  shows  that  in  its 
initial  stage  the  language  had  an  augment,  as  Maddieson  suggests.  The  fact 
that  the  augment  is  used  in  related  languages  does  not  necessarily  imply  that 
Ciluba  also  had  an  augment  at  some  stage  in  its  development.  Furthermore, 
if  one  holds  as  a  matter  of  theoretical  principle  -  as  do,  for  example,  Pullum 
and  Zwicky  (1975)  and  Zwicky  and  Pullum  (1983),  Zwicky  (1985)  -  that  no 
(language-specific  and  rule-specific)  conditions  on  the  applicability  of  syntactic 
rules  in  any  language  may  make  reference  to  phonological  features  (e.g. 
Plank  1984:330-331),  then  reference  to  a  ghost  augment  in  the  formation  of 
relatives  in  Ciluba  must  appear  suspect. 

The  foregoing  discussion  shows  that  Maddieson's  proposal  is  less  viable 
than  the  proposal  I  have  suggested  in  this  paper  because  it  cannot  handle 
the  facts  about  Ciluba  relatives  and  moved  interrogatives.  In  my  proposal 
those  facts  can  property  be  accounted  for  if  one  refers  to  concepts  such  as 
'mutual  c-command',  as  defined  in  (23),  which  allow  us  to  distinguish  Ciluba 
relatives  from  non-relatives,  and  to  distinguish  wh-structures  in  which  wh- 
fronting  has  applied  from  those  in  which  it  has  not. 


6.    Conclusion 

This  paper  has  been  concerned  with  the  following  questions: 

(a)  How  are  relative  clauses  formed  in  Ciluba? 

(b)  How  can  Ciluba  relative  clauses  be  distinguished  from  non-relative 
clauses? 

(c)  Why  is  it  that  in  Ciluba  the  verb  of  a  relative  clause  usually 
displays  a  tone  pattern  different  from  the  tone  pattern  it  has  in  non- 
relative  clauses,  even  though  such  clauses  have  the  same  surface 
constituent  structure  as  relative  clauses? 

The  paper  has  addressed  these  questions  by  examining  various  cases 
of  relativization  in  Ciluba,  including  SU,  DO  and  10  relativization.  It  was 
argued  that  Ciluba  relative  clauses  are  formed  by  move-alpha,  which  results 
in  loss  of  mutual  c-command  relation  between  the  elements  involved  and 
consequent  verbal  tone  changes.  The  verbal  tone  changes  obtain  by  applying 
the  phrasal  sandhi  rules  stated  in  (9).  Such  rules,  I  have  argued,  are 
syntactically    conditioned    and    apply    only    in    structures    where    'mutual    c- 
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command',  as  defined  in  (23),  does  not  hold  between  the  subject  NP  and  the 
VP,  nor  between  the  verb  and  its  object  (or  one  of  its  object)  NP(s).  These 
conclusions  are  further  supported  by  the  behavior  of  moved  interrogatives. 


7.    Theoretical  Implications. 

Earlier  in  this  paper  the  question  was  raised  of  whether  the  principle  of 
phonology-free  syntax  (e.g.  Pullum  and  Zwicky  1975,  1983)  is  correct,  and  if 
it  is,  how  does  one  explain  the  fact  that  in  Ciluba,  syntactic  phenomena  such 
as  relativization  are  on  the  surface  more  visible  in  phonology  rather  than  in 
syntax?  This  paper  has  shown  that,  as  in  many  other  languages  (e.g.  Kaisse 
1977,  1981,  Plank  1984,  Kenstowic  1987),  it  is  possible  to  maintain  that 
syntax  is  independent  of  phonology,  but  that  phonological  rules  may  be 
sensitive  to  syntax.  In  fact,  the  tonal  sandhi  rules  in  Ciluba  relative  structures 
seem  to  be  exclusively  triggered  by  a  syntactic  factor,  namely  the  absence  of 
mutual  c-command  between  the  verb  and  the  preceeding  constituent. 


NOTES 


An  earlier  version  of  this  paper  was  presented  at  the  19th  Annual 
Conference  on  African  Linguistics,  Boston  University,  Boston,  Massachusetts, 
April  14-17,  1988.  I  would  like  to  express  my  gratitude  to  Hans  Henrich  Hock, 
and  to  an  anonymous  SLS  reviewer  for  their  insightful  comments  and 
suggestions.  I  also  aknowledge  with  gratitude  the  comments  and  suggestions 
of  my  colleague,  Laura  Downing,  on  the  pre-publication  version  of  this  paper. 
Any  remaining  errors  are,  of  course,  mine  alone. 

'  Ciluba  is  one  of  the  four  national  languages  spoken  in  the  Republic  of  Zaire. 
The  other  three  languages  include  Kikongo,  Lingala,  and  Swahili. 

^  The  two  rules  given  in  (9)  are  needed  to  account  for  the  behavior  of  Ciluba 
relatives,  as  the  additional  data  presented  below  also  suggest.  The  data  show 
that  in  relative  clauses,  -vwa  in  the  third  person  singular  has  a  different  tone 
pattern  than  in  the  other  persons.  In  the  third  person  singular,  -vwa  keeps  its 
underlying  tone,  high  tone,  while  in  the  other  persons  it  reverses  its  high  tone 
to  low  tone,  as  the  njles  in  (9)  predict. 
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Non-Relatives 

Relatives 

m-  vwa  n-  sama 

'1  was  sick  ' 

rn-  vwa  n-  sama 

'1  who  was  sick. 

u-  vwa  u-  sama 

'You  were..." 

u-  vwa  u-  sama 

'You  who  were... 

u-  vwa  u-  sama 

'He  was  ...  ' 

u-  vwa  u-  sama 

'He  who  was... 

tu-vwa  tu-sama 

'We  were  ..." 

tu-vwa  tu-sama 

'We  who  were... 

nu-vwa  nu-sama 

'You  were...' 

nu-vwa  nu-sama 

'You  who  were. 

ba-vwa  ba-sama    'They  were..'      ba-vwa  ba-sama    'They  who  were.. 


REFERENCES 


CHOMSKY,  Noam.  1981.  Lectures  on  government  and  binding.  Dordrecht: 
Foris  Publications. 

CLEMENTS,  George.  1978.  Tone  and  syntax  in  Ewe.  Elements  of  tones, 
stress,  and  intonation,  ed.  by  Donna  Jo  Napoli,  21-99.  Georgetown 
University,  Washington,  D.C.:  Georgetown  University  Press. 

DU  BOIS,  K.F.  1970.  The  Augment  in  the  Bantu  languages.  Africana 
Linguistica  4.85-167. 

HETZRON,  R.  1972.  Phonology  in  syntax.  Journal  of  Linguistics  8.251-265. 

KAISSE,  Ellen  M.  1977.  On  the  syntactic  environment  of  a  phonological  rule. 
Chicago  Linguistic  Society  13.173-185. 

1981.    Separating    phonology    from    syntax:    A    reanalysis    of    Pashto. 

Journal  of  Linguistics  17.197-208. 

KENSTOWICZ,  Michael.  1987.  The  phonology  and  syntax  of  wh-expressions 
in  Tangale.  Studies  in  the  Linguistic  Sciences  15:2.79-92. 

KLAVANS,  Judith  L  1985.  The  independence  of  syntax  and  phonology  in 
cliticization.  Language  61:1.95-120. 

MADDIESON,  Ian.  1976.  Tone  reversal  in  Ciluba:  A  new  theory.  Studies  in 
Bantu  tonology.  South  Carolina  Occasional  Papers  in  Linguistics  3.141- 
165. 

MANZINI,  Maria  R.  1983.  Syntactic  conditions  on  phonological  rules.  MIT 
Working  Papers  in  Linguistics  5.1-9. 

MEEUSSEN,  A.E.  1967.  Bantu  grammatical  reconstructions.  Africana 
Linguistica  3.30-121. 

.1971.    Relative    clauses    in    Bantu.    Studies    in    African    Linguistics, 

Supplement  2.3-10. 


109 

PLANK,  Frans.  1984.  Romance  disagreements:  Phonology  interfering  with 
syntax.  Journal  of  Linguistics  20.329-349. 

PULLUM,  G.K.,  and  A.M.  Zwicky.  1975.  Sections  from  The  phnciple  of 
phonology-free  syntax'.  Ohio  State  University  Worl<ing  Papers  in 
Linguistics  18.172-183. 


1983.  The  syntax-phonology  interface.    New  York:  Academic  Press. 


SADOCK,  Jerrold  M.  1983.  The  necessary  overlapping  of  grammatical 
components.  Papers  from  the  Parasession  on  the  Interplay  of 
Phonology,  Morphology,  and  Syntax,  198-221.  Chicago  Linguistic 
Society. 

SELKIRK,  Elisabeth.  1984.  Phonology  and  syntax:  The  relation  between 
sound  and  structure.  Cambridge,  MA:  The  MIT  Press. 

SELLS,  Peter.  1985.  Lectures  on  contemporary  syntactic  theories.  Chapter  3: 
Government-Binding  Theory,  19-73.  Stanford:  Center  for  the  Study  of 
Language  and  Information,  Stanford  University. 

VAN  SPAANDONCK,  Marcel.  1971.  On  the  so-called  reversive  tone  system  in 
Ciluba.  Studies  in  African  Linguistics  2.131-144. 

ZWICKY,  Arnold  M.  1985.  Rules  of  allomorphy  and  phonology-syntax 
interactions.  Journal  of  Linguistics  21.431-436. 


Studies  in  the  Linguistic  Sciences 
Volume  18,  Number  2,  Fall  1988 


Root  control,  dnderspecification,  and  atr  harmony. 


Meterwa  A.  Ourso 


In  Lama  phonology  [+ATR]  high  vowels  trigger 
harmony  across  the  word,  while  [+ATR]  mid  vowels  do  not. 
When  they  occur  with  [+ATR]  high  vowels  within  the  same  root, 
the  mid  vowels  do  not  block  the  spreading  of  [+ATR]  value  on 
to  the  affixes  adjacent  to  them.  Assuming  that  affixes  are  not 
specified  for  ATR  in  the  UR,  1  argue  that  the  [+ATR]  mid  vowels 
are  transparent.  Considered  within  the  combined  approach  of 
Root  Controlled  Harmony  (Clements  1981)  and  the  theory  of 
underspecification  (Pulleyblank  1983,  1986,  and  1988,  and 
particularly  van  der  Hulst  and  Smith  1986),  the  ATR  vowel 
harmony  in  Lama  becomes  straightforward.  It  is  shown  in  this 
paper  (a)  that  [+ATR]  is  the  only  value  needed  for  the  feature 
ATR  in  the  UR;  (b)  that  the  OCR  plays  a  crucial  role  in  the 
vowel  harmony  of  Lama. 


0.  Introduction. 

This  paper  is  an  investigation  into  ATR  (Advanced  Tongue  Root)  vowel 
harmony  in  Lama,  a  Gur  language  of  the  sub-ohental  family.  The  harmonic  facts 
considered  here  can  be  summarized  as  follows:  (1)  High  vowels  identified  as 
[+ATR]  in  the  Underlying  Representation  (UR)  of  the  language  never  occur  with 
[-ATR]  vowels  at  the  surface  level.  (2)  Mid  vowels  identified  as  [+ATR]  in  the  UR 
do  appear  with  [-ATR]  vowels  at  the  surface  level.  (3)  The  low  vowel  /a/  surfaces 
as:  (a)  [+ATR]  in  roots  with  [+ATR]  high  vowels;  (b)  [-ATR]  In  roots  with  [+ATR] 
mid  vowels,  and  (c)  [-ATR]  with  all  the  other  [-ATR]  vowels. 

Except  for  the  asymmethc  behavior  of  the  [+ATR]  mid  vowels  with  respect 
to  the  low  vowel  (and  some  complexities  involving  affixes  to  be  discussed  below), 
the  ATR  vowel  harmony  would  be  straightforward  within  the  approach  of  the 
'Root  Controlled  Harmony'  proposed  in  Clements  1981  and  supported  in 
Pulleyblank  1986  and  1988. 

Within  that  approach,  the  feature  [ATR]  is  a  property  of  the  root,  which  in 
turn,  passes  it  down  to  the  affixes  which  are  not  specified  for  that  feature.  Crucial 
to  Clements's  theory,  both  feature  values,  [+ATR]  and  [-ATR],  must  be  specified 
in  the  UR  to  account  for  Akan  vowel  harmony.  His  motivation  is  that  the  low 
vowel  /a/  in  Akan  is  an  opaque  segment  which  must  be  lexically  specified  as 
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[-ATR]  in  the  UR.  Moreover,  he  argues  that  the  low  vowel  must  link  to  that  lexical 
ATR  value  before  the  association  conventions.  It  follows  from  Clements's  theory 
that  both  harmonic  roots  and  disharmonic  roots  must  be  posited  in  the  UR  to 
explain  the  presence  of  [+ATR]  and  [-ATR]  vowels  within  the  same  word  at  the 
surface  level.  An  important  point  to  remember  from  Clements's  framework  is  that: 
(1)  opaque  segments  such  as  the  low  vowel  in  Akan  block  [+ATR]  harmony;  (2) 
they  do  not  undergo  [+ATR]  harmony;  and  (3)  they  spread  their  lexically  specified 
[-ATR]  value  on  to  the  affixes  within  their  domain. 

Contrary  to  the  behavior  of  opaque  segments  exemplified  by  the  low  vowel 
in  Akan  (Clements  1981)  and  Bah  (Hall  and  Yokwe  1978,  van  der  Hulst  and 
Smith  1986),  the  mid  vowels  in  Lama  pose  harmony  problems  of  the  following 
nature.  First,  when  a  suffix  of  the  stmcture  CV  is  attached  to  a  monosyllabic  root 
with  a  mid  vowel,  the  vowel  of  that  suffix  is  realized  as  [-ATR]  irrespective  of  the 
ATR  value  of  that  root  vowel  in  the  UR.  Second,  when  a  suffix  of  the  same 
structure  is  attached  to  a  bisyllabic  root  with  both  high  and  mid  vowels,  the  suffix 
is  realized  as  [+ATR],  provided  the  high  vowel  of  the  root  is  [+ATR]  in  the  UR;  the 
position  of  the  root  vowels  has  no  bearing  on  this  harmonic  fact.  Thirdly,  if  a 
prefix  and  a  suffix  attach  to  a  [+ATR]  mid  vowel  root,  both  affixes  will  be  realized 
as  [-ATR];  they  will  appear  as  [+ATR]  if  the  root  vowel  is  a  [+ATR]  high  vowel. 
And  finally,  the  low  vowel  /a/  neither  blocks  nor  triggers  ATR  harmony  in  Lama. 

In  this  paper,  I  argue  that  the  'Root  Controlled'  approach  proposed  in 
Clements  1981  cannot  account  for  Lama  ATR  vowel  harmony  without  resorting  to 
various  ad  hoc  njles.  I  will  show  that  the  asymmetric  facts  presented  by  Lama 
vowel  harmony  fall  out  from  the  general  phnciples  of  Universal  Grammar  (UG) 
when  considered  within  the  framework  of  both  'Root  Control'  and  'Under- 
specification'. 

In  Pulleyblank  1986  and  1988,  it  is  argued  that  for  ATR  as  a  contrasting 
feature  only  one  value  is  specified  in  the  UR;  the  other  value  is  supplied  by  rule. 
Pulleyblank  convincingly  shows  that  [-i-ATR]  is  the  specified  value  for  the  feature 
ATR  in  the  UR.  I  adopt  this  position  for  the  analysis  of  the  data  discussed  in  this 
paper.  Following  van  der  Hulst  and  Smith  (1986),  I  will  argue  that  the  mid  vowel 
in  Lama  is  lexically  linked  to  [+ATR]  when  it  is  specified  for  that  feature  in  the  UR. 
I  will  further  argue  that,  being  linked  to  its  lexical  ATR  value  before  the 
association  conventions,  the  [h-ATR]  mid  vowel  acquires  the  status  of  a  neutral 
vowel  in  Lama.  Within  this  approach,  the  ATR  harmony  becomes  straightforward 
in  that  the  neutral  vowel  neither  spreads  nor  prevents  the  spreading  of  the 
[+ATR]  value  from  other  sources. 

The  paper  is  structured  as  follows.  The  first  section  introduces  the  vowel 
system  of  Lama.  The  second  section  presents  and  illustrates  the  harmonic  facts 
with  respect  to  the  features  ATR,  High,  and  Low.  In  section  3,  I  first  discuss  the 
Root  Controlled  Principle  and  relevant  aspects  of  the  theory  of  underspecification 
in  relation  to  the  facts  presented  in  section  2;  I  then  propose  an  analysis  of  these 
facts  within  a  combined  approach  of  Root  Controlled  Harmony  and 
Underspecification.  The  final  section  of  the  paper  summarizes  my  arguments. 
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1.  The  vowel  system  in  Lama. 


Throughout  this  paper,  the  characters  in  bold  face  indicate  [+ATR]  vowels 
and  the  vowels  in  normal  characters  are  [-ATR]  ones.  The  symbols  (3/3)  will  be 
used  for  the  [+ATR]  and  [-ATR]  schwas  respectively.  Brackets  [  ]  indicate 
segments  at  the  surface  level  and  slashes  /  /  indicate  segments  in  the  UR.  The 
acute  accent  marks  a  high  tone,  the  grave  accent  indicates  a  rising  tone  (which  is 
always  realized  as  a  low  tone  at  the  surface  level),  and  the  circumflex  accent 
marks  a  falling  tone;  the  level  low  tone  is  not  marked  to  distinguish  it  from  the 
rising  tone.1 

1.1.  Front  and  back  vowels. 

Lama  has  4  front  vowels  and  4  back  vowels  described  and  contrasted  in 
this  section.  The  front  vowels  are  presented  below. 

The  vowels  [i/i].  The  two  vowels  differ  from  eac^  other  in  the  amount  of 
pressure  required  from  the  tongue  in  their  production,  [i]  is  produced  with  high 
tension  on  the  tongue  and  with  the  vocal  cords  in  strong  vibration;  as  a  result,  the 
front  vowel  [I]  is  more  audible  than  its  counterpart  [i].  In  the  production  of  the 
latter,  no  tension  is  required  from  the  tongue  and  the  vocal  cords  are  hardly 
vibrating,  resulting  in  a  less  perceptible  sound.  The  two  sounds  are  contrasted  in 
minimal  pairs  like  the  following. 

(1)  si  'wear" 
si  'enter' 
sisa  'tighten' 
sis3  'stick  in' 

The  constrast  between  the  two  vowels  is  evidence  that  both  of  them  are 
required  in  the  UR. 

The  vowels  [e/e].  In  the  production  of  the  vowel  [e],  the  tension  required 
from  the  tongue  is  lower  than  in  the  production  of  the  front  vowel  /i/;  but  that 
tension  is  higher  than  in  the  production  of  the  front  l\l  or  the  vowel  [e].  The 
following  illustrates  the  contrast  between  the  two  vowels  [e/e]. 

(2)  se        'run' 
se        'plant' 
ress     'dream' 

ress     'pass  (something)  over' 

As  in  the  case  of  the  high  vowels,  the  preceding  illustration  shows  that 
both  lei  and  lei  exist  in  the  UR  of  the  language.  The  difference  between  the  pairs 
i/i  and  e/e  is  in  the  height  the  tongue  adopted  in  the  production  of  each  pair. 
Following  Chomsky  and  Halle  1968,  I  argue  that  it  is  the  feature  [high]  which 
distinguishes  the  two  pairs  of  front  vowels.  The  value  [+high]  characterizes  the 
vowels  i/i  whereas  [-high]  describes  the  pair  e/e. 


There  are  4  contrasting  back  vowels  in  Lama;  these  are  [u/u],  [o/o],  and 
they  contrast  in  the  same  way  as  the  front  vowels. 

The  high  back  vowels  [u]  and  [u]  contrast  in  pairs  like  the  following. 

(3)  su        'plant  on' 
su       'take' 
luta      'stir' 

!ut3      'intermingle' 

The  following  illustrates  and  establishes  the  contrast  [o/o]. 

(4)  wo       'open  halfway" 

wo       'spread  (mat,  sheet)' 

t6s3     'put  together' 

tos3     'ward  off,  be  prepared  for' 

Apart  from  these  sets  of  front  and  back  vowels,  central  vowels  are  also 
attested  in  the  language. 


1.2.  Central  vowels. 

Three  types  of  central  vowels  can  be  identified  in  the  UR  of  the  language 
on  the  basis  of  their  contrasting  distribution;  the  three  vowels  are  [3/3],  and  [a]. 

The  vowels  [a/3].  In  the  production  of  the  [+ATR]  schwa  [3],  the  tongue 
remains  in  its  normal  position;  however,  a  tension  is  felt  at  the  root  of  the  tongue, 
which  is  more  advanced.  The  vocal  cords  vibrate,  resulting  in  an  audible  sound. 
The  opposite  activity  in  the  root  of  the  tongue  and  the  vocal  cords,  while  the 
tongue  is  in  its  normal  position,  produces  the  [-ATR]  schwa  [3].  The  following 
illustrates  the  contrast  between  the  two  types  of  schwa. 

(5)  lal        'shrink' 

l3l        'give  birth' 
lata       'skin  an  animal' 
I3t3      'get  loose' 

The  vowels  [a/3]  contrast  with  the  high  front  vowels  in  the  following. 

(6)  lata      'skin  an  animal* 
ilta       'tease' 

I3t3      'get  loose' 
lita       'write  badly' 
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The  two  vowels  also  contrast  with  the  high  back  vowels  as  shown  in  the 
following. 

(7) 


lata 

'skin  an  animal" 

luta 

'stir  up' 

I3t3 

'get  loose' 

Iut3 

'intermingle' 

tai 

'carry  repeatedly' 

tul 

'become  short' 

tal 

'open' 

tul 

'wrap  around  the  waist' 

The  various  contrasts  between  the  two  schwas  and  the  other  vowels 
esiablish  their  existence  in  the  UR  of  the  language. 

The  vowel  [a].  In  the  production  of  this  vowel,  the  tongue  is  completely 
relaxed;  no  tension  is  exercised  on  the  tongue.  The  main  difference  between  this 
vowel  and  the  other  vowels  is  that  the  opening  of  the  mouth  is  wider  in  its 
production.  The  following  illustrates  the  contrast  between  the  three  central 
vowels. 


(8)       sas3  'spur  on' 

S3S3  'put  upright' 

sasa  'hang  down' 

tal  'arrive' 

t3l  'open' 

tal  'dig  up' 

The  vowel  [a]  also  contrast  with  the  other  vowels  as  shown  below. 
(9) 


fa 

'beg  for' 

fe 

'give  way' 

sa 

'scratch' 

se 

'run' 

su 

'plant  on' 

su 

'take' 

si 

'wear* 

si 

'enter' 

wa 

'dance' 

wo 

'lift,  open  a  little' 

wo 

'spread  a  mat' 

Unlike  the  other  vowels,  the  low  vowel  /a/  does  not  show  any  contrast  in 
the  values  of  the  ATR  feature  in  monosyllabic  words,  where  it  is  always  realized 
as  [-ATR].  At  the  surface  level,  however,  it  occurs  as  [+ATR]  in  words  with  a 
[+ATR]  high  vowel.  Assuming  that  the  low  vowel  acquires  the  feature  value 
[+ATR]  through  vowel  harmony,  I  tentatively  classify  it  as  [-ATR]  in  the  UR. 
Evidence  for  this  approach  is  provided  in  the  next  section.  On  this  basis  we  can 
claim  that  Lama  has  eleven  vowels  in  the  UR. 
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1.3.  Lama  vowels  and  ATR  features 

The  phonetic  details  of  ATR  as  a  distinctive  vowel  feature  are  extensively 
covered  in  Clements  1981.  Following  Stewart  1967,  Clements  notes  that  'the 
phonetic  basis  of  the  distinction  between  [two  sets  of]  vowels  ...  is  the  position  of 
the  tongue  root:  advanced  for  the  vowels  of  [one  set],  neutral  or  retracted  for  the 
vowels  of  [the  other  set]'  (111). 

Recalling  the  contrasts  examined  in  the  preceding  section,  it  follows  that 
the  feature  values  [+]  and  [-]  ATR  characterize  each  pair  except  for  the  low  vowel 
/a/.  As  mentioned  earlier,  the  low  vowel  is  always  realized  as  [-ATR]  in 
monosyllabic  words.  However,  in  polysyllabic  words  it  shows  a  [+ATR]  value 
when  it  occurs  in  the  environment  of  any  preceding  or  following  [+ATR]  high 
vowel,  as  illustrated  by  the  following  data: 


(10) 


Set1 

asaseer 

'maggot' 

apu 

'head  louse' 

aluku 

'chicken  coop" 

wunasa 

'hvers' 

lamasa 

'nape  of  the  neck' 

kurasa 

'venereal  diseases 

tuka 

'shrubs' 

Set  2 

Singular 

Plural 

ci 

'father' 

cina 

ri 

'mother' 

rinS 

ram 

'iroko' 

ramnS 

nun 

'aunt' 

nunnd 

aci 

'daddy 

acind 

Sets 

Singular 

Plural 

r3m 

'snake' 

ramna 

ra 

'fhend' 

rana 

ko 

•sister' 

kona 

raal 

'brother' 

raalna 

nem 

'grinding  stone' 

nemna 

The  first  set  in  (10)  shows  the  low  vowel  cooccurring  with  [+ATR]  high 
vowels,  in  set  2,  the  low  vowel  of  the  suffix  -na  is  [+ATR]  with  [+ATR]  roots, 
whereas  in  set  3  it  appears  as  [-ATR]  with  [-ATR]  roots. 
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For  an  adequate  oresentation  of  the  harmonic  facts  and  problems 
addressed  in  section  3  of  this  paper,  I  adopt  Clements's  description  of  the  feature 
ATR  presented  above,  and  I  propose  the  ATR  classification  of  Lama  vowels 
below. 


(1 1 )     Vowels  and  ATR  features, 
i  i  e         e         d 

ATR    +  -  +          -  + 


Note  that  the  classification  in  (11)  shows  only  the  ATR  values  for  each 
vowel  in  the  UR.  Since  the  value  [+ATR]  is  predictable  for  the  low  vowel,  the 
question  we  need  to  address  is  what  rules  underlie  ATR  harmony  in  Lama. 

2.  Lama  vowel  harmony:  basic  facts. 

The  vowel  chart  in  (1 1 )  shows  two  sets  of  ATR  vowels:  a  [+ATR]  set  with  5 
vowels  and  a  [-ATR]  set  with  6  vowels.  The  general  harmony  type  is  the 
root-controlled  harmony  where  the  same  affix  can  show  an  alternation  in  ATR 
feature  value  depending  on  the  context  in  which  it  appears. 


2.1.  ATR  alternation  in  suffixes. 

In  the  context  of  [-t-ATR]  roots,  suffixes  always  appear  as  [+ATR],  but  they 
appear  as  [-ATR]  with  [-ATR]  roots.  The  following  examples  show  this  alternation 
in  the  causative  suffix  -S3. 


(12)     Set1 

wur  'become  black'  wur-sa 

taw  'come  down'  ta-sa 

Set  2 

tu  'eat'  tu-s3 

tel  'escape'  tel-s3 


'blacken' 
'bring  down' 


'feed  someone' 
'save  s.o's  life' 


This  sample  alternation  in  affixes  shows  that  Lama  has  an  ATR  vowel 
harmony  which  is  root-controlled. 

While    this    generalisation    holds    in    most    cases,    the    ATR    harmony 
processes  are  more  complicated  as  will  be  shown  in  the  next  sections. 
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2.2.  Alternation  in  the  vowel  [a]. 

In  section  1 ,  we  observed  that  the  vowel  laJ  does  not  contrast  in  the  values 
of  the  ATR  feature  as  the  other  vowel  do  in  the  UR.  However,  whenever  it  occurs 
as  [+ATR]  at  the  surface  level,  a  pressure  is  felt  at  the  root  of  the  tongue  which  is 
advanced.  The  [+ATR]  vowel  [a]  occurs  in  roots  with  [+ATR]  high  vowels  only. 
Consider  the  following. 

(13)  sau      'rich  person' 
nawu  'bull' 

As  an  affix,  this  vowel  also  shows  an  alternation  in  ATR  values.  The 
following  examples  illustrate  this  alternation  according  to  the  ATR  value  of  the 
root. 

(14)  Set1   [+ATR] 

ci         'father'  ci-n§  'fathers' 

ram      'irokotree'  rdm-nS  'iroko  trees' 

Set  2  [-ATR] 

ri         'python'  ri-na  'pythons' 

rann      'snake'  ram-na  'snakes' 

From  these  illustrations,  a  tentative  conclusion  is  that  [+ATR]  roots  govern 
ATR  harmony  in  affixes  in  general  and  in  the  low  vowel  /a/  in  particular.  A  more 
problematic  case  in  this  vowel  harmony  system  concerns  the  [+ATR]  mid  vowels. 


2.3.  The  case  of  the  [+ATR]  mid  vowels. 

The  relevant  facts  related  to  mid  vowels  are  as  follows. 

1)  They  can  cooccur  with  [-ATR]  vowels  in  the  same  root,  while  their 
[+ATR]  high  vowel  counterparts  do  not.  The  examples  below  illustrate  roots  with 
i+ATR]  mid  vowels  and  other  [-ATR]  vowels. 

(15)     calanko  'chameleon' 

ponta  'leopard' 

centi  'friend' 

2)  The  vowel  of  a  suffix  is  realized  as  [+ATR]  only  when  no  consonant 
intervenes  between  that  vowel  and  the  [+ATR]  mid  vowel  of  the  root.  This  is 
illustrated  below. 


(16)     ko        'shout' ko-u —>k6ou  'to  shout' 

se       'run'     se-u  — >seeu  'to  run' 

The  suffix  -u  marks  the  infinitive  on  the  verb.  The  vowel  lengthening 
process  and  other  segmental  alternations  related  to  suffixation  are  fully 
discussed  in  Ourso  1988. 
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3)  A  suffix  of  the  structure  CV  never  harmonizes  with  the  root  [+ATR]  mid 
vowel.  This  is  illustrated  in  the  following,  where  -S3  and  -u  are  respectively  noun 
class  6  and  noun  class  3  markers. 


(17)     Ie-S3    — >les3  'patridges' 

w6-S3  — >  W0S3  'rivers' 

lek-U  — >  leku  'arrow  shaft' 

The  relevant  harmonic  facts  presented  in  this  section  are  summarized  as 
follows:  (a)  [+ATR]  mid  vowels  cooccur  with  [-ATR]  vowels  in  polysyllabic  roots 
(e.g.  cal3r|ko  'chameleon');  (b)  None  of  the  vowels  of  suffixes  with  the  structure 
CV  are  realized  as  [+ATR]  after  [+ATR]  mid  vowels  in  the  word. 

This  discrepancy  in  the  harmonic  behavior  of  the  [+ATR]  high  vowels  and 
the  [+ATR]  mid  vowels  raises  several  questions  in  relation  to  the  harmonic  rules 
in  Lama. 

First,  is  ATR  harmony  a  property  of  the  root  or  is  it  a  property  of  individual 
vowels?  That  is,  is  ATR  assigned  to  the  root  as  a  floating  feature,  does  a  linking 
convention  followed  by  a  spreading  rule  then  apply  to  spread  the  ATR  value  onto 
the  other  vowels  of  the  word?  Or  is  ATR  assigned  and  linked  to  specified 
individual  vowels  in  the  root,  and  those  vowels  in  turn  spread  the  ATR  value  onto 
the  adjacent  vowels,  not  specified  for  that  value? 

Second,  if  ATR  harmony  is  root-controlled,  are  there  disharmonic  roots?  In 
short,  what  principles  underlie  the  ATR  harmony  presented  by  the  facts  above? 

3.  Analysis. 

For  an  attractive  analysis  of  the  harmonic  facts  presented  in  the  sections 
above,  I  consider  below  some  aspects  of  'Root  Controlled  Harmony'  discussed  in 
Clements  1981,  and  underspecification  in  ATR  features  as  discussed  in  Pulley- 
blank  1983,  1986,  and  1988.  I  argue  that  a  combination  of  some  aspects  of  the 
'Root  Controlled  Harmony'  approach  and  some  aspects  of  the  underspecification 
theory  accounts  for  the  facts  presented  above  in  a  straightforward  way. 


3.1.  The  principle  of  root  controlled  harmony. 

The  theoretical  principle  of  'Root  Controlled  Harmony'  developed  in 
Clements  1981  and  supported  in  Pulleyblank  1986  and  1988,  Is  adopted  with 
modifications  which  will  be  presented  in  this  section. 

The  main  point  in  Clements's  framework  is  that  the  value  [-i-ATR]  or  [-ATR] 
is  assigned  to  the  root,  and  it  is  the  root  which  thus  passes  that  value  on  to  the 
affixes.  It  follows,  then,  that  affixes  are  not  specified  for  any  ATR  value  in  the  UR. 
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In  this  view,  the  facts  presented  in  relation  to  roots  with  [+ATR]  high 
vowels,  on  the  one  hand,  and  roots  with  [-ATR]  vowels,  on  the  other  hand,  would 
be  straightforward.  That  is,  the  value  [+ATR]  is  assigned  to  the  roots  with  [+ATR] 
high  vowels,  and  [-ATR]  is  assigned  to  the  other  roots  (except  for  roots  with 
+ATR]  mid  vowels  which  constitute  a  special  case).  Words  like  acina  'daddies" 
and  ahumi  'cicada'  will  have  the  following  representation  where  the  ATR  feature 
is  not  associated  with  any  specific  ATR-bearing  unit. 

(18)  (a)       +A  (b)       -A 

A-cl-nA  A-hUm-l          Root  ATR 

The  association  convention  which  links  the  floating  [ATR]  of  the  root  to  the 
leftmost  ATR-bearing  unit  will  then  link  [+A]  to  the  leftmost  vowel  in  (a),  i.e  to  the 
low  vowel;  likewise,  [-A]  will  link  to  the  low  vowel  in  (b)  as  shown  below. 

(19)  (a)       +A  (b)       -A 

AclnA  AhUmI  Root  ATR-Association. 

Finally,  the  association  convention  spreads  the  value  of  the  linked  ATR 
onto  the  adjacent  ATR-bearing  units  from  left  to  right  to  derive  the  expected 
results  in  (20). 

(20)  (a) 


-^A 

(b) 

-A 

/K 

^N. 

AcinA 

Ahum! 

Spreading 

acina 

ahumi 

Phonetic  realisation 

Recall  that  the  third  category  of  facts  presented  in  the  previous  sections 
involved  roots  with  [-t-ATR]  mid  vowels  and  [-ATR]  vowels;  for  convenience  I 
repeat  some  of  these  data  below. 


(21) 


Within  the  root-controlled  appproach  illustrated  so  far,  the  category  of 
roots  exemplified  above  can  be  accounted  for  by  what  Clements  (1981)  has 
characterized  as  'disharmonic  roots';  that  is,  roots  specified  for  both  [-t-ATR]  and 
[-ATR].  This  implies  that  the  low  vowels  in  the  polysyllabic  roots  above  contribute 
an  ATR  value  in  the  harmonic  processes. 

While  this  approach  explains  the  cases  exemplified  in  (21),  it  fails  to 
explain  why  the  low  vowel  behaves  as  an  opaque  segment  in  the  context  of 
[+ATR]  mid  vowels,  and  as  a  neutral  vowel  in  the  context  of  [+ATR]  high  vowels. 


wonka-s3 

'donkeys' 

wonda-s3 

'months' 

lek-u 

'arrow  shaft' 

p6nta-na 

'panthers' 

hO-S3 

'pregnancies' 

W0-S3 

'black  bugs' 

Ie-S3 

'partridges' 
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Note  that,  under  this  approach,  the  low  vowel  is  opaque  in  (21)  since  it 
blocks  the  spreading  of  the  preceding  [+ATR]  value.  Furthermore,  it  spreads  its 
own  [-ATR]  value  on  to  the  vowels  on  its  nght;  this  characteristic  is  common  to 
opaque  vowels  (Clements  1981,  van  der  Hulst  and  Smith  1986,  van  der  Hulst  et 
al.  1986). 

Recall  that  in  roots  with  [+ATR]  high  vowels,  the  low  vowel  always  shows 
up  with  a  [+ATR]  value.  In  such  contexts,  the  low  vowel  behaves  then  as  a 
transparent  segment  in  that  it  neither  blocks  [+ATR]  spreading  nor  spreads  its 
own  [-ATR]  value.  Moreover,  note  that  the  low  vowel  itself  undergoes  [+ATR] 
harmony,  which  is  expected  from  transparent  vowels  in  ATR  harmony  systems. 

Without  ad  hoc  rules,  the  disharmonic  root  approach  can  explain  neither 
this  asymmetry  in  the  low  vowel  nor  why  the  suffix  in  words  like  Ies3  'partridges' 
is  always  realized  as  [-ATR]  while  the  root  vowel  is  [+ATR].  An  independent 
pnnciple  could  be  motivated  to  account  for  this  asymmetric  harmony;  that  is,  a 
principle  which  would  assign  the  two  ATR  values  to  these  types  of  roots  as  a 
language-specific  rule  with  respect  to  [+ATR]  mid  vowels.  In  this  view,  a  single 
root  is  then  assigned  [+A]  and  [-A]  and  where  a  feature  value  has  no  ATR- 
beahng  unit  to  anchor  to,  it  will  remain  unlinked  and  therefore  unpronounced.  In 
Lama,  words  like  wo  lwos3  'black  bug/s'  will  have  the  representation  below. 

(22)      (a)        +A-A  (b)        +A -A 

wO  W0-S3 

Assuming  a  one-to-one,  left  to  right  association,  it  follows  that  the  value 
[-ATR]  will  remain  unpronounced  in  (22a),  while  in  (22b)  it  will  associate  to  the 
ATR-beanng  unit  of  the  suffix,  and  therefore  be  realized  at  the  surface  level. 

This  approach,  however,  will  not  explain  why  prefixes  like  a-  in  akolu 
'dumping  area'  is  realized  as  [-ATR]  if  we  assume  root-controlled  harmony.  We 
would  expect  those  prefixes  to  acquire  the  [-hATR]  value  from  the  adjacent 
[-I-ATR]  mid  vowel.  Moreover,  how  do  we  account  for  the  fact  that  the  two  values 
of  [ATR]  are  only  assigned  to  roots  with  [+ATR]  mid  vowels? 

In  the  remaining  part  of  this  section,  I  will  show  that  a  combination  of  some 
aspects  of  the  Root  Controlled  Harmony  with  some  aspects  of  the  under- 
specification  theory  accounts  for  the  asymmetric  ATR  harmony  in  Lama  in  a 
straightforward  way. 
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3.2.  Underspecification  in  ATR  values. 

Of  interest  for  the  present  problem  are  the  aspects  of  underspecification 
theory  discussed  in  Pulleyblank  1983,  1986,  and  1988. 

The  theory  of  underspecification  stipulates  that  for  a  contrasting  feature, 
only  one  value  is  specified  and  the  other  value  supplied  by  rule.  For  instance,  if 
the  value  [+ATR]  is  underlyingly  specified  for  a  set  of  vowels,  it  becomes 
redundant  to  specify  the  other  value.  However,  when  harmony  rules  apply 
contrasting  the  two  values  at  a  certain  level  of  phonological  derivation,  the  value 
[-ATR]  is  assigned  by  Universal  Grammar  (UG)  to  those  ATR-bearing  units  not 
specified  for  the  feature  ATR. 

Following  these  ideas,  the  specification  of  the  vowels  in  the  Lama  vocalic 
system  is  revised  as  follows  with  respect  to  [ATR]  values. 

(23)     i  i  eeasaoouu 

ATR    +  +  +  +  + 

From  this  feature  specification  in  the  UR,  it  follows  that  the  non-specififed 
value  for  the  target  feature  is  provided  by  a  redundancy  rule  such  as 


(24)[]-->[-ATR] 

That  is,  an  ATR-bearing  unit  is  assigned  [-ATR]  if  it  is  not  specified  for  that 
feature. 

Assuming  with  Clements  (1981),  that  the  root  is  the  domain  for  ATR 
harmony  under  discussion,  the  following  principles  can  be  formulated  to  account 
for  the  facts  presented  so  far. 


(25)     Principles  for  Lama  ATR  harmony 

I  Affixes  are  not  specified  for  ATR 
features;  they  obey  the  principle 
of  root  control  (Clements  1 981 ). 

II  (a)       [-I-ATR]  is  the  only  specified 

feature  value  in  the  UR  for  [ATR]. 
(b)       [ATR],  as  the  property  of  the 
root,  is  a  floating  feature. 

III  (a)       [ATR]  of  the  root  is  linked  to 

the  leftmost  A-bearing  unit  (vowel) 
after  affixation. 
(b)       Spreading  takes  place  within  the 
word  from  left  to  right  after 
linking. 

IV  [  ]  -— >  [-ATR]. 

From  Principle  I,  it  follows  that  no  affix  can  be  assigned  a  [+ATR]  value;  it 
also  follows  from  the  redundancy  rule  in  IV  that  an  affix  can  acquire  a  [-ATR] 
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value  if,  for  some  reason,  it  does  not  acquire  the  specified  feature  value  through 
harmonic  rules  from  Phnciples  II  and  HI. 

Crucial  to  this  analysis  is  the  assumption  that  Principle  IV  is  a  direct 
consequence  of  Principle  II;  and  as  such,  it  follows  that  IV  automatically  applies 
to  roots  not  specified  for  [ATR].  It  is  also  assumed  that  whenever  IV  applies  to  a 
root,  Principle  III  is  automatically  triggered  and  thus  applies. 

Note,  however,  that  while  these  principles  account  for  the  facts  related  to 
the  roots  with  [+ATR]  high  vowels  and  the  roots  with  vowels  not  specified  for 
[ATR],  they  make  incorrect  predictions  for  roots  with  [+ATR]  mid  vowels. 

Recall  from  the  data  in  (21 )  that  when  a  root  has  both  a  low  vowel  and  a 
mid  vowel,  the  low  vowel  is  always  [-ATR]  even  if  the  mid  vowel  is  [+ATR].  Within 
the  framework  of  underspecification,  it  follows  that  the  low  vowel  (and  affixes) 
cooccurring  with  the  [+ATR]  mid  vowels  acquires  its  [-ATR]  value  either  by 
Principle  IV  stipulated  above  or  by  a  different  stipulation.  Principle  IV  might  apply 
in  this  case  because  for  some  reason  the  [+ATR]  mid  vowel  fails  to  spread  its 
value.  Following  van  der  Hulst  and  Smith  (1986),  I  argue  that  mid  vowels  are 
neutral  segments  in  Lama.  In  van  der  Hulst  and  Smith's  analysis  neutral  vowels 
can  be  opaque  or  transparent.  They  are  opaque  when  their  idiosyncratic  value  is 
the  opposite  of  the  feature  value  specified  in  the  UR.  In  the  case  of  ATR 
harmony,  the  value  [+ATR]  being  the  specified  one,  it  follows  that  opaque  vowels 
assume  [-ATR]  as  their  lexical  property,  and  block  [+ATR]  harmony.  In  contrast  to 
opaque  vowels,  transparent  vowels  assume  as  their  lexical  property  the  specified 
value  for  the  feature  under  consideration.  In  the  present  case,  the  value  [+ATR] 
is  therefore  the  idiosyncratic  property  of  the  mid  vowel,  which  neither  spreads  nor 
blocks  the  spreading  of  that  feature  value  from  other  sources. 

In  relation  to  mid  vowels  in  Lama,  I  argue  that  [+ATR]  is  their  lexical 
property  when  they  are  underlyingly  specified  for  the  feature  ATR;  that  is,  the 
value  [+ATR]  is  not  a  floating  feature  for  mid  vowels.  It  follows  from  this 
appproach  that  three  types  of  ATR  roots  should  be  posited  in  Lama.  (1)  Roots 
with  a  floating  [+ATR]  value;  these  roots  are  characterized  by  high  vowels 
underiyingly  specified  for  the  feature  ATR.  (2)  Roots  with  a  lexically  linked 
[+ATR];  these  are  characterized  by  the  mid  vowels  underlyingly  specified  for 
ATR.  And  (3)  roots  with  no  ATR  specification;  these  are  exemplified  by  the 
words  which  show  [-ATR]  vowels  all  across  at  the  surface  level. 

To  account  for  the  asymmetry  displayed  by  type  2  roots,  I  propose  that  in 
addition  to  the  principles  which  govern  root  harmony,  Lama  has  a  complement 
rule  of  the  structure 


(26)     [  ]  [-ATR] 

I  >  I 

V  V 

That  is,  assign  a  [-ATR]  value  to  an  ATR-bearing  unit  linked  to  an  empty 
ATR  slot.  The  complement  rule  applies  after  the  principles  in  (25).  Note  that  while 
Principle  IV  is  provided  by  UG,  the  complement  rule  is  language  specific. 
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The  interaction  between  the  complennent  rule  and  the  principles  stipulated 
for  [ATR]  harmony  is  illustrated  below  for  the  words  acina  'daddies',  ahumi 
'cicada'  and  akolu  'dumping  area'. 


(27)     (a)        +A 


(b)       +A 


(c) 


+A 


UR 


A-cln-A 


A-hUm-l 


A-kol-U 


The  underlying  representation  of  the  words  in  (27a)  and  (27b)  shows  ATR 
as  a  floating  feature  assigned  to  the  root  by  the  relevant  rules  in  (25).  The  root  in 
(27c)  shows  the  [+ATR]  value  as  a  linked  idiosyncratic  property  of  the  neutral  mid 
vowel.  The  root  harmony  Principle  III  applies  to  (27a)  and  (27b)  to  give  the 
derivation  in  (28). 


(28)     (a) 


/K 
AclnA 
acina 
daddies 


(b) 


AhUmI 
ahumi 
cicada 


Association 

and 
Spreading 
Phonetic 
realisation. 


Since  (27c)  does  not  satisfy  the  principles  underlying  the  root-controlled 
harmony  argued  for  so  far,  it  follows  that  the  spreading  of  the  idiosyncratic 
[+ATR]  from  the  transparent  mid  vowel  does  not  take  place.  Assuming  that  the 
complement  rule  (26)  is  triggered  by  the  ATR  status  in  type  3  roots,  the  derivation 
in  (27c)  will  be  completed  by  that  rule  as  shown  in  (29). 


(29) 


+A 


AkolU 


-A 


-A 


ikolu 


UR 

Harm. rules  inappl. 
Complement  rule 

Phonetic  realisation. 


An  issue  raised  by  the  present  analysis  relates  to  polysllabic  roots  of  the 
following  nature. 2  First,  there  are  roots  containing  both  [+ATR]  mid  vowels  and 
other  [-ATR]  vowels  as  in  the  following. 


(30)     wonka-s3       'donkeys' 
kponseer       'mad  man' 
cal3nko  'chameleon' 

In  cases  like  those  in  (30),  do  we  assign  the  value  [-ATR]  as  a  floating 
feature  to  the  root  in  addition  to  the  lexically  linked  [+ATR]?  It  follows  from  the 
framework  developed  here  that  an  assignment  of  the  redundant  value  to  such 
roots  is  not  necessary.  The  complement  mle  presented  in  this  analysis  is 
motivated  to  handle  this  particular  case  which  is  outside  the  scope  of  root- 
controlled  harmony.  It  is  my  position,  however,  that  when  the  complement  rule 
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applies  to  several  adjacent  ATR-bearing  units,  the  OCP  (Obligatory  Contour 
Principle)  is  automatically  triggered.  Thus,  the  derivation  of  calsnkoss  of  type  3 
roots  will  have  the  following  representation. 


(31) 


+A 


UR 


cAl3nko-s; 

I  1 
-A-A 

-A 

cal3nkos3 


Compl.Rule 

OCP 

Phon. Realisation. 


Second,  note  roots  containing  both  [+ATR]  high  vowels  and  [+ATR]  mid 
vowels  of  the  following  combination  where  the  parentheses  indicate  potential 
affixes. 


(32)      (a)       (-)        CV      CV       (-) 


(b)       (-) 


[f_ATR| 


1  -low 

^I+atr) 


CV       CV 

-I    n 


(-) 


hi 

low 

hATR 


+hi 
+ATR 


The  respresentation  in  (32)  predicts  that  the  affixes  adjacent  to  the  mid 
vowel  will  surface  as  [-ATR]  since  spreading  from  that  vowel  fails  under  the 
analysis  presented  so  far;  this  representation  further  predicts  that  the  affixes 
above  cannot  link  to  the  floating  [+ATR]  without  crossing  the  lexical  association 
line  of  the  mid  vowel.  But  as  already  shown  by  the  facts  in  section  2,  the  affixes 
in  the  context  of  the  structures  in  (32)  are  realized  as  [+ATR].  Since  the  lexical 
ATR  value  of  the  mid  vowel  does  not  spread,  it  follows  that  roots  exemplified  by 
the  structures  in  (32)  have  two  [+ATR]  values,  that  is,  the  lexical  value  of  the  mid 
vowel  and  the  floating  value  of  the  root.  The  OCP  applies  to  delink  and  collapse 
the  inert  idiosyncratic  [+ATR]  value  of  the  mid  vowel  with  the  dynamic  [+ATR]  of 
the  root.  This  process  is  illustrated  below. 


(33)      (a)[+A]  [+A] 

kUkby-0 

kUkOyO 
kukoyo 

cry 


(b)[+A]  [-jA] 

wonArUd-U 

'^^^ 

wOnArUdU 
wonarudu 

gazelle 


UR 


OCP 


Root  Harmony 
Phonetic 
realisation. 
Gloss. 


126 

4.  Conclusion. 

In  this  paper,  I  have  shown  that  ATR  vowel  harmony  in  Lama  is 
straightforward  following  the  theoretical  assumptions  that:  (1)  [ATR]  harmony  is 
the  property  of  the  root;  (2)  [+ATR]  is  the  feature  value  specified  for  [ATR]  in  the 
UR  (it  therefore  follows  that  [-ATR]  is  a  redundant  feature  not  specified);  and  (3) 
when  they  are  specified  for  [ATR],  mid  vowels  assume  the  value  [+ATR]  as  their 
idiosyncratic  property  (in  this  case  the  mid  vowels  are  transparent). 
Consequently,  root  harmony  is  not  blocked  under  appropriate  conditions. 

NOTES. 

*l  am  grateful  to  L.  Downing  and  H.  H.  Hock  for  their  invaluable  comments 
and  suggestions  on  an  earlier  version  of  this  paper.  I  wish  to  thank  especially  C. 
Kisseberth  for  his  attention  to  my  work  in  general  and  his  interest  in  this  paper  in 
particular.  I  benefited  a  lot  from  his  wide  knowledge. 

lThe  data  in  this  paper  are  based  on  the  material  of  the  second  chapter  of 
my  dissertation.  The  details  on  the  segmental  and  tonal  alternations  are 
considered  in  that  work.  Note  only  that  there  are  two  low  tones  in  Lama;  the  one 
marked  by  the  grave  accent  is  in  fact  a  rising  tone  which  simplifies  to  a  low  tone 
at  the  surface  level.  Further  information  on  the  tonal  aspects  can  be  found  in 
Kenstowicz,  Nikiema,  and  Ourso  1988. 

2  Of  a  total  of  1019  nouns  and  540  verbs  considered  only  7  roots  show  the 
combination  of  both  [+ATR]  high  vowels  and  [+ATR]  mid  vowels.  One  root  shows 
the  combination  [+ATR,  +mid]  and  [-ATR,-mid,-low]  (cf.  30).  Combinations 
involving  the  low  vowel  in  the  roots  are  numerous.  One  root  shows  two  [+ATR] 
mid  vowels  (kponsen  'fiber  straw');  this  noun  belongs  to  noun  class  1  and  does 
not  take  any  noun  class  suffix.  My  intuition  is  that  even  if  that  noun  were  to  take 
the  NC2  suffix  na,  the  vowel  of  that  suffix  would  appear  as  [-ATR]. 
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WHERE'S    MY   NP?      NON-TRANSFORMATIONAL   ANALYSES 
OF   VEDIC   PRONOMINAL   FRONTING* 


Steven  Schaufele 


Vedic  Sanskrit  shows  some  evidence  of  hierarchical  structure 
(i.e.  VPs  and  PPs).  This  means  the  grammar  allows  NPs  not  to  be 
immediately  dominated  by  S.  However,  certain  determiners  routinely 
behave  as  immediate  daughters  of  S,  even  when  'controlled',  in  terms 
of  morphological  agreement,  by  NPs  which  are  not.  This  paper 
compares  the  ability  of  two  non-transformational  frameworks,  GPSG 
and  LFG,  to  describe  the  relationships  between  such  pronominal 
determiners  and  their  controlling  NPs.  I  conclude  that  LFG  provides  a 
much  simpler,  more  straightforward  and  elegant  description  of  the 
data  than  GPSG. 

0.  Introduction 

'WH-fronting',  generally  understood  as  the  fronting  of  relative  and  interrog- 
ative pronominals,  is  a  common  tendency  in  Indo-European  (IE)  languages. 
The  parallel  tendency  in  Vedic  Sanskrit  fronts  not  only  relative  and  interrogat- 
ive, but  certain  demonstrative  pronominals  as  well.  However,  whereas  in  mod- 
ern IE  languages  such  as  English  the  entire  phrase  dominating  the  WH-word  is 
fronted  along  with  it,  in  Sanskrit  only  the  pronominal  word,  not  its  'controlling' 
NP,  is  fronted.  See  (1)  below,  (la)  shows  an  acceptable  English  clause,  in 
which  the  entire  PP  dominating  the  WH-word  'which'  has  been  fronted.  (Ib-c) 
show  similar  strings  in  which  only  the  pronominal  word(s)  have  been  fronted; 
clearly  these  are  unacceptable.  (Id)  shows  a  clause  from  the  Vedic  Prose 
corpus,  in  which  the  constituents  occur  in  approximately  the  same  order  as  their 
English  equivalents  in  (1c).  This  is  an  acceptable  Sanskrit  clause. 

(1 )     (a)     For  which  deities  those  oblations  have  been  taken 

(b)  'Which  those  oblations  have  been  taken  for  deities 

(c)  *Which  those  for  deities  oblations  have  been  taken 
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(d)     yabhyah    eva        tani      devatabhyah     havrrhsi  grhMni 
"which'-b  PTCL  'those'-N     "deities'-D  'oblations'-N  'taken' 

bhavanti 

'are' 
'For  which  deities  those  oblations  are  taken'  (AB  7.2.3) 

In  this  paper,  I  will  discuss  the  possibility  of  describing  the  resulting  strings 
without  transformations.  I  will  compare  the  relative  abilities  of  Generalized 
Phrase-Structure  Grammar  (GPSG)  and  Lexical-Functional  Grammar  (LFG)  at 
this  task,  and  will  conclude  that,  while  GPSG  may  be  able  to  provide  an 
adequate  description  at  great  cost,  LFG  provides  a  much  simpler,  more  straight- 
fonward,  and  elegant  description. 

This  paper  is  necessarily  corpus-based.  Whereas  a  syntactician  invest- 
igating a  living  language  can  solicit  a  native  speaker's  intuitions,  I  am  limited  to 
the  strings  actually  attested  in  Vedic  literature.  While  such  a  pre-recorded  body 
of  literature  may  occasionally  reflect  a  wider  variety  of  grammatical  practice  (e.g. 
idiolectal,  dialectal,  or  registral  variation)  than  a  fully  formal  theory  of  grammar 
may  feel  comfortable  with,  I  must  assume  that  all  strings  attested  in  the  literature 
are  grammatical,  and  seek  to  elucidate  and  describe  the  attested  patterns. 
Whenever  I  present  a  string  with  a  star  (*)  or  describe  it  as  'inattestable',  I  mean 
that  I  have  made  it  up  myself  and  that  a  survey  of  a  wide  range  of  data  has 
convinced  me  that  there  are  patterns  in  Vedic  grammar  which  said  string 
violates.  Similarly,  a  string  which  I  have  marked  with  a  question  mark  is  one 
which,  attested  or  not,  my  survey  of  the  corpus  leads  me  to  believe  may  be 
acceptable,  but  highly  marked. 

In  the  first  section  of  this  paper  I  will  outline  some  preliminary  assumptions 
about  how  constituent  structure  (c-structure)  in  general  should  be  described 
and  about  the  c-structure  of  Vedic  Sanskrit  in  particular.  In  the  second  section  I 
will  attempt  to  build  a  descriptively  adequate  account  of  the  facts  within  GPSG, 
an  endeavour  which  I  will  conclude  is  probably  not  worth  the  trouble,  especially 
when  compared  with  the  straightforward  way  in  which  LFG  accounts  for  the 
same  facts. 

1.    The  description  of  constituent  structure 

1.1.     General  assumptions 

1.1.1.     ID/LP  format 

Throughout  this  paper  I  will  be  using  ID/LP  format  (defined  below)  in 
describing  c-structure,  whether  the  theoretical  context  is  GPSG  or  LFG.  While 
early  literature  in  LFG  (e.g.  Bresnan  1982)  does  not  make  use  of  ID/LP  format,  I 
have  not  found  any  theoretical  reason  why  it  would  be  inappropriate,  and  in  fact 
it  is  used  in  an  LFG  context  in  Bresnan  &  Mchombo  1987.  I  assume  that  such 
usage  is  legitimate  and  will  not  try  to  defend  it. 
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Briefly,  ID/LP  format  makes  possible  the  independent  expression  of  gener- 
alizations about  dominance  or  precedence  relations.  Instead  of  a  'Phrase- 
Structure  Grammar'  defined  by  a  set  of  explicit  phrase-structure  rules  in  the 
manner  of  Chomsky  1965,  in  ID/LP  format  there  are  two  independent  sets  of 
rules  which  together  generate  a  PSG-describable  language.  An  ID  or  'Imme- 
diate Dominance'  rule  of  the  form  shown  in  (a)  stipulates  an  unordered  set  of 
permissible  daughters  of  a  given  mother.  An  LP  rule  imposes  constraints  on  the 
relative  order  of  two  or  more  constituents,  whenever  they  occur  as  sisters.  If  the 
grammar  allows  two  constituents  to  occur  as  sisters  but  has  no  LP  rule  referring 
to  them,  they  are  free  to  occur  in  either  order.  The  format  of  an  LP  rule  is  as 
shown  in  (b)  whose  meaning  is  that,  whenever  they  occur  as  sisters,  A  must 
precede  B  (and  C)  and  B  must  precede  C. 

(a)  A  ->  B.  C,  D 

(b)  A<B<C 

A  GPSG  with  ID/LP  format  defines  a  tree  as  well-formed  if  and  only  if  every 
immediate  dominance  relation  in  it  is  licensed  by  some  ID  rule  and  violates  no 
LP  rules. 

Since  LFG  crucially  depends  upon  the  annotation  of  constituents  for  gram- 
matical function  (GF)  such  as  Subject,  Object,  etc.,  within  that  framework  the 
constituents  specified  in  ID  rules,  and  possibly  LP  rules  as  well,  would  have  to 
be  susceptible  to  annotation  for  GF."!  Further,  ID  rules  would  not  need  to  include 
any  semantic  component  such  as  they  have  to  have  in  GPSG,  since  semantics 
in  LFG  is  read  off  a  separate  level  of  analysis,  f-structure. 

1.1.2.     'Metarule'-type  generalizations  across  ID  rules;  SLASH 

I  also  adapt  to  LFG  some  of  the  'metarule'-type  strategies  developed  within 
GPSG  for  describing  generalizations  across  sets  of  ID  rules.  In  doing  so,  I  note 
that  some  details  of  these  strategies  are  not  necessary  in  LFG,  since  LFG 
defines  a  distinct  level  of  analysis,  f-structure,  within  which  some  of  the  relevant 
information  may  be  encoded.  As  long  as  a  constituent  is  somewhere  in  the 
clause,  it  can  be  cross-referenced  with  an  appropriate  element  in  f-structure  and 
is  in  that  sense  'accessible'  to  f-structure'. 2  Therefore,  GPSG's  SLASH  strategy 
would  be  useful  in  LFG  only  in  licensing  ID  mles,  and  therefore  c-structures,  that 
might  not  otherwise  be  licensed. 

SLASH  in  GPSG  is  a  'category-valued'  feature,  taking  a  category  (such  as 
V  or  NP)  as  its  value.  A  category  marked  for  a  SLASH  feature  with  the  value  X 
is  a  category  dominating,  not  necessarily  immediately,  a  gap  of  category  X. 

SLASH  is  the  means  whereby  GPSG  describes  'unbounded  dependenc- 
ies'. I  am  not  sure  to  what  extent  it  and  the  double-arrow  convention  in  LFG  are 
notational  variants  of  each  other.  For  the  purposes  of  this  discussion,  I  will  use 
SLASH  to  describe  strings  in  which  I  believe  some  sort  of  'extraction'  has  taken 
place. 3 
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1.2.    Assumptions  about  Vedic  constituent  structure 

1.2.1.  General  assumptions 

1.2.1.1.  Only  two  Bar  levels 

I  assume  that  Sanskrit  distinguishes  only  two  Bar  levels,  lexical  and  phras- 
al. Hale  1982  and  Speas  1988  have  argued  that,  either  universally  or  with 
regard  to  certain  specific  languages,  there  nnay  be  no  point  in  trying  to 
distinguish  between  different  levels  of  phrasal  projection  of  a  same  lexical 
category.  I  have  yet  to  see  any  strong  evidence  of  such  fine  distinctions  in 
Sanskrit.  The  freedom  of  constituent  order  characteristic  of  this  language  may, 
in  fact,  be  due  in  part  to  the  lack  of  intermediate  levels  of  hierarchical  stnjcture 
which  might  otherwise  distinguish  between  types  of  NP-  modifiers  or  -comple- 
ments, or  between  VP-adverbials  and  complements,  or  between  different  types 
of  VP-complements.  Therefore,  in  what  follows  I  will  distinguish  between  lexical 
and  phrasal  categories  only. 

1.2.1.2.  Adverbials 

For  brevity's  sake  I  will  use  the  term  'adverbial'  (abbreviated  Adv)  to  cover 
a  disjoint  set  of  categories  including  the  following: 

(a)  lexical  adverbs  and  projections  thereof,  if  any'* 

(b)  adpositional  phrases  (Most  if  not  all  PPs  in  Sanskrit  function  semant- 
ically  as  adverbials.) 

(c)  some  NPs  in  any  case  but  Nominative  (There  are  adverbial  functions 
for  every  non-Nominative  case  in  Sanskrit.) 

From  a  formal  point  of  view,  these  different  categories  are  difficult,  if  not 
impossible,  to  collapse  into  one.5  But,  as  far  as  this  paper  is  concerned,  they 
have  the  same  syntactic  distribution.  Therefore,  rather  than  multiply  ID  rules  in 
order  to  cover  them  all,  I  will  subsume  them  all  under  the  abbreviatory  category 
•AdV. 

1.2.2.  Preliminary  ID  rule  schemata 

Where  ID  rule  schemata  would  differ  for  GPSG  and  LFG,  I  will  give  both  (a) 
the  GPSG  form  and  (b)  the  LFG  form.  Any  category  in  parentheses  is  optional; 
a  category  qualified  by  a  star  (*)  may  occur  an  indefinite  number  of  times. 

(2)     (a)     VP     -»  NP,  (Adv)*,       H 

[+SUBJ]  [-SUBJ] 

(b)     S  -  NP,       (Adv)*,    VP 

(TSUBJ  =  i)  (T  =  i) 
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(3)  (a)     NP  -»  (Det),        (NP).        (AP).  H0 

[CASE:GEN] 

(b)     NP  -      (Det).        (NP).  (AP),      H0 

(TDet  =  i)  [CASE:GEN]  (TMOD  =  i) 

The  category  label  'Det'  is  intended  to  cover  a  variety  of  pronominal 
elements  that  can  serve  as  NP  determiners.  The  limits  of  this  category  in 
relation  to  this  rule  schema  and  the  previous  one  will  be  a  major  focus  of  later 
discussion. 

All  nominal  elements  in  Sanskrit  are  marked  for  gender,  number,  and 
case.  I  assume  that  these  are  'Head  Features',  inherited  by  head  nouns  from 
their  mothers  and  spread  to  adjectives  and  determiners  according  to  the  normal 
procedures  in  either  framework.^ 

Of  course,  it  is  possible  for  the  Head  N  of  an  NP  to  be  +NULL.  meaning 
that  it  is  phonologically  empty,  while  still  being  the  NP  Head. 

(4)  VP  -►  (Adv)*.  (NP)*.  H0 

(5)  PP  ■*  NP.  H0 

(6)  V  -»  (Pfx),  H 

Like  'phrasal  verbs'  in  English  ('look  up',  'put  up').  Sanskrit  has  many 
verbs  that  include,  besides  their  roots,  one  or  more  prefixes.  In  the  Vedic  lan- 
guage, though  not  in  Classical  Sanskrit,  these  prefixes  are  separable  from  the 
verbal  root.  Syntactically,  therefore,  they  must  be  thought  of  as  separate  entit- 
ies. However,  the  semantics  and  subcategorization  of  the  combination  of  root- 
plus-prefix  is  not  always  predictable.  For  instance,  the  rootyna  means  'know' 
and  is  a  simple  transitive  verb;  the  basic  meaning  of  the  adposition  anu  is  'after', 
and  it  also  governs  the  accusative.  The  combination  dnu-jnd,  however,  means 
'grant,  concede,  permit',  and  is  bi-transitive.  taking  both  an  accusative  and  a 
dative  complement.  Since  the  semantics  and  subcategorization  of  phrasal 
verbs  are  often  non-compositional,  they  must  be  defined  as  lexical,  i.e.  0-Bar 
categories  (cf.  Gazdar  et  al.  1985:33);  but  the  morphological  bond  between  root 
and  prefix(es)  is  transparent  to  the  syntax,  since  root  and  prefix(es)  can  be 
separated;  cf.  (7)-(8)  below  beside  (9).  I  therefore  will  adopt  Farkas'  1986 
analysis  of  phrasal  verbs  in  Hungarian,  in  which  a  0-Bar  category  must  be 
allowed  to  branch.  Throughout  this  paper,  when  a  finite  verb  occurs  with  a 
prefix  (and  they  are  adjacent  to  each  other),  they  will  be  hyphenated  as  in  (9). 

(7)  si'kseyam     ft        mahayat6  dive-dive    rayah      a  kuhacidvide 
V  -  'send'  PTCL  'rejoicing'-D  'daily'    'riches'-A  Pfx.  'finding  it 

somewhere'-D 
'I  would  send  riches  daily  to  the  one  who.  rejoicing,  finds  it 
somewhere.'  (RV  7.32.19) 
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(8)  9  havai  asmin        svah         ca        nist.yah       ca     samsante 
Pfx.  PTCL  'him'-L  'his  own*-Np.  '&'  'strangers'-Np.  '&'  V  -  'place  taist' 
'In  him  both  his  own  (relatives)  and  strangers  place  their  trust.' 
(^B1 .6.4.1 7) 

(9)  ut-grbhnHe       vai     e?ah   asmat       lokit        devalokam 

V  -  'raise  oneself  PTCL  'he'-N  'this'-Ab  'world'-Ab  'god-world'-A 
'He  raises  himself  from  this  world  to  that  of  the  gods.'  (^B  3.1 .4.1 ) 

1.2.3.     Topicalization  7 

In  addition  to  basic  ID  rule  schemata,  an  adequate  description  of  Vedic 
constituent  order  requires  a  means  of  accounting  for  the  syntactic  behaviour  of 
a  class  of  words  which  I  shall  refer  to  as  'sentential  particles'  and  which  are 
labelled  PTCL  in  the  examples;  cf.  e.g.  it,  vaf'm  (7),  (9)  above. 

The  majority  of  these  particles  are  difficult  to  translate,  since  instead  of 
specific  information  they  communicate  nuances  relating  to  speaker  attitudes, 
etc.8  Many  of  the  glosses  in  (10)-(17)  below  therefore  are  only  approximate 
and,  except  in  this  section,  PTCLs  will  be  left  unglossed  (except  for  conjunctions 
like  ca  'and').  What  is  important  is  that,  like  Wackernagel's-Law  clitics,  they 
occupy  the  second  position  in  their  mother  category,  'second  position'  being 
defined  as  'immediately  following  the  first  lexical  item.'  As  can  be  seen  from 
(10)-(1 1),  these  particles  cannot  occur  in  clause-initial  position,  nor  can  they  be 
separated  from  the  left  boundary  of  their  mother  by  more  than  one  word.^ 

(10)  (a)         tam         tu     nah         agatam        prati-pra-brutat 

ACCfem.  PTCL    cl.     ACCfem.(part.)    V  2s.impv. 
DEM       'but'lp.pro.       'come'  'announce' 

'But  announce  her  to  us  as  having  come  =  when  she  has  come.' 
(^6  3.2.1.22) 

(b)  *tu    tam  nah  agatam  prati-pr^-brutat 
'but' 

(c)  *t§m  nah  tu  Igatam  prati-pr^-brutat 

'but' 

(d)  *tim  nah  agatam    tu  prati-pr^-brutat 

'but' 

(11)  (a)    prajapatih  khalu     vai    tasya  veda 

NOM  PTCL     GEN  V  3s.pres. 

'P.'         'now'  'indeed'DEM  'know' 
'Now,  Prajapati  indeed  knows  about  that.'  (TS  2.1.6.5) 

(b)    *khalu     vaf     prajapatih  tasya  veda 
'now'  'indeed' 
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As  shown  in  (12)-(14),  the  word  preceding  the  particle  string  often  corre- 
sponds to  a  gap  (represented  by  1)  to  its  right.  Such  a  word  will  henceforth  be 
said  to  be  'topicalized'. 

(12)  brahmajca    vai    idam       agre       [tj  subrahma  ca  ]np   astam 

NOM       PTCL     Adv        LOG  NOM     PTCL  V  3d.impf. 

'B.'       '&'  'now'  'here'  'beginning'  'S.'        '&'  'were' 

'Now  in  the  beginning,  there  were  here  Brahman  and  Subrahman.' 
(SB  1.1.1) 

(13)  dvisantami    ha     asya      tat     [tj  bhratrvyamJNp  abhi-ati-ricyate 

ACC      PTCL  GENcl.  NOM  ACC  V  Ss.pres. 

'hateful'  'now'  Ss.pro.  DEM  'foe'  'remain  for* 

'That  now  remains  for  his  hateful  foe.'  (^B  1.9.1.18) 

(1 4)  indhej       ha        vai        etat  adhvaryuh  [idhmena  agnfm  tj  ]vp 
V  3s.pres.  PTCL    PTCL     Adv.      NOM  INST     ACC 

'ignite'      'now'  'indeed'  'now'        'A.'  'kindling'  'fire' 

'Now  the  Adhvaryu  indeed  lights  the  fire  with  the  kindling.' 
(3b  1.3.5.1) 

The  clause-initial,  pre-PTCL  words  in  (10)-(11)  would  be  clause-initial 
even  without  the  PTCL  string,  but  it  will  simplify  the  description  of  the  syntax  of 
particles  if  (10)-(11)  are  also  assumed  to  include  gaps  immediately  to  the  right 
of  the  PTCL  strings.  It  is  then  possible  to  state  that  the  PTCL  string  always  has 
two  sisters,  one  a  Bar  0  item  to  its  left,  the  other  a  constituent  dominating  a  gap 
corresponding  to  that  Bar  0  item.  This  generalization  can  be  adequately  stated 
in  formal  terms  by  ID  rule  schema  (15)  and  LP  njle  (16). 

(15)  X  -»  Y[BAR  0],  PTCL,  XA'[BAR  0] 

(1 6)  [BAR  0]  <  PTCL  <  [SLASH:  BAR  0] 

The  vast  majority  of  particles  have  clausal  scope.  For  these,  the  category 
denoted  by  X  in  ID  rule  schema  (15)  is  S.  In  order  to  clarify  the  distinction  be- 
tween particles  that  have  exclusively  clausal  scope  and  those  that  are  not  so 
restricted,  I  shall  in  all  subsequent  examples  mark  each  PTCL  either 'S'  if  it  has 
exclusively  clausal  scope  or  'P'  if  it  does  not. 

1.2.4.     Scrambling  and  evidence  for  hierarchical  structure 

Freedom  of  constituent  order,  or  freedom  to  generate  discontinuous  con- 
stituents, is  a  frequently-remarked-upon  characteristic  of  Sanskrit  and  other 
early  IE  languages.  Much  of  the  discontinuity  found  in  Sanskrit  results  from 
fronting  processes  such  as  topicalization  and  the  fronting  of  pronimals  (to  be 
discussed  in  the  rest  of  this  paper),  or  from  other  syntactic  phenomena  which 
appear  to  be  independently  motivated.  There  remains  however  a  residue  of 
strings  whose  discontinuities  cannot  be  accounted  for  by  any  of  these  inde- 
pendently-motivated phenomena,  but  solely  by  the  fact  that  the  grammar  does 
not  forbid  them.  Such  strings  may  be  said  to  be  'scrambled'.  Cf.  (17)-(18).  The 
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(a)  strings  are  the  ones  actually  attested.    The  unscrambled  counterparts  are 
given  in  the  (b)-version. 

(17)  (a)  purastat  mithunam    prajananam     kriyate 

'at  the  beginning'  'union'-N  'procreation'-N  'Is  made' 

samidhenmam 
'kindling'-G 
'At  the  beginning  a  union,  a  procreation  of  the  kindling  is  made  / 
is  made  of  the  kindling.'  (6b  1 .4.1 .2) 

(b)    purastat  [samidhenmam  mithunam  praj^nanamj^p  kriyate 

(18)  (a)      tasya    dasa        avastat  ahani   divakfrtyasya  bhavanti 

DEM-G  'ten'-N  'before'-Adp.(G)  'days'-N  G  'are' 

'Before  the  Divaklrtya  there  are  ten  days.'  (AS  4.18.4) 

(b)     [tasya  divaklrtyasya  avastat]pp  [dasa  ahanil^p  bhavanti 

I  hope  to  present  elsewhere  my  conclusions  regarding  the  best  mechan- 
ism for  describing  this  sort  of  discontinuity.  Here  I  will  merely  state  that  it 
involves  a  'wrapping'  operation  similar  to  that  discussed  in  Pollard  1984  and 
Zwicky  1986.  The  discontinuous  constituent  (e.g.  the  bracketed  NP  in  (17b))  is 
'wrapped'  around  another,  or  two  constituents  (e.g.  the  bracketed  PP  and  NP  in 
(18b))  are  'wrapped'  around  each  other. 

For  the  present  paper,  the  issue  of  'scrambling'  in  Vedic  is  important 
primarily  because  it  offers  circumstantial  evidence  for  hierarchical  structure. 
There  appear  to  be  certain  constraints  on  scrambling  in  Vedic.  First  of  all,  some 
constituents  scramble  more  easily  than  others.  NP-Scrambling  appears  to  be 
fairly  common  in  all  registers  of  the  corpus.  PP-Scrambling  is  much  rarer;  (18) 
is  the  only  example  I  have  so  far  come  across  in  Vedic  Prose,  and  even  in  the 
thousands  of  verses  of  the  Rg-Veda  I  have  found  only  a  dozen  or  so.  VP- 
Scrambling  seems  to  be  rarest  of  all;  I  have  so  far  found  at  most  a  handful  in  the 
entire  corpus.  Furthermore,  there  appears  to  be  an  even  stronger  constraint  on 
what  might  be  called  'Multiple  Scrambling',  the  scrambling  of  a  constituent 
which  includes  a  scrambled  daughter.  Thus,  even  though  NPs  in  general 
appear  to  Wrap  more  easily  than  PPs,  if  an  NP  contains  a  PP  that  is  itself 
scrambled,  that  NP  is  extremely  unlikely  to  be  scrambled.  For  some  further 
discussion,  see  Schaufele  1988,  to  appear. 

These  constraints  seem  to  me  significant,  albeit  circumstantial,  evidence 
for  hierarchical  structure.  Since  their  description  appears  to  necessitate  refer- 
ence to  such  notions  as  NP,  PP,  and  VP,  it  follows  that  such  labels  must  refer  to 
something  real  in  the  grammar.  And  since  NPs,  PPs,  and  VPs  can  all  dominate 
NPs  (cf.  ID  rule  schemata  (3)-(5)),  it  must  be  possible  for  NPs  not  to  be  immedi- 
ate daughters  of  S. 
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2.     Pronominal  fronting 

2.1.     General  description 

NPs  can  be  generated  as  daughters  of  any  phrasal  constituent.  It  is  char- 
acteristic of  Sanskrit  that  any  NP  may  'control'  (in  the  sense  of  requiring 
agreement  in  gender,  number,  and  case)  a  pronominal  determiner,  whether 
demonstrative,  interrogative,  or  relative. 

All  Sanskrit  relative  pronominals  (based  morphologically  on  the  stem  ya- 
and  hereafter  abbreviated  REL)  and  interrogative  pronominals  (based  on  the 
stem  ka-,  including  the  form  km,  and  hereafter  abbreviated  INT)  belong  to  the 
syntactic  class  under  discussion.  But  not  all  demonstratives  do.  Demonstra- 
tives in  Sanskrit  (hereafter  abbreviated  DEM)  can  be  divided  into  two  classesi'io 

(a)  locational  deictics  (abbreviated  'IDEM'),  members  of  the  \dam  and 
adah  paradigms.  These  have  primarily  a  directional  sense  —  'this  here',  'that 
over  there"  —  referring  to  entities  in  the  objective  world. 

(b)  discourse  deictics  (abbreviated  *dDEM').  more  likely  to  be  used  to 
refer  to  items  in  the  discourse  context:  'that  mentioned  previously',  'the  X  about 
to  be  specified',  etc.  Discourse  deictics  are  based  on  the  stem  \a-.  I  include 
here  the  non-neuter  singular  nominative  forms  in  which  /s/  substitutes  for  /t/,  as 
well  as  those  derived  from  the  fa-  stem  by  means  of  the  prefix  /e/.  These  latter 
differ  from  the  prefix-less  dDEMs  in  a  tendency  to  have  cataphoric  reference: 
Whereas  the  referent  of  a  prefix-less,  'anaphoric'  dDEM  is  likely  to  be  identified 
and/or  defined  in  a  previous  clause,  the  referent  of  a  prefixed,  'cataphoric' 
dDEM  is  likely  to  be  identified  and/or  defined  in  a  subsequent  clause.  Cf.  Hock 
1982:13-14  for  details. 

Of  these  two  classes,  only  the  dDEMs  share  with  RELs  and  INTs  the  strong 
tendency  to  be  fronted.  Cf.  (19),  in  which  substituting  dDEMs  for  the  attested 
IDEMs  results  in  highly  marked  strings  unless  the  dDEMs  are  fronted. 

(19)  (a)       eslmi         vai        lokanam'      ayamJ  eva       vyomaJ 

IDEM-Gp.  PTCL-S  "worlds'-Gp.  IDEM-Nm.  PTCL-P  'ether'-Nm. 

idamk  antariksamk 

IDEM-Nn.  'atmosphere'-Nn. 
'Of  these  worlds  the  ether  is  this  atmosphere.'  (^B  5.1 .2.5) 

(b)  ?etesam'  vai  lokanam'  esahJ  eva  vyomaJ  etat'^  antariksamk 

dDEM  dDEM  dDEM 

(c)  etesam'  vaf  esahJ  etat*^  lokanam'  eva  vyomlJ  antariksami^ 

dDEM       dDEM  dDEM 

I  am  concerned  here  only  with  the  pronominal  lexemes  (including  pronom- 
inal lexemes  serving  adverbial  functions)  derived  from  the  pronominal  stems 
ya-,  ka-,  and  ta-  (all  RELs,  INTs.  and  dDEMs).  and  no  other  lexemes  in  the 
language.     I  shall  for  the  present  treat  them  as  marked  positive  for  a  feature 
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AFP,  for  'Accented  Fronted  Pronominal',  although  I  suspect  that  this  will  event- 
ually' prove  to  be  shorthand  for  a  complex  of  features. 

It  is  on  this  class  of  lexemes  that  the  Vedic  equivalent  of  'WH-fronting'  is 
defined.  In  Vedic  (though  not  in  the  Classical  language),  AFPs  are  normally 
fronted  within  the  clause;  they  are  only  rarely  found  associated  in  c-structure 
with  the  NPs  that  nevertheless  control  them.  See  (20)-(23),  in  which  the  super- 
scripts indicate  morphological  control. 

(20)  (a)  tau'         vnliih         ca  eva       yavah  ca 

dDEM-Nd.  'rice'-Ns.  PTCL-'&'  PTCL-P  'barley'-Ns.  PTCL-'&' 

[bhutau'lNP    ajayetam 

'beings'Nd.  'were  born' 

'Rice  and  barley,  these  two  beings  were  born.*  (AB  2.1 1.12) 

(b)    ?vr"ihih  ca  eva  yavah  ca  tau'  bhutau'  ajayetam 

(21)  (a)    yabhyah'    eva         taniJ     [devatabhyah"]Np  [havfrhsiJlNP 

REL-D    PTCL-P  dDEM-N      "deities'-D  'oblations'-N 

grhftani  bhavanti 

'taken'        'are' 

'For  which  deities  those  oblations  are  taken'  (AB  7.2.3) 

(b)  ?taniJ  havFrrisiJ  yabhyah'  eva  devatabhyah'  grhitani  bhavanti 

(c)  ?yabhyah'  eva  devatabhyah'  taniJ  havFrhsiJ  grhitani  bhavanti 

(22)  (a)       kati'        ayam       adya  udgata  [[asmfnJ        yajfieJ]NP 

INT-Ap.  IDEM-Ns.  'today'    Ns.       IDEM-Ls.  'sacrifice'-Ls. 

[stotriyah'jNP  stosyatijvp 

Ap.  'sing' 

'How  many  stotriyas  will  this  Udgatr  sing  in  this  sacrifice  today?' 
(6b  14.4.6.12) 

(b)     ?ayam  adya  udgata  asmin  yajn6  kati  stotriyah  stosyati 

(23)  (a)        tatha        u  ha        etam'      esahJ       atiyantam'    agnihJ 

Adv.-AFP  PTCL-S  dDEM-A  dDEM-N  'passing  by'-A  'fire'-N 

dhuryah         na     hinasti. 
'assoc.w.yoke'-N  'not'  V-'injure' 

'Then  this  fire  in  the  yoke  does  not  injure  the  one  passing  by.' 
(6B  1.1.2.10) 

(b)  ?tatha  u  ha  etam'  atiyantam'  e§ahJ  agnihJ  dhuryah  nd  hinasti. 

(c)  Ttatha  u  ha  esahJ  agnihJ  dhuryah  et^m'  atiyantam'  n^  hinasti. 
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In  trying  to  account  for  this  behaviour,  I  will  start  by  assuming  the  LP  rule  in 
(24),  which  will  guarantee  that  AFPs  will  precede  any  non-pronominal  sisters. 

(24)  +AFP<-PRO 

2.2.     Some  complications 

2.2.1.     Interaction  between  distinct  fronting  processes 

2.2.1.1.  Pronominal  fronting  and  topicalization 

The  ordering  of  AFPs  is  complicated  by  two  independent  features  of  Vedic 
syntax.  One  is  topicalization.  As  is  evident  from  (25)-(26),  AFPs  may  be  pre- 
ceded by  the  clause-initial  PTCL  string  and  its  left  sister,  the  'topicalized'  word. 
According  to  the  proposed  formal  description  of  PTCL  syntax  (cf.  above,  (18)- 
(19)),  however,  these  two  elements  are  sisters  of  S[SLASH],  not  of  any  lex- 
emes. If  the  AFPs  in  (25)-(26)  are  assumed  to  be  immediate  daughters  of 
S[SLASH],  then  LP  rule  (24)  will  front  them  therein,  but  will  not  refer  to  the  PTCL 
string  or  the  topicalized  word,  since  these  are  aunts,  not  sisters,  of  the  AFPs. 

(25)  sarvamj       u  eva        esah    etat  [sapatnanam  t|  J^p 

'all'-A  PTCL-S  PTCL-P  dDEM-N  'thus'  'enemies'-G 

sam-vrhkte 

'appropriates' 

'On  the  other  hand,  this  (person)  thus  appropriates  in  fact  his 

enemies'  all  =  all,  in  fact,  his  enemies'  goods.'  (^B  1.7.2.24) 

(26)  indhej      ha  vai         etat        adhvaryuh  [idhmena  agnim  tj  ]vp 
'ignites'  PTCL-S  dDEM-Adv.      'A.'-N      'kindling'-l  'fire'-A 
'Now  the  Adhvaryu  indeed  lights  the  fire  with  the  kindling.' 
(3b  1.3.5.1) 

2.2.1.2.  Pronominal  fronting  and  clitic  pronouns 

The  ordering  of  AFPs  is  also  complicated  by  clitic  pronouns.  Clitic  pro- 
nouns in  Sanskrit  are  obligatorily  unaccented  personal  pronouns  having,  unlike 
most  AFPs,  the  syntactic  status  of  full  NPs.''"'  Being  unaccented,  clitic  pronouns 
require  phonological  hosts  and  are  subject  to  phonological  constraints  on  their 
ordering  within  the  clause.  In  Vedic  verse  it  is  possible  for  a  clitic  pronoun  to 
cliticize  to  its  governor  (case-assigner)  or  to  appear  at  the  end  of  a  clause  or 
metrical  unit.  (A  phonological  constraint  operating  throughout  the  history  of 
Sanskit  requires  that  the  first  word  in  a  clause  or  metrical  unit  be  accented,  and 
since  clitic  pronouns  cannot  be  accented,  they  are  ipso  facto  excluded  from 
such  positions.)  In  Vedic  Prose  they  can  appear  only  in  the  position  immed- 
iately following  the  first  word  in  the  clause  and  the  PTCL  string,  if  any.  They 
necessarily  precede  any  AFPs  following  the  PTCL  string.  Cf.  (27)-(28). 
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(27)  (a)    vrtram      hi         asmai        etat'  jaghnuse 

A     PTCL-S  'him'-Dcl.  dDEM-A  'having  slain'-D 

apyiyanam'  akurvan 

'envigorating  draught'-D  'made' 
'For  him,  (the  one)  having  slain  Vrtra,  they  prepared  this 
envigorating  draught.'  (^B  1.6.4.12) 

(b)     *vrtram  hf  etat'  asmai  jaghnu?e  apyayanami  akurvan 

(28)  (a)         te  enam  etat'        vratam'    upayantam      viduh 

dDEM-Np.  'him'-Acl.  dDEM-As.   "vow'-A  'undertal<ing'-A  'know' 
'They  know  him  (to  be)  the  one  undertaking  this  vow.' 
(6B  1.1.1.7) 

(b)  *te  etat  enam  vratam  upayantam  viduh 

(c)  *te  etat  vratam  upayantam  enam  viduh 

The  ability  of  clitic  pronouns  to  precede  AFPs  in  strings  like  these  is  ac- 
counted for  by  the  fact  that  they  are  +PRO.  and  LP  njle  (24)  constrains  AFPs 
only  to  precede  -PRO  sisters."'^ 

2.2.2.     Double  determiners 

In  special  cases,  one  NP  can  in  Vedic  control  two  determiners.  It  can 
control  one  determiner  of  any  type,  and  in  addition  one  of  any  of  the  following 
types:  one  PEL,  one  dDEM,  or  one  INT.  Cf.  (29)-(30).  In  order  to  account  for 
strings  such  as  this,  it  is  necessary  to  assume  that  a  given  NP  may  control  as 
many  as  two  distinct  determiners. 

(29)  etam'     havai      etami  nyahgami  anu  garbhah 
dDEM-A  PTCL-S  dDEM-A  'sign'-A  'according  to'-Adp.(A)      N 

'In  accordance  with  this  same  sign  is  "garbhah".'  (JUS  1 .1 .4.2) 

(30)  [sah'  esah'         eva         mrtyuh']       [yah'         esah' 
dDEM-Nm.  dDEM-Nm.  PTCL-P  'death'-Nm.  REL-Nm.  dDEM-Nm. 

et^sminJ  min^laleJ    punj?ah'].    6tha       [e\&\^  amrtam"*] 

dDEM-L      'orb'-L    'person'-Nm.  COMP  dDEM-Nn.  'immortality'-Nn. 

[y^t"*        et^fk        arcfhk     dipyate] 
REL-Nn.dDEM-Nn.  'light'-Nn.  'shines' 

'Which  this  person  is  in  that  orb,  that  very  one  Is  death,  and  which  this 
light  shines,  that  very  one  is  immortality  ■  That  very  person  in  that  orb 
is  death,  and  this  shining  light  is  immortality.'  (^8  10.5.2.3) 

A  specialized  usage  exists  for  a  double  REL,  meaning  approximately 
'whichever',  answered  by  a  double  dDEM.    In  general,  doubling  any  word 
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results  in  a  'distributive'  or  intensive  moaning.    See  below,  ex.  (31)-(32).    Such 
doublings  can  almost  certainly  be  considered  (quasi-)lexicalized. 

(31)  yam' yam'      nu  eva      aham  abhi-ava-iksisye,  tena' tena' 

REL-A     PTCL-SPTCL-P    T-N  "look  at'-fut.  dDEM-l 

eva  me  saha  bhavisyati 

PTCL-P  'mine'-cl.  'with'-Adp.(l)  'become'-fut. 
•Whomever  I  will  look  at  with  that  one  together  will  be  mine.'  (JB  17) 

(32)  tat'  tat'  anadisya  §jyasya      eva      rupena  grhnati 
dDEM-A  'without  announcing'  'butter'-G  PTCL-P  'form'-!      'takes' 
'Each  one  he  takes  with  (=  in)  the  form  of  butter  without  announcing 
(it).'  (^B  1.3.2.7) 

2.2.3.     Non-fronted  determiners 

This  paper  will  be  primarily  concerned  with  fronted  determiners.  It  must 
not  be  thought,  however,  that  the  fronting  of  these  determiners  is  absolutely 
obligatory  in  Vedic,  any  more  than  the  fronting  of  WH-words  is  obligatory  in 
English.  While  RELs,  INTs,  and  dDEMs  have  a  strong  tendency  to  be  fronted, 
strings  with  non-fronted  determiners  do  occasionally  occur.   Cf.  (33). 

(33)  ?tei  devlhi  etatJ       yajuhJ  apasyan 
dDEM-Np.  'gods'-Np.  dDEM-As.     As.        'saw' 
'The  gods  saw  this  Yajus.'  (TS  5.2.3.1 ) 

In  this  clause,  since  the  pronoun  etat  is  just  as  much  a  member  of  the 
dDEM-class  (and  therefore  of  the  AFP-class)  as  te,  we  would  expect  the  two 
demonstratives  to  be  next  to  each  other  as  in  (33'),  rather  than  next  to  their 
respective  controllers.  Yet  this  is  not  the  case. 

(33')  te'         etatJ     devShi    yajuhJ    apasyan 

dDEM    dDEM 

Strings  such  as  (33)  are  quite  rare  in  the  Vedic  Prose  corpus  (hence  the 
question  mark  qualifying  (33)),  yet  they  do  occur. 

The  problem  with  which  this  paper  is  concerned  is  this:  With  regard  to 
constituent  order,  AFPs  generally  behave  as  immediate  daughters  of  S,  being 
free  to  be  fronted  within  S  rather  than  NP;  but  with  regard  to  morphology  they 
behave  as  daughters  of  NP,  in  that  they  can  be  controlled  for  purposes  of 
morphological  agreement  by  any  NP  in  the  clause. ""^  A  descriptively  adequate 
theory  of  Sanskrit  needs  to  account  for  this  discrepancy. 

2.3.     Some  proposed  solutions 

2.3.1.     Pruning  in  semantics 

Some  (e.g.  Arnold  Zwicky,  Georgia  Green,  pp.c.)  have  suggested  that  this 
problem  could  be  solved  semantically:    Allow  AFPs  to  be  freely  generated  as 
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daughters  of  S,  with  any  agreement  marking  conceivable,  and  let  the  semantics 
sort  out  which  strings  are  acceptable.  This  is  a  logically  possible  solution,  con- 
sistent also  with  Chomsky's  (1988)  recent  assertion  that  grammars  generate 
huge  amounts  of  'garbage'  (in  the  sense  of  unacceptable  strings)  anyway,  and 
it's  the  job  of  semantics  to  eliminate  it.  In  Chomsky's  view  a  grammar  (or  a  the- 
oretical framework  for  grammar)  cannot  be  faulted  for  failing  to  do  the  pruning 
job  that  can  be  left  to  semantics. 

This  proposed  solution,  however,  is  only  adequate  if  all  of  the  relevant 
features  are  accessible  to  semantic  analysis,  and  in  the  case  of  Sanskrit 
determiners  this  is  dubious  at  best.  In  Sanskrit,  there  are  three  agreement 
features  necessary  for  pronominal  determiners,  gender,  number,  and  case, 
illustrated  below,  (34)-(36).  (See  the  explanation  of  abbreviations  for  these 
features  at  the  end  of  the  paper.) 

(34)  (a)       etamJ  u  eva  esah'        etasmaii^  visnuh 

dDEM-Af.  PTCL-S  PTCL-P  dDEM-Klm.  dDEM-Dm.      Nm. 

yajnah'  vikrantimJ  vf-kramati 

'sacrifice'-Nm. 'victorious  step'-Af.   'strides' 
'This  sacrifice,  Visnu,  strides  this  victorious  step  for  him.' 
(^B  1.1.2.13) 

(b)       *etamJ     u  eva      etat'        etasmai^  visnuh  yajii^h' 
dDEM-Am.  dDEM-Nn.  dDEM-Dm.     Nm.      Nm. 

vikrantimJ  vi-kramati 
Af. 

(35)  (a)  tau'  vnTiih         ca  eva        yavah  ca 

dDEM-Nd.  'rice'-Ns.  PTCL-'&'  PTCL-P  'barley'-Ns.  PTCL-'&* 

bhutau'       ajayetam 
'beings'-Nd.  'were  born' 
'Rice  and  barley,  these  two  beings  were  born.'  (AB  2.1 1.12) 

(b)  sah'      vrihih  ca  eva  yavah  ca  bhutau'  ajayetam 
dDEM-Ns.         ■  Nd. 

(c)  te"       vnliih  ca  eva  yavah  ca  bhutau'  ajayetam 
dDEM-Np.  Nd. 

(36)  (a)       tasya'     dasa         avastat  ahani   divakTrtyasya' 

dDEM-G  'ten'-N  'before'-Adp.(G)  'days*-N  G 

bhavanti 

'are' 
"Before  the  Divakfrtya  there  are  ten  days.'  (AB  4.18.4) 
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(b)  *sah'    dasa  avastat  ahani  divakfrtyasya'  bhavanti 
dDEM-N  G 

(c)  'tasmini  dasa  avastat  ahani  divakfrtyasya'  bhavanti 
dDEM-L  G 

While  gender  and  number  might  conceivably  be  regarded  as  semantic  in 
nature, ""^  case  is  a  purely  grammatical  feature  with  no  direct  relation  to  the  na- 
ture of  the  referent.  This  is  obvious  from  the  fact  that  co-referential  nominals  in 
successive  clauses  can  be  in  different  cases,  though  they  will  almost  certainly 
agree  in  gender  and  number.  If  semantics  is  unable  to  force  co-refential  nomi- 
nals in  different  clauses  to  agree  in  case,  I  don't  see  how  it  can  do  so  within  the 
same  clause. 

Case  is  also  a  purely  grammatical  feature  in  that  it  is  assigned  to  NPs 
syntactically  (whether  by  configuration  or  subcategorization),  not  freely.  In  a 
language  like  Sanskrit  with  a  rich  morphological  case  system,  head  nouns 
inherit  their  morphological  case  from  their  NP  mothers.  APs  get  case  by 
agreement-control  from  their  governing  head  nouns  or  from  their  NP  mothers 
(depending  on  details  of  theoretical  formalism),  and  the  same  procedure  is 
routinely  assumed  for  determiners.  However,  this  assumption  requires  the 
further  assumption  that  determiners  are  daughters  of  NP,  and,  as  stated  above, 
the  problem  with  which  this  paper  deals  is  precisely  that  in  Vedic  some  deter- 
miners do  not  behave  as  daughters  of  NP  with  regard  to  constituent  order. 
Either  case  must  be  assigned  by  the  syntax  to  determiners  independently  of 
NPs,  which  seems  like  a  violation  of  Occam's  Razor,  or  the  case  assigned  to 
NPs  must  somehow  be  also  assigned  automatically  to  their  determiners,  even 
when,  in  terms  of  constituent  structure,  these  are  not  their  daughters. 

2.3.2.      Subcategorization 

Since  verbs  and  adpositions  have  to  subcategorize  for  their  arguments  in 
terms  of  case,  it  might  be  asked,  Cannot  this  constraining  mechanism  be  ex- 
tended in  order  to  solve  the  problem  that  morphological  case  agreement  poses 
for  semantic  pruning,  as  discussed  in  the  previous  section?"'^ 

With  regard  to  determiners  agreeing  with  subcategorized  arguments,  the 
assignment  of  case  as  a  result  of  subcategorization  might  be  extended  in  such 
a  fashion  in  an  HPSG-type  theory  with  little  trouble  (cf.  Pollard  &  Sag  1987). 
Within  GPSG,  in  which  subcategorization  requirements  must  be  filled  by  sisters 
(Gazdar  et  al.  1985:33-34),  such  an  extension  might  prove  formally  more  diffic- 
ult. But  I  am  not  going  to  address  this  problem  in  detail,  because  in  view  of  the 
problem  discussed  in  the  following  paragraph,  the  proposed  extension  Is  itself 
clearly  inadequate. 

It  is  assumed  that  verbs  and  adpositions  subcategorize  for  their  arguments 
(with  the  exception  in  GPSG  of  subjects,  which  are  not  sisters  of  lexical  verbs  — 
cf.  Gazdar  et  al.  1985:34).  But  it  is  also  assumed  that  verbs  do  not  subcategor- 
ize for  VP  adverbials.  even  though  such  do  occur. i^  As  noted  above.  1.2.1.2., 
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adverbials  can  take  the  form  of  oblique-case  NPs.    Such  NPs  may  control 
determiners  which  may  be  fronted.  Cf.  (37). 

(37)  etamJ  u  eva  esah'       etasmaji^  vi'snuh 
dDEM-Af.  PTCL-S  PTCL-P  dDEM-Nm.  dDEM-Dm.    Nm. 

[yajfiah'lNP  [[Q^np     [vikrantimJ]NP     vi-kramati.]vp 
'sacrifice'-Nm.  'victorious  step'-Af.    'strides' 

'This  sacrifice,  Visnu,  strides  this  victorious  step  for  him.' 
(6B  1.1.2.13) 

In  (37)  we  have  an  AFP  determiner  etasmai  controlled  by  a  (phonological- 
ly  null)  dative  NP  adverbial  modifying  the  VP.  While  it  does  subcategorize  for 
an  accusative  direct  object  {(etam)  vikrantim  in  (52)),  I  have  no  reason  to  be- 
lieve that  the  verb  vi'-kram  subcategorizes  for  a  beneficiary-dative  NP.  The  fact 
that  NPs  which  do  not  get  case  by  subcategorization  can  themselves  control,  for 
the  purposes  of  case-agreement,  AFP  determiners  indicates  that  subcategoriza- 
tion itself  cannot  adequately  account  for  case-agreement  on  AFP  determiners. 

2.3.3.     Extraction 

One  possible  solution  that  immediately  springs  to  mind  is  to  have  all  AFPs 
extracted  from  any  phrasal  mother  into  S  by  means  of  SLASH."" ^  it  would  cer- 
tainly be  possible  to  define  instantiations  of  SLASH  with  appropriate 
pronominal  values,  so  that  there  could  be  ID  rules  such  as  those  in  (38)-(43). 

(38)  S  -►  AFP[x].  NP[x]/AFP[x],  VP 

(39)  S  -♦  AFP[x],  NP.  VP/AFP[x] 

(40)  VP/AFPM  -*  V,  NP[x]/AFP[x] 

(41)  VP/AFP[x]  -♦  V,  NP,  PP/AFP[x] 

(42)  PP/AFP[x]  -»  P.  NP[x]/AFP[x] 

(43)  NPIx]/AFP[x]  -♦  N[x],  AFP[x]/AFP[x] 

There  are  two  major  problems  with  this  hypothesis.  One  involves  the 
possibility  of  VPs  dominating  two  or  more  NPs,  each  of  which  may  control  a 
demonstrative.  For  example,  (44)  below  might  be  analyzed  as  a  case  of 
extraction  along  the  lines  of  (44'). 

(44)  etat '        ha  vaf      devah      vratam'      caranti 
dDEM-As.  PTCL-S  'gods'-Np.  'vow'-As.  'undertake' 
'The  gods  undertake  this  vow.'  (^B  1.1.1.5) 

(44')  [etati]dDEM  ha  vai  [dDEM/dDEM  devah  [[dDEM/dDEM 

vratam']NP/dDEMcaranti]vp/dDEM]s/dDEM 
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But  in  (34),  as  already  noted  in  tne  previous  section,  the  VP  dominates 
both  an  accusative  NP  and  a  dative  NP  (phonologicaliy  null),  both  of  which 
control  AFP  determiners.  Therefore,  in  this  clause  the  VP  would  have  to  be 
SLASHed  twice,  once  for  each  daughter  NP  controlling  a  demonstrative.  Within 
orthodox  GPSG,  this  is  impossible  (Gazdaret  al.  1985:81). 

Maling  &  Zaenen  1982,  however,  (and  Zwicky  p.c.)  have  already  argued 
that  double-SLASHing  may  be  necessary  in  at  least  some  languages.  So  this, 
by  itself,  is  perhaps  not  an  insurmountable  problem.  The  hypothesis  runs  into 
more  serious  problems  in  Sanskrit  on  two  counts:  (a)  AFP-fronting  in  Sanskrit 
fronts  only  words  (determiners),  not  whole  phrases,  and  (b)  a  given  NP  in 
Sanskrit  can  control  more  than  one  determiner. 

Consider  (45).  Assuming  that  the  two  dDEMs  are  generated  separately 
rather  than  as  some  kind  of  composite  determiner  (which  the  intervention  of  a 
PTCL  strongly  suggests),  it  would  presumably  be  possible  for  the  accusative  NP 
to  dominate  both,  or  either,  and  for  only  one  (or  none)  to  be  extracted  into  and 
fronted  within  the  clause,  as  in  (46). is 

(45)  etam'         vai         etam'     prastavena  ream'  aptva 
dDEM-Af.  PTCL-S  dDEM-Af.      'P.'-lm.     'r.'-Af.  'having  obtained' 
'Having  by  means  of  the  prastava  obtained  this  (same)  re' 
(cf.JUB  1.4.3.3) 

(46)  (a)     ?prastavena  vai  [etam    etam  rcamJNp  aptva 

dDEM  dDEM 

(b)     *etamj  vai  prastavena  [  d;  etam  rcamJNp  aptva 
dDEM  dDEM 

Yet,  while  it  is  possible  (though  rare)  for  dDEMs  not  to  be  fronted  (see 
2.2.3.),  if  an  NP  controls  two  dDEMs  it  is  not  possible  for  one  to  be  fronted  and 
the  other  noO^ 

The  problems  discussed  in  this  section  result  at  least  in  part  from  the 
assumption  that  fronted  determiners  are  generated  according  to  ID  rule  schema 
(3)  as  daughters  of  the  NPs  whose  semantic  determiners  they  are,  and  then 
must  be  extracted  into  S.  Since  they  regularly  behave  with  regard  to  c-structure 
as  daughters  of  S,  let  us  suppose  that  they  are  in  fact  regularly  generated  under 
S,  and  amend  ID  rule  schemata  (2)  and  (3)  accordingly  to  (2")  and  (3'): 

(2')    S  -♦  (AFP)*,  NP,  (Adv)*,  VP 

(3')    NP  -»  (Det),  (NP[GEN]),  (AP),  H02O 

However,  it  is  still  necessary  for  AFP  determiners  to  be  controlled  in  some 
way  by  NPs,  even  NPs  inside  VP.  This  means,  within  a  GPSG  framework,  that 
the  necessary  agreement  features  must  be  accessible,  either  in  the  specifica- 
tion of  their  mother  category  (S)  or  among  their  sisters,  even  if  those  features 
are  initially  specified  on  an  NP  within  VP. 


146 

2.4.    Attempt  at  a  GPSG  description^^ 

2.4.1.  GEND,  NUM,  &  CASE  are  Foot  features 

In  attempting  a  GPSG  solution  to  this  problem,  I  begin  by  proposing  that 
the  necessary  agreement  features  for  AFPs,  namely  GEND,  NUM,  and  CASE, 
are  all  Foot  features,  meaning  that,  whenever  any  of  them  are  instantiated  on 
any  category,  they  must  'percolate  up'  to  that  category's  mother,  along  with  their 
instantiated  values.  (Cf.  Gazdar  et  al.  1985:78-83.) 

Designating  such  inherently  nominal  features  as  GEND  and  CASE  as  Foot 
features  means  instantiating  them  on  -N  categories  such  as  VP,  which  is  con- 
trary to  general  intuitive  notions  of  the  nature  of  such  categories.  But  it  does  get 
the  necessary  agreement  features  where  we  need  them  for  control  of  AFPs  in  S 
from  VP-internal  NPs.  For  instance,  a  VP  node  dominating  a  direct  object  NP 
would  be  marked  with  the  agreement  features  appropriate  for  that  NP,  and  it 
would  be  a  simple  matter  to  generate  an  AFP  to  agree  therewith. 22 

Problems  arise,  however,  when  a  given  node  (say,  VP)  has  to  be  marked 
with  conflicting  values  for  one  or  more  agreement  features.  For  instance,  in 
(37),  according  to  this  hypothesis  the  VP  node  must  for  the  sake  of  the  direct 
object  be  marked  [GEND:F;  CASE:ACC];  but  for  the  sake  of  the  indirect  object- 
beneficiary,  in  order  to  generate  the  Demonstrative  etasmai,  it  must  also  be 
marked  [GEND:M;  CASE:DAT|. 

Clearly,  if  a  VP  node  is  to  bear  a  variety  of  instantiations  of  agreement  fea- 
tures, one  for  each  NP  it  dominates,  all  of  these  must  somehow  be  segregated 
from  each  other  as  values  for  different  features,  in  order  to  be  equally  available 
to  the  system  while  not  conflicting  with  each  other. 

2.4.2.  Annotation  of  NPS 

To  segregate  distinct  instantiations  of  agreement  features  from  each  other 
within  a  single  category,  I  propose  encapsulating  each  such  instantiation  within 
an  intermediate  category  (=  bundle  of  feature  values),  cross-referenced  to  the 
relevant  NP.  These  capsule-categories  might  be  arbitrarily  labelled,  but  the 
simplest  and  most  direct  way  to  label  them  is  probably  by  labels  relating  to 
grammatical  relation  or  function.  If  we  assume  something  like  the  LFG  principle 
of  functional  uniqueness  (Bresnan  1982:181-182),  a  designation  by  grammat- 
ical function  will  necessarily  refer  to  a  unique  NP.  In  which  case,  any  arbitrary 
method  of  distinctively  labelling  NPs  will  in  fact  be  equivalent  to  every  other 
such  possible  method,  including  labelling  by  grammatical  function.  And 
labelling  by  grammatical  function  is  at  least  mnemonically  convenient.23 

To  present  the  details  of  this  hypothesis,  I  will  first  specify  some  definitions. 

An  UNDEFINED  feature  is  one  which  does  not  occur  within  a  given  category. 
(Cf.  Gazdar  et  al.  1985:ch.2  passim.)  An  unspecified  feature  is  one  which  is 
defined  for  a  given  category,  but  to  which  no  value  is  assigned  by  that  category. 
For  example,  a  given  category  might  have  the  feature  CASE  defined  but  un- 
specified, which  I  will  notate  as  [CASE:  ***] 
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There  is  a  feature-set  GF  of  all  tne  possible  Grammatical  Functions  (a  la 
LFG  or  RG)  in  a  given  language  (in  this  case,  Vedic),  hereafter  referred  to  as 
GF-features. 

Each  GF-feature  is  a  category-valued  feature,  taking  typical  NP-type 
values.  (The  necessary  agreement  features  for  AFPs  are  the  most  important 
ones  for  our  discussion,  but  values  such  as  ±HUMAN  or  ±ANIN/1ATE  may  also 
be  relevant  to  a  given  language.) 

Each  GF-feature  is  a  Foot  Feature. 

A  set-valued  feature  is  one  which  takes  as  its  value  a  set  of  features.24 

There  is  a  Set-Valued  Feature  RELAT.  whose  value  is  always  a  subset  of 
GF.  Verbs  and  adpositions  have  the  feature  RELAT  as  part  of  their  lexical 
entries,  and  their  subcategorization  frames  may  be  defined  thereby  (cf.  below). 

Since  verbs  and  adpositions  assign  case  to  their  nominal  complements 
(i.e.,  a  given  verb  or  adposition  can  collocate  only  with  an  NP  marked  for  a 
specific  case,  not  any  case),  their  lexical  entries  would  need  to  specify  the  value 
of  the  feature  CASE  on  their  NP-complements.  Since  under  this  hypothesis  the 
necessary  features  of  NP-complements  would  be  coded  in  RELAT,  this  would 
be  the  appropriate  place  to  code  this  information.  Since,  however,  verbs  and 
adpositions  generally  have  no  constraints  on  the  grammatical  number  or 
gender  of  their  NP-complements,  there  would  be  no  reason  to  specify  this 
information  in  RELAT  in  the  lexicon. 

For  example,  a  verb  like  anu-jna  'grant'  may  have  in  the  lexicon  a  value  of 
RELAT  as  in  (47). 

(47)  [RELAT:  10  TGEND:  *"  1.  DO  TGEND:  *"     11 
I  INUM:"*      I         INUM:"*        II 

L  LCASE:  DATj         [CASE:  ACC  JJ 

This  example,  by  the  way,  should  make  clear  how  a  set-valued  feature  is 
different  from  a  category-valued  feature.  The  value  of  each  of  the  GF-features 
10  and  DO  is  a  category,  that  is,  it  is  a  set  of  features  of  the  form  {a,  b,  c,  d,  ...} 
such  that  each  feature  is  distinct  from  the  others:  GEND  is  a  distinct  feature  from 
NUM  which  is  distinct  from  CASE.  This  particular  instantiation  of  RELAT,  how- 
ever, includes  in  its  value  set  two  instantiations  apiece  of  the  features  GEND, 
NUf^,  and  CASE,  and  the  two  instantiations  of  CASE,  for  instance,  each  being 
assigned  a  different  value,  would  conflict  (i.e.,  be  incompatible)  with  each  other 
were  they  not  clearly  distinguished  by  being  encapsulated  in  distinct  categories. 

A  verb  or  adposition's  subcategorization  frame  could  be  defined  in  terms 
of  its  lexical  RELAT  value  if  we  adopt  the  following  stipulation:  At  the  phrasal 
level,  no  unspecified  features  would  be  allowed.  This  means  that  any  VP  or  PP 
category  must  dominate,  in  addition  to  its  Head,  the  NPs  that  will  provide  actual 
values  for  features  such  as  GEND  and  NUM  which  are  not  specified  by  the 
Head's  RELAT  value. 
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Thus,  the  RELAX  feature  lexically  assigned  to  any  verb  or  adposition 
would  encapsulate  everything  that  needs  to  be  specified  by  the  customary  un- 
derstanding of  such  a  controller's  'subcategorization  frame',  and  the  prohibition 
against  unspecified  features  at  the  phrasal  level  will  guarantee  that  the  'subcat- 
egorization frame'  must  be  filled.25 

RELAT  is  also  defined  for  NPs,  taking  as  its  value  a  single  GF.  A  GF  a  in 
RELAT  can  get  its  values  for  GEND  and  NUM  from  an  NP  whose  own  RELAT 
value  is  a. 26 

Since  RELAT  is  a  Foot  feature,  the  GF-features  for  any  NP  within  VP  or  PP 
will  percolate  up  to  the  mother-node,  along  with  their  values  for  the  necessary 
agreement  features.  AFPs  can  then  be  generated  in  agreement  with  these  GF- 
features  as  shown  in  Figs,  la.  Ha,  Ilia,  at  the  end  of  this  paper.  Similar  strings 
with  AFPs  that  do  not  agree  with  the  agreement  features  instantiated  in  any  of 
the  GF-features  under  S,  such  as  those  in  (34b),  (35b-c),  and  (36b-c),  would  be 
ungeneratable. 

For  convenience,  I  will  continue  to  describe  an  AFP  generated  in  agree- 
ment with  features  in  a  given  GF-feature  as  being  controlled  by  the  nominal 
bearing  that  GF  feature,  whether  they  are  sisters  or  not. 

2.5.    An  LFG  description 

I  will  maintain  in  the  LFG  analysis  the  conclusion  reached  in  2.3,  that  AFPs 
are  generated  as  daughters  of  S,  while  still  being  controlled  by  NPs  which  may 
or  may  not  be  their  sisters.  So  the  question  once  again  is.  How  are  AFPs 
controlled? 

The  precise  LFG  form  of  ID  rule  schema  (2')  would  probably  be  something 
like  (2"),  where  X  is  a  variable  over  the  set  of  functions  that  can  be  assumed  by 
NPs.  This  means  that  an  AFP  can  be  generated  immediately  dominated  by  S 
but  functioning  as  the  determiner  for  any  NP  within  the  clause,  whether  an 
immediate  daughter  of  S  or  not. 

(2")    S  -»      (AFP)*,  NP,    (Adv)*,VP 

(TX  DET  =  i)  (TSUBJ  =  i)      (T  =  i) 

In  f-structure,  each  AFP  determiner,  along  with  its  value  for  GEND,  NUM, 
and  CASE,  must  be  merged  into  the  f-structure  of  the  NP  whose  determiner  it  is 
identified  as  by  its  functional  annotation. 27  lfG  has  a  requirement  called  the 
consistency  condition,  which  prohibits  conflicts  in  the  value  of  any  feature  within 
a  particular  f-structure. 28  Thus,  in  the  f-structure  the  GEND,  NUM,  and  CASE 
features  of  an  AFP  determiner  would  be  expected  to  merge  with  those  of  the  NP 
it  determines,  which  they  could  do  only  if  their  respective  values  did  not  conflict. 
Were  there  a  conflict,  the  f-structure  would  be  ill-formed. 

The  result  can  be  seen  in  Figs,  lb,  lib,  and  lllb  at  the  end  of  this  paper, 
which  represent  LFG  analyses  corresponding  to  the  GPSG  analyses  represent- 
ed in  Figs,  la,  lla.  and  Ilia.    Although  in  the  LFG  analyses  the  determiners 
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appear  in  c-structure  independent  of  the  nominals  they  determine,  they  are 
included  within  those  same  nominals  in  f-structure.  Their  agreement  therewith 
in  all  relevant  features  is  therefore  guaranteed  by  the  consistency  condition, 
while  the  variants  in  (34b),  (35b-c),  and  (36b-c)  would  be  likewise  rejected. 

2.6.     Problems  with  the  GPSG  description 

So  far,  the  GPSG  analysis  described  in  2.4  may  be  able  to  describe  the 
data  under  consideration.  However,  it  requires  many  stipulations  (in  addition  to 
the  stipulation  that  an  NP  in  Sanskrit  may  control  two  determiners,  which  is  a 
language-specific  stipulation  required  in  any  case)  the  content  of  which  in  the 
LFG  analysis  follows  directly  from  the  general  assumptions  of  the  framework. 

The  proposed  GPSG  analysis  requires: 

(a)  a  distinction  between  undefined,  unspecified,  and  null  feature  specif- 
ications; 

(b)  a  distinct  category-valued  feature  for  every  GF  possible  in  a  given 
language; 

(c)  a  new  type  of  feature:  'set-valued'; 

(d)  a  set-valued  feature  RELAT,  defined  for  every  V0,  P0,  VP,  PP,  and 
NP; 

(e)  the  stipulation  that  the  set-valued  feature  RELAT,  the  individual  GF- 
features  that  make  it  up,  and  the  features  that  make  up  the  feature- 
matrix  of  each  GF-feature  are  all  Foot  features; 

(f)  the  stipulation  that  unspecified  features  are  not  allowed  at  the 
phrasal  level; 

(g)  the  assumption  that  a  GF-feature  within  a  given  RELAT  feature  that 
happens  to  be  part  of  the  feature-matrix  of  an  immediate  daughter  of 
S  is  able  to  control  agreement  on  an  AFP. 

All  of  these  stipulations  are  required  merely  to  account  for  phenomena  in  a 
single  language;  there  is  no  reason  to  believe  that  they  would  be  of  any  use  in 
the  grammar  of  any  language  other  than  Vedic  Sanskrit.  They  are,  in  fact,  just 
as  much  language-specific  stipulations  as  the  stipulation  that  an  NP  may  control 
two  determiners.  In  an  LFG  framework,  however,  they  are  unneccessary,  since 
their  consequences  are  provided  for  by  a  distinct  level  of  analysis,  f-structure, 
which  is  not  a  language-specific  stipulation  but  a  fundamental  and  general  part 
of  the  theory. 

There  are  further  problems,  however,  with  the  GPSG  analysis  that  incline 
me  all  the  more  to  conclude  that  LFG  is  better  able  to  account  for  the  data. 

For  one  thing,  the  GPSG  analysis  requires  that,  in  describing  a  string  like 
(34)  or  (48)  which  include  dDEMs  without  overt  controlling  NPs  but  with  clear 
nominal  referents,  an  NP  be  generated  that  is  phonologically  null  but  carries  all 
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the  feature  specifications  necessary  for  generating  of  the  appropriate  deter- 
miners. Cf.  Figs.  Ila  and  Ilia.  The  analysis  requires  that  in  every  case  involving 
the  generation  of  an  AFP  two  distinct  nominals  be  generated  which  agree  with 
each  other  in  ail  relevant  features,  in  spite  of  the  fact  that  one  of  them  may  be 
devoid  of  any  overt  phonological  form  expressing  those  features.  This  is 
certainly  a  logical  possibility,  but  is  clearly  inelegant,  since  it  calls  for  a  multi- 
plicity of  types  of  'empty  categories'  in  either  the  grammar  or  the  lexicon. 

(48)  evam     eva         esah'     etamj    a'  papmane    dvisate    bhrgtrvyaya 
•thus"  PTCL-P  dDEM-N  dDEM-A         'evil'-A  'spiteful'-D      'foe'-D 

vajramJ  ut-yachati 

'thunderbolt'-A  'raises  against' 

'Thus  he  raises  this  thunderbolt  against  the  evil,  spiteful  foe.' 
(^B  1.2.4.3) 

But  in  the  LFG  analysis  while  determiners  may  be  separated  from  their 
NPs  in  c-structure  they  are  still  part  of  them  in  f-structure.  Thus,  in  Figs,  lib  and 
lllb  the  f-structures  of  the  phonologically  null  NPs  'get'  the  information  about 
gender  and  number  from  the  phonologically  overt  determiner  and  merely  note 
the  fact  that  there  is  no  other  phonological  realization  of  these  NPs  (that's  what 
'PRED  :  PRO'  means).  Neither  in  c-structure  nor  in  f-structure  is  there  duplica- 
tion of  nominals.  In  this  respect  the  LFG  analysis  is  more  elegant. 

In  addition  to  being  monumentally  cumbersome  both  in  the  amount  of 
extra  stipulations  it  makes  and  in  the  amount  of  extra  information  it  forces  into 
the  analyses  of  specific  clauses,  the  hypothetical  grammar  developed  in  2.4  is 
liable  to  overgenerate. 

At  the  end  of  2.3  it  was  recommended  that  a  descriptively  adequate 
grammar  of  Vedic  allow  AFP  determiners  to  be  generated  under  S  while  allow- 
ing determiners  to  be  generated  under  NP  as  well.  The  hypothetical  grammar 
developed  in  2.4  could  generate  a  clause  in  which  a  given  nominal  simultan- 
eously controlled  an  'NP-internal'  determiner  and  an  AFP.  See  (49).  (49a)  is 
the  actually-attested  string,  with  the  locative-case  noun  loke  controlling  a  IDEM 
asmin.  The  hypothetical  grammar  discussed  in  2.4  would  also  generate  a 
string  like  (49b),  in  which  the  same  noun  is  ALSO  controlling  a  +AFP  demon- 
strative within  S;  this  is  possible,  though  rare.  Furthermore,  the  proposed 
GPSG  analysis  predicts  the  possibility  of  generating,  within  a  given  clause,  an 
indefinite  number  of  AFPs  controlled  by  a  single  NP.  There  is  nothing  in  the 
hypothetical  grammar  being  discussed  to  prevent  the  generation  of,  say,  three 
AFPs  agreeing  with  the  LOG  NP  ias  in  (49c).  This  might  be  rejected  for  seman- 
tic or  pragmatic  considerations,  but  given  what  has  already  been  said  (2.2.2.) 
about  multiple  determiners  in  Sanskrit,  such  considerations  would  probably  not 
prevent  the  generation  of  something  like  (49d),  or  for  that  matter  like  (49e),  in 
which  loke  controls  two  cataphoric  dDEMs,  one  inside  and  one  outside  the  NP  it 
heads.  As  noted  above  (2.3.3,  discussion  of  (45)-(46)),  such  strings  do  not 
occur  in  the  corpus,  and  must  therefore  be  assumed  to  be  unacceptable.  As  is 
evident  from  (49f),  however  (cf.  also  (29)  and  (45)),  (49e)  would  be  rejected  not 
because  it  involves  two  identical  determiners  controlled  by  one  nominal  (in 
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which  case  semantic  or  pragmatic  considerations  might  be  of  help),  but  be- 
cause one  is  dominated  by  NP  while  the  other  is  not.  Barring  topicalization, 
both  such  determiners  must  be  sisters. 

(49)  (a)    stomena    vai        devah    [asmin       loke]NP     ardhnuvan 
•S.'-l     PTCL-S  'gods'-N   IDEM-L  'world'-L  'prospered' 
'By  means  of  the  Stoma  the  gods  prospered  in  this  world.' 
(TS  5.2.1.6) 

(b)  ?st6mena  vai  etasmin  devah  [asmin  loke]NP  ardhnuvan 

dDEM-L  IDEf^-L  L 

(c)  "stomena  vai  etasmin  yasmin  kasmin  devih  [asmin  lokejNP 

dDEM-L  REL-L    INT-L  IDEM-L  L 

ardhnuvan 

(d)  *st6mena  vai  etasmin  devah  [asmin  etasmin  loke]NP 

dDEM-L  IDEM-L  dDEM-L  L 

ardhnuvan 

(e)  *st6mena  vai  etasmin  devah  [etasmin  loke]NP  ardhnuvan 

dDEM-L  dDEM-L    L 

(f)  etasmin  vai  etasmin  stomena  devah  [lok6]NP  ardhnuvan 
dDEM-L       dDEM-L  L 

In  an  LFG  analysis,  however,  the  free  generation  of  multiple  determiners 
controlled  by  a  same  NP  is  prevented  straightforwardly.  The  principle  of  func- 
tional coherence  (cf.  Kaplan  &  Bresnan  1982:  211-212)  would  guarantee  that 
an  NP  will  control  at  most  one  determiner,  and  therefore  strings  with  multiply- 
de-termined  NPs  such  as  (69c-d)  could  not  be  generated.  As  noted  earlier,  it 
will  be  necessary  to  include  in  any  adequate  description  of  the  grammar  of 
Vedic  Sanskrit  a  stipulation  that  an  NP  may  control  as  many  as  two  determiners. 
With-in  an  LFG  analysis  such  as  I  have  been  discussing,  this  would  presumably 
take  the  form  of  a  stipulation  that  an  NP  may  control,  in  addition  to  the  'general' 
determiner  we  would  normally  expect,  an  optional  'second'  determiner  with, 
perhaps,  a  restricted  range  of  types  (cf.  above,  2. 2.2.). 29  Thus,  the  f-structure  for 
any  nominal  would  allow  for  a  second  determiner  (DET2),  whose  existence 
would  therefore  not  conflict  with  that  of  the  'general'  determiner  or  result  in 
functional  incoherence  (as  long  as  it  agrees  completely  in  GEND,  NUM.  and 
CASE,  of  course).  To  demonstrate.  I  include  an  LFG  analysis  of  (45)  in  Fig.  IV  at 
the  end  of  this  paper. 

3.     Conclusion 

In  this  paper,  I  have  attempted  to  develop  and  present  an  account  of  pro- 
nominal fronting  in  Vedic  Sanskrit  within  the  perspective  of  non-transformation- 
al generative  grammar.  Since  my  primary  concern  is  with  constituent  order,  I 
have  within  that  perspective  focussed  most  of  my  attention  on  the  description  of 
constituent  structure  (c-structure).  a  task  for  which  GPSG  has  a  wider  and 
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better-defined  range  of  tools.     I  have  therefore  adopted  ID/LP  format  and 
SLASH-extraction  to  general  non-transformational  generative  grammar. 

In  the  course  of  the  paper,  I  have  argued  that  Relative,  Interrogative,  and 
certain  Demonstrative  Pronominals,  as  well  as  pronominal  clitics,  must  be  gen- 
erated under  S,  even  if  they  are  controlled  by  NPs  which  are  complements  of  -N 
governors  and  are  therefore  not  their  sisters. 

In  2.4,  I  have  tried  to  develop  a  strategy  within  a  GPSG  framework  where- 
by AFPs  (Accented  Fronted  Pronominals)  can  be  controlled  by  NPs  anywhere 
within  the  clause,  including  within  VP  or  PP.  I  have  argued  that  this  necessit- 
ates a  means  of  uniquely  annotating  each  NP,  and  have  suggested  that,  if  such 
annotation  is  done  on  the  basis  of  GFs,  subcategorization  frames  may  be  defin- 
able in  terms  of  such  annotation. 

My  conclusion  is  that  the  GPSG  framework  may  be  able  to  describe  the 
facts,  but  in  order  to  do  so  must  be  burdened  with  a  large  number  of  extraneous 
and,  in  some  cases,  ad  hoc  features  in  the  form  of  redundant  null  constituents 
and  language-specific  stipulations.  And  even  so,  the  hypothesis  seems  to  have 
problems  with  overgeneration.  LFG  on  the  other  hand,  endowed  as  it  is  with  an 
(independently-motivated)  extra  level  of  analytic  structure,  can  handle  all  the 
facts  considered  simply  and  straightforwardly,  as  explained  in  2.5,  once  a  few 
details  of  c-structure  have  been  spelled  out.  I  am  therefore  inclined  to  the  view 
that  LFG  does  a  better  job  than  GPSG  in  dealing  with  the  problem  of  demon- 
strative-fronting in  Vedic. 


ABBREVIATIONS 

AB  =  Aitareya-Brahmana 

JB  =  Jaiminfya-Brahmana 

JUB  =  Jaiminfya-Upanisad-Brahmana 

RV  =  Rg-Veda  Sarhhita 

SB  =  ^atapatha-Brahmana 

SB  =  Sadvirhsa-Brahmana 

TS  =  Taittirfya-Sarhhita 

1 ,  2,  3  =  first,  second,  third  person 

s.,  d.,  p.  =  singular,  dual,  plural  number 

m.,  f.,  n.  =  masculine,  feminine,  neuter  gender 

N,  A,  I,  D,  Ab,  G,  L  =  nominative,  accusative,  instrumental,  dative,  ablative, 

genitive,  locative  case 
pres.,  impf.,  fut.,  impv.  =  present,  imperfect,  future  tense,  imperative  mood 
part.  =  participle 

pro.,  cl.  =  pronoun,  clitic  pronominal 
AFP  =  Accented  Fronted  Pronominal 

DEM,  REL,  INT  =  demonstrative,  relative,  interrogative  pronominal 
dDEM,  IDEM  =  discourse  demonstrative,  locational  demonstrative 
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PTCL  =  particle 

Adv,  Adp.  =  Adverb,  Adposition 

Adp.(A),  etc.  =  Adposition  governing  accusative,  etc. 

Pfx  =  verbal  prefix 


NOTES 

*!  would  like  to  acknowledge  the  comments  and  criticisms  of  Rick  Cervin, 
Jaeohk  Cho,  Georgia  Green,  Hans  Henrich  Hock,  Jerry  Morgan,  Arnold  Zwicky, 
and  an  anonymous  referee  who  have  seen  earlier  versions  of  this  paper.  Any 
errors  of  fact  or  interpretation  are,  of  course,  my  own. 

All  Sanskrit  words  are  represented  in  their  'pre-pausal'  form,  inter-word 
sandhi  having  been  undone.  Glosses  include  only  information  relevant  to  the 
text-context. 

■"Zwicky  1986,  p.c.  has  suggested  that  such  annotations  might  be  neces- 
sary even  within  a  GPSG  framework,  though  strictly  on  a  local  basis. 

2Cf.  Ford,  et  al.  1982:772:  'In  a  simple  sentence  [by  which  I  assume  they 
mean  a  single-predicate  clause,  without  embedding],  an  argument  of  the  lexical 
form  of  a  verb  is  satisfied  by  any  surface  category  that  has  the  function  specified 
for  that  argument,  regardless  of  the  surface  position  of  that  category. 

3|t  should  be  noted  that,  in  general,  both  the  SLASH  mechanism  in  GPSG 
and  the  double-arrow  convention  in  LFG  were  developed  to  account  for 
UNBOUNDED  dependencies.  There  is,  however,  some  question  as  to  whether 
unbounded  dependencies  are  possible  in  Sanskrit.  Unbounded  dependencies 
presuppose  the  possibility  of  unlimited  embedding  of  clauses.  But  I  have  so  far 
seen  no  good  evidence  in  Sanskrit  of  embedded  finite  clauses.  Sanskrit  (at 
least  Vedic  Sanskrit)  does  recognize  lexically  and  phonologically  the  distinction 
between  (semantically)  independent  and  subordinate  clauses.  But  embedding, 
a  normal  strategy  for  'clause-union'  among  modern  Western  European  lan- 
guages, is  at  best  a  marginal  strategy  in  Sanskrit;  finite  clauses  are  almost 
never  embedded.  Sanskrit  does  have  various  kinds  of  non-finite  'reduced' 
clauses,  which  can  be  embedded  in  finite  clauses  (cf.  Hock  1985).  These 
appear  to  be  islands  in  some  cases  but  not  in  others,  implying  that  the  syntax 
treats  them  as  either  Ss  (islands)  or  NPs  (non-islands). 

'^I  shall  for  the  purposes  of  this  paper  ignore  the  nevertheless  serious  the- 
oretical question  of  the  true  nature  of  adverbs  as  a  syntactic  category. 

5We  cannot  subsume  all  of  the  NP  cases  together  while  excluding  the 
Nominative  case.  Furthermore,  if  lexical  adverbs  are  assumed  to  be  [+N,  +V, 
+ADV]  while  adpositions  are  assumed  to  be  [-N.-V]  (cf.  Gazdar  et  al.  1985:21, 
24),  we  cannot  subsume  AdvPs  and  PPs  into  one  clearly  defined  category. 
Therefore,  explicit  reference  to  each  of  these  categories  would  require  that 
every  category  that  admits  of  'adverbial'  daughters  should  have  eight  distinct  ID 
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rule  schemata  associated  with  it,  one  for  AdvP  adverbials.  one  for  PP  adverb- 
ials,  and  one  each  for  the  six  adverbial  NP-cases.  Even  if,  as  seems  arguable, 
adverbs  and  adpositions  in  Sanskrit  should  be  subsumed  into  one  lexical 
category  type,  this  would  reduce  the  necessary  schemata  only  by  one. 

spor  the  details  of  control  and  agreement  in  GPSG,  see  Gazdar  et  al.  1985: 
83-98.  For  LFG,  note  the  definition  of  'head'  in  Bresnan  1982:296,  especially 
the  assumption  that  'the  functional  features  of  a  phrase  are  identified  with  those 
of  its  head'.  And.  with  regard  to  agreement  in  LFG,  cf.  Kaplan  &  Bresnan  1982: 
183-215. 

^Schaufele  1985  consists  of  a  more  detailed  presentation  of  the  hypothes- 
is presented  in  this  section. 

^Kendall  &  Yoon  1986:54  describe  similar  elements  in  other  languages  as 
'morphemes  whose  sole  function  is  to  indicate  speech  act  properties'. 

9As  can  be  seen  in  (1 1  ),(1 2),  and  (1 4),  PTCLs  can  'pile  up'  to  form  a  'PTCL 
string'.  The  ordering  of  particles  within  the  string  appears  to  be  controlled  by 
phonological  considerations  which  will  not  concern  us  here;  cf.  Hock  1982:11. 
Syntactically,  it  is  the  PTCL  string  itself  that  is  important. 

lOProulx  (1988)  has  reconstructed  two  double  sets  of  demonstratives  for 
Proto-Algonquian.  It  is  not  clear  whether  the  distinction  between  them  is  like 
that  in  Sanskrit. 

■'■'Some  clitic  pronouns,  such  as  asya  or  asmai,  are  homonymous  with 
IDEMs,  at  least  in  the  sense  of  having  the  same  sequence  of  phonemes.  How- 
ever, they  differ  from  these  in  two  respects:  (a)  they  are  accentless.  and  (b)  they 
behave  syntactically  and  semantically  as  personal-pronominal  NPs,  rather  than 
as  NP-determiners  or  -modifiers  or  locational  adverbs. 

■•^Note  that  the  hypothesis  presented  in  1.2.3.  concerning  toplcalization 
and  PTCL  placement  would  assign  to  (27)  a  structure  such  as  (27),  in  which  the 
clitic  pronoun  asmai  ends  up  as  the  leftmost  daughter  of  its  mother.  Since 
phonological  constraints  forbid  clitic  pronouns  to  appear  in  leftmost  position, 
this  can  only  be  an  adequate  description  of  the  true  grammar  if  such  intra- 
clausal  constituent  boundaries  are  invisible  to  the  phonological  component. 

(27')  [vrtramj      hi  [asmai  etat'    [[[tj  jaghnuse]vp/N]NP/N 

'V.'-A    PTCL-S  3s.pro-DATcl.  dDEM-A  'having  sIain'-D(part.) 

apyayanam'  akurvan]vp]s/N]s 

'envigorating  draught'-A        'prepared' 

'For  him,  (the  one)  having  slain  Vrtra,  they  prepared  this  envigorating 
draught.'  (^B  1.6.4.12) 

iSThis  problem  does  not  arise  for  AFPs  which  have  adverbial  functions 
(which  in  any  case  would  be  generated  as  instantiations  of  the  category  Adv), 
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since  they  are  not  controlled  by  any  NPs.    Such  AFP  adverbials  will  therefore 
not  be  at  issue  here. 

■''^Though  it  might  be  difficult  to  make  a  hard-and-fast  case  for  gender 
being  a  semantic  feature  in  Sanskrit.  Like  German  and  French,  Sanskrit  has  a 
grammatical  gender  system  that  does  not  always  bear  any  obvious  relation  to 
'natural'  or  'semantic  gender".  And  it  can  happen  that  words  of  different  gen- 
ders can  refer  to  the  same  real-world  entity.  Consider  e.g.  the  words  ap  (fem.) 
and  udakam  (neut.),  both  meaning  'water'.  The  former  presents  some  extra 
problems  with  regard  to  the  assumption  that  even  grammatical  number  is  sem- 
antic in  nature,  since  it  is  in  Vedic  normally  (and  in  the  Classical  language 
always)  used  in  the  plural. 

■"SFor  instance,  in  (36)  the  fact  that  a  genitive  demonstrative  tasya  is  ac- 
ceptable but  not  a  locative  tasmin  is  presumably  due  directly  or  indirectly  to  the 
fact  that  the  adposition  avasfar  subcategorizes  for  a  genitive  rather  than  a  locat- 
ive complement. 

iscf.  above,  1.2.1.2.  for  discussion  of  adverbials  and  (4)  for  the  ID  rule 
schema  introducing  VP  adverbials. 

■•^Or,  conversely,  to  have  determiners  extracted  and  then  have  AFP  feat- 
ures instantiated  on  them,  similarly  to  the  GPSG  analysis  of  WH-fronting  in 
English;  cf.  Gazdar  et  al.  1985:153-158. 

■"SExamples  of  identical  double  determiners  such  as  (29)  are  rare  in  the 
Vedic  corpus,  and  those  that  occur  are  ambiguous  with  regard  to  fronting,  since 
as  in  (29)  the  determiners'  controlling  nominal  tends  to  follow  the  determiners 
immediately  in  linear  order.  While  I  have  so  far  found  no  attested  strings  like 
(45),  having  identical  double  determiners  fronted  away  from  their  controlling 
nominal  (and  therefore  not  only  do  I  have  to  make  up  an  example  such  as  (45) 
but  offer  (46a)  as  a  possibly  acceptable  alternative),  I  can  confidently  state  that 
strings  such  as  (46b)  are  unattested  and  therefore  presumably  ungrammatical. 

i9This  hypothesis  may  actually  create  worse  problems  with  regard  to  the 
syntax  of  clitic  pronouns,  given  that  these  are  obligatorily  fronted  in  Vedic  Prose 
(cf.  2.2.1.2.),  and  therefore  would  have  to  be  extracted  from  their  'natural' 
mothers  under  all  circumstances. 

20Note  that  we  still  need  a  category  'Det'  under  NP,  to  cover  determiners, 
which  are  not  fronted.  It  should  also  be  noted  that,  given  the  apparent  obligat- 
oriness of  clitic-fronting  in  Vedic  Prose,  the  grammar  should  also  guarantee  that 
all  clitic  pronouns  are  generated  under  S,  even  though  they  may  function  sem- 
antically  as  verb-  or  adposition-complements  or  as  genitive  modifiers  to  NPs. 

21 A  somewhat  different,  and  much  more  elegant,  description  is  possible 
within  HPSG  (cf.  Pollard  &  Sag  1987).  The  analysis  presented  here  within 
GPSG  is  not  to  be  understood  as  representative  of  all  GPSG-derived  frame- 
works. I  hope  to  address  the  question  of  an  HPSG  analysis  of  Vedic  pronom- 
inal fronting  in  the  near  future. 
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22presumably  AFPs  can  be  assumed  to  be  marked  with  an  AGR  feature 
into  which  the  relevant  information  can  be  copied.  I  will  not  go  into  detail  on  this 
(cf.  Gazdar  et  al.  1985:83-94  for  the  Control  Agreement  Principle  and  the  oper- 
ation of  AGR  features),  but  merely  point  out  that  the  problem  is  getting  the  relev- 
ant features  onto  a  sister  (or  mother)  of  the  AGR-marked  AFP. 

23zwicky  (1986:§6)  has  independently  recommended  that  GPSG  adopt  at 
least  a  limited  annotation  of  NPs  for  GF. 

24a  'set-valued  feature'  is  rather  like  a  'list-valued  feature'  in  HPSG 
except,  of  course,  that  it  is  unordered. 

25;  distinguish  between  an  'unspecified'  feature,  i.e.  a  feature  for  which  no 
value  is  specified,  and  a  feature  for  which  the  value  NULL  is  explicitly  specified. 
I  assume  that  all  syntactic  NPs,  including  NPs  which  are  themselves  phonolog- 
ically  null,  have  values  for  the  features  GEND  and  NUM  specified.  This  may  be 
an  unwarranted  assumption  and  a  strike  against  this  hypothesis;  cf.  2.6. 

26The  proposal  presented  in  2.4.2  has  been  criticized  for  leaving  out  of 
consideration  NPs  embedded  in  other  NPs,  which  would  complicate  the  details 
of  the  RELAT  features  for  the  NP  mothers  to  the  point  of  unwieldiness  if  not 
indescribability.  For  this  reason,  this  proposal  almost  certainly  would  not  work 
except  for  languages  with  little  or  no  embedding  in  NPs.  As  noted  earlier  (note 
3),  there  is  not  as  much  embedding  in  Sanskrit  as  in  English.  However,  there  is 
some  (note  that  all  versions  of  ID  rule  schema  (3)  include  genitive  NP  modifiers 
dominated  by  NP),  which  is  a  further  reason  to  suspect  that  inthe  end  this  pro- 
posal will  prove  inadequate. 

27See  Kaplan  &  Bresnan  1982:191-193,  274  for  the  definition  of  the 
•merge'  operation,  and  183-215  generally  for  the  conditions  on  well-formedness 
off-structures. 

28Cf.  Grimshaw  1982:  96-97,  Kaplan  &  Bresnan  1982:  181,  203-207. 
Neidle  1982:  399.  Kaplan  and  Bresnan  call  it  the  'Uniqueness  Condition',  but  I 
find  the  name  'consistency  condition'  better  reflects  the  gist  of  it.  In  a  way,  the 
consistency  condition  is  the  LFG  equivalent  of  the  constraint  in  GPSG  that  pre- 
vents two  conflicting  values  for,  say,  NUM  to  be  instantiated  on  one  category. 
The  crucial  difference  is  that,  like  most  well-formedness  conditions  in  LFG,  the 
consistency  condition  is  defined  on  f-structure. 

29|n  order  partly  to  make  any  type-restrictions  on  determiners  explicit  in 
analyses,  I  will  define  the  feature  'TYPE'  for  each  DET,  taking  values  such  as 
REL,  dDEM-cat.,  dDEM-an.,  IDEI^,  or  INT.  A  stipulation  may  still  be  necessary 
to  insure  that  determiners  of  identical  type  that  are  'sisters'  in  f-structure  must 
also  be  sisters  (barring  topicalization)  in  c-structure.  There  are  many  details  of 
the  constraints  on  double  determiners  in  Vedic  that  merit  further  research. 
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etaf  ha  va(     devSh      vratAm        caranti 

dDEM-As.   PTCL-S   'gods'-Np.   Vow'-As.  'undertake' 
'The  gods  undertake  this  vow.'  (SB  1.1.1.5) 
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evam   eva        esah       elam  pBpmane  dvlsate  bhrlllrvySya      vajram      ul-yacchali 
"thus"  PTCL-P  dDEM-N  dDEM-A  'evil'-D  "spiteful'-D      'foe'-D      'thunderbolf-A    'raises  at* 

Thus  he  raises  this  thunderbolt  against  the  evil,  spiteful  foe'      (SB  1.2.4.3) 
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elam         u         eva         esah        etasmai  v'snuh      yajnah  vikrSnUm      vf-kramati 

dDEM-Af.PTCL-SPTCL-PdDEM-Nm.dDEM-Dm:V.'-Nm.'sacrifice"-Nm.Victorious  step'-Af.  'strides' 
This  sacrifice,  Visnu,  strides  this  victorious  step  for  him."  (SB  1.1.2.13) 
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etSm       vol      etSm    prastSvena  ream     gpnoti 
dDEM-Af.  PTCLdDEM-Af.     "P.'-lm.      T'-Af. "obtains" 
"This  re  he  obtains  by  means  of  the  prastava."      (cf.JUB  1.4.3.3) 
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THE   DISCOURSE   FUNCTION   OF  THE   ABSOLUTIVE   IN   THE 
PANCATANTRA* 


Sarah  Tsiang 


In  this  paper  I  will  provide  evidence  for  a  stylistic  explanation  for 
the  frequent  use  of  the  absolutive  in  the  genre  of  fable  literature.  I  will 
argue  that  the  reason  the  absolutive  occurs  so  frequently  in  the 
Pancatantra  is  that  it  is  an  extremely  useful  vehicle  for  the  rhetorical 
organization  that  typifies  the  text.  In  so  doing  I  will  contribute  to  a 
better  understanding  of  the  Sanskrit  absolutive  and  will  provide 
further  evidence  for  the  importance  of  stylistic  investigations  for 
historical  syntactic  research. 


1.0   Introduction 

The  purpose  of  this  paper  is  to  relate  the  frequent  appearance  of  the 
absolutive  in  the  Pancatantra  to  its  discourse  function.  Absolutives  (also  called 
gerunds)  are  a  prominent  feature  of  later  Sanskrit.  Gonda  notes  that  by  the 
post-Vedic  period,  the  absolutive  is  in  frequency  'the  main  device  for  forming 
complex  sentences'  (1971:136).  Whitney's  comments  about  the  absolutive 
suggest  that  its  appearance  may  in  fact  be  too  frequent.  Comparing  the 
occurrence  of  the  tva/ya  -gerund  in  the  Story  of  Nala  in  the  Mahabharata  with 
its  occurrence  in  the  Bhagavad-Gfta,  he  remarks:  'The  use  of  the  gerund, 
though  not  changing  in  its  character,  becomes  much  more  frequent,  and  even 
excessive  in  the  later  language'  (1879:358).'' 

As  in  other  cases  of  historical  syntax,  explanations  for  the  increased  use  of 
the  absolutive  may  come  in  two  varieties.  On  the  one  hand,  the  increase  may 
be  attributed  to  a  general  historical  drift  which  in  turn  may  be  attributed  to 
various  factors.  On  the  other  hand,  it  may  be  correlated  with  textual  differences. 
Both  types  of  phenomena  can  be  observed  for  other  Sanskrit  constructions. 

Thus,  the  ever  increasing  use  of  the  ta  -participle^  can  be  explained  as  a 
'simplification'  of  the  past  tense  system,  replacing  in  effect  three  distinct  tense 
formations,  the  aorist,  the  perfect,  and  the  imperfect.  And  as  Hock  (1986:431-2) 
notes,  the  change  may  have  been  further  motivated  by  convergence  with 
Middle  Indo-Aryan. 

A  second  type  of  argument  for  the  increased  use  of  a  particular 
construction  looks  at  the  question  text  by  text,  or  genre  by  genre,  linking  the  use 
of  the  construction  to  its  discourse  function  in  different  types  of  texts.  This  is  the 
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approach  in  Jacobi  1903,  which  links  nominal  style  to  scientific  writing.  Gonda 
(1942)  associates  the  (non-obligatory)  appearance  of  1st  and  2nd  person 
subject  pronouns  with  their  role  as  marking  affective  or  'colloquial'  speech. 
Thus  kuru  tvam  'you  do  it'  is  considered  more  'freundlich'  (friendly)  and  'auf- 
munternd'  (cheerful)  than  just  kuru  'do  if  (1942:122).  He  concludes  that  the 
seemingly  redundant  1st  and  2nd  person  pronouns  in  contexts  like  this  are 
'indeed,  irrespective  of  chronology,  frequently  added  in  a  variety  of  affective  or 
more  or  less  "colloquial"  utterances'  (1971 :123). 

Now  it  is  true  that  the  use  of  the  Sanskrit  absolutive  increased  over  time. 
Gonda  notes  that  the  absolutive  "has  in  the  oldest  texts  not  yet  fully  developed' 
(1971 :135).  The  question  then,  is  how  did  the  language  change  to  give  a  more 
prominent  role  to  the  absolutive?  Tikkanen  argues  that  the  possible  contexts 
for  the  absolutive  increased  as  a  result  of  changes  in  the  language. 
Specifically,  its  functional  potential  increased  'due  to  the  relaxation  of  its 
syntactic  (constructional),  semantic  (temporal),  and  pragmatic  (coreferential  and 
operational)  constraints'  (1987:  abstract).  Though  it  is  clear  that  the  absolutive 
is  found  in  'new'  contexts,  Tikkanen's  explanation  for  this  expansion,  namely 
that  the  Sanskrit  construction  converged  with  the  Dravidian  past  verbal 
participle  construction  can  be  questioned;  cf.  Hock  1984  and  forthcoming. 
Jeffers/Kantor  (1984)  advance  a  different  argument  for  the  expanded  function  of 
the  absolutive,  but  Tikkanen  (1987:71-4)  considers  their  explanation  doubtful. 
Moreover,  discussions  like  these  leave  unanswered  the  question  of  why  the 
absolutive  construction,  and  not  some  other  construction  is  selected  for  the  new 
contexts  in  the  first  place. 

In  the  present  study,  I  will  look  at  a  particular  genre  of  texts  with  frequent 
use  of  the  absolutive  to  see  if  there  is  any  discourse-level  motivation  for  its 
heavy  use,  and  whether  that  motivation  is  different  from  the  one  advocated  by 
Gonda  (1971)  and  Renou  (1956a).   I  will  do  so  by  looking  at  the  distribution  of 
the  absolutive  in  selected  texts  from  the  Sanskrit  fable  literature. 

2.0  Accounts  of  the  absolutive 

2.1  Formal  accounts 

The  absolutive  is  a  non-finite  adverbial  form  of  the  verb  which  expresses  a 
subordinate  action  antecedent  to  (or  simultaneous  with)  the  action  of  the  main 
verb  as  in  the  following  examples: 

(1)  nagaram  gatva       devadattah    katam       karoti 
city-acc.    go-abs.     D.-nom.         mat-acc.  do-3  sg. 
'Having  gone  to  the  city,  Devadatta  makes  a  mat.' 

(2)  nagaram  upagamya     devadattah    hasati 
city-acc.    appr.-abs.      D.-nom.  laugh-3  sg. 
'Approaching  the  city,  Devadatta  laughs.' 

The  absolutive  has  been  discussed  in  much  detail  in  the  many  excellent 
Sanskrit  grammars,  beginning  with  Panini's  AstadhyayF  and  continuing  through 
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more  recent  works  such  as  Whitney  1879,  which  describes  absolutive 
constructions  mainly  from  a  morphological  point  of  view,  and  Speijer  1886  and 
Delbrijck  1888,  which  describe  them  from  a  syntactic  point  of  view.  Works 
which  specifically  treat  the  absolutive  in  some  detail  include  Hock  1987  and 
Tikkanen  1987.  Tikkanen  1987  is  a  monumental  work,  describing  in  detail,  with 
illustrative  examples,  every  aspect  of  the  absolutive  construction,  from  its 
historical  development  and  syntactic  function  to  its  use  and  interpretation  in 
Sanskrit  texts. 

While  formal  descriptions  investigate  the  contexts  in  which  a  particular 
construction  can  appear,  and  by  illustrative  excerpts  show  where  that 
construction  does  appear,  such  descriptions  cannot  account  for  why  one 
construction  rather  than  another  with  similar  grammatical  function  appears.  In 
this  respect,  it  Is  important  to  point  out  that  there  are  other  means  of 
subordination  available  in  Sanskrit  grammar.  For  example,  all  of  the  examples 
in  (3)  use  different  subordinating  constructions,  or  consecutive  sentences  to 
express  the  same  general  idea. 

(3)       a.  devattah    nagaram    gatva   kat.arn    karoti     (absolutive) 

b.  devattah    nagaram    gatah    katam    karoti     (participle) 

c.  yada  devattah  nagaram  gatah  tada  katam  karoti    (rel.  clause) 

d.  devattah    nagaram    gatah  |  sa    katam    karoti     (2  sentences) 
'When  Devadatta  has  gone  to  the  city,  he  makes  a  mat.' 

A  full-scale  investigation  of  the  use  of  the  absolutive  would  compare  the 
use  of  the  absolutive  to  all  other  means  of  subordination. 3  Such  a  study  would 
offer  insight  about  the  'preference'  of  the  absolutive  among  competing 
alternatives.  In  light  of  the  claim  that  the  absolutive  is  probably  the  primary 
means  of  subordination  in  later  Sanskrit  (cf.  Gonda  1971:136;  Jeffers/Kantor 
1984:96),  this  investigation  will  focus  only  on  the  frequency  of  the  absolutive. 
And,  in  order  to  keep  the  investigation  in  manageable  proportion,  I  will  limit 
myself  to  comparing  the  frequency  of  the  absolutive  in  two  types  of  discourse 
found  within  the  Sanskrit  fable  literature. 

2.2  Discourse-oriented   accounts 

In  addition  to  discussing  the  formal  aspects  of  the  absolutive,  Tikkanen 
and  other  scholars  also  consider  'context'.  But  they  do  so  only  in  a  very  narrow 
sense,  in  that  context  is  understood  as  the  sentence  or  sentences  relevant  to 
the  correct  interpretation  of  the  absolutive.  For  example,  Tikkanen  looks  at 
context  in  this  narrow  sense  in  order  to  determine  whether  the  action  of  the 
absolutive  is  completed  before  the  action  of  the  main  verb  begins.  This  idea  of 
clarifying  context  is  important,  since  in  the  absence  of  more  information,  a  single 
construction  may  have  many  possible  readings  as  illustrated  in  the  following 
example  from  Tikkanen  (1987:7): 


166 

(4)    indram     arabhya        cara 

I. -ace.       grasp-abs.     go-imper. 

a.  'Having  taken  hold  of  Indra,  move!"  / 
'Move  after  taking  hold  of  Indra!" 

b.  Take  hold  of  Indra  and  move!' 

c.  'Take  hold  of  Indra  before  moving!" 

d.  'Move  by  taking  hold  of  Indra!' 

e.  'Go  to  take  hold  of  Indra!' 

f.  'Keep  yourself  to  Indra!" 

But  in  spite  of  some  general  remarks  about  genre  differences,  Tikkanen  is 
basically  concerned  with  the  sentential  'meaning"  or  'function"  of  the  absolutive. 
He  does  not  examine  the  function  of  the  absolutive  in  the  discourse  structure  of 
entire  texts. 

Jacobi  (1903)  looks  at  examples  from  texts  that  are  considered  "scientific", 
and  claims  that  the  "nominal  style",  characterized  by  the  sparing  use  of  finite 
verbs,  is  appropriate  for  conveying  abstract  ideas.  Renou  (1956a),  elaborating 
on  Jacobi's  ideas,  argues  that  in  this  nominal  style,  the  absolutive  functions 
prominently.  Now,  it  is  true  that  absolutives  are  non-finite,  and  thus  more 
"nominal"  than  finite  verbs.  However,  as  this  paper  shows,  at  least  in  the  fable 
literature,  absolutives  are  eminently  'verbal"  or  action-oriented  rather  than 
nominal  or  abstract  state-oriented.  Moreover,  though  absolutives  do  not  take 
verbal  Inflection,  they  show  verbal  syntax  in  so  far  as  objects  take  the 
accusative  case,  and  not  nominal  syntax  (where  objects  could  take  the  genitive 
case).  The  difference  between  verbal  (5)  and  nominal  syntax  (6)  is  illustrated  in 
the  sentences  below: 

(5)  yatah  devadattah  katan        karoti    tatah  yajnadattah  modate 
since  D.-nom.        mats-acc.  do-3sg.  so      Y.-nom.  please-3sg. 
'Since  Devadatta  makes  mats,  Yajnadatta  is  happy.' 

(6)  a.     devadattasya  katakaratvat  yajiiadattah  modate 

D.-gen.  mat-makerhood-abl.     Y.-nom.         please-3sg. 

'Because  Devadatta  is  a  mat-maker,  Yajnadatta  is  happy.' 
b.      devadattasya  katanam     karanat        yajnadattah  modate 
D.-gen.  mat-gen.    making-abl.  Y.  please-3sg. 

"Because  of  Devadatta's  making  of  mats,  Yajnadatta  is  happy.' 

Finally,  as  Tikkanen  has  observed,  there  are  many  grammatical  treatises 
such  as  the  Astadhyayf  which  are  clearly  scientific  and  abstract,  yet  do  not 
contain  many  absolutive  constructions.  Renou's  nominal-style  argument 
therefore  does  not  contribute  much  to  a  better  understanding  of  the  absolutive. 

Gonda  (1971)  considers  the  differing  uses  of  a  variety  of  constructions  in 
different  Sanskrit  texts  and  types  of  Sanskrit  texts  in  his  discussion  of  "Syntax 
and  style"  (109-60).  He  discusses  the  avoidance  of  the  absolutive  in  the  longer 
sentences  of  the  Rig  Veda  and  ^atapathabrahmana,  and  its  absence  in  the 
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ancient  narrative  style,  but  notes  that  'the  authors  of  sutras  and  ritual  chapters  of 
the  brahmanas  readily  resort  to  it  exactly  to  indicate  the  chronological 
succession  of  the  ritual  acts  etc'  (136). 

The  use  of  absolutives  in  ritual  texts  is  more  systematically  discussed  in 
Renou  1956(a).  Illustrations  which  show  the  use  of  the  absolutive  to  briefly 
describe  and  focus  on  the  sequencing  of  events  in  a  ritual  may  be  found  in  the 
following  examples: 

(7)  tad    dhutvl      atha  anyam       dugdhva  punah        hotavyam 
so     offer-abs.  another     milk-abs.  again        offer-gdv. 
'So,  having  offered,  and  then  having  milked  another  (cow),  one 
should  offer  again.'   (KS  35.19) 

(8)  sa  vai        sammfjya  sammfjya      pratapya    pratapya  prayacchati 
he  brush-abs.  heat-abs.  hand-3sg. 
yatha        avamarsam       nirnijya             anavamarsam  uttamam 
as             touch                 rinse-abs.          not  touch  last 
pariksalayed     evam  tat  tasmat       pratapya  pratapya  prayacchati 
rinse                                 so             heat-abs.  hand-3sg. 
'Each  time  he  has  brushed  and  heated  (a  spoon),  he  hands  it  over. 
This  is  as  if  after  having  rinsed  (the  eating  vessels)  while  touching 
them,  one  would  rinse  the  last  one  without  touching  it.  For  this 
reason  he  hands  over  each  (spoon)  after  heating  it.'  (SB  1.3.1.8) 

A  more  general  usage  of  the  absolutive,  noted  by  Renou  and  Gonda,  is  its 
use  as  a  sentence  connector.  Absolutives  are  an  excellent  means  for  this 
function  because  they  can  link  sentences  by  summarizing  the  action  described 
in  the  preceding  sentence  in  a  short  phrase  introducing  the  next  sentence,  see 
for  instance  example  (15a)  below.  However,  this  usage  is  not  particularly 
dependent  on  discourse  type  and  is  found  throughout  the  classical  literature. 

3.0  Texts  to  be  examined 

The  Pancatantra  is  a  fable  collection  organized  as  a  series  of  stories  told 
by  the  sage  Visnusarman  to  the  ignorant  three  sons  of  King  Amarasakti  in  order 
to  instruct  them  in  nftisastra  (proper  conduct).  The  stories  in  each  of  the  five 
books  which  make  up  the  Pancatantra  are  linked  together  by  a  framestory, 
which  introduces  the  sage  and  the  King  and  sets  up  the  context  for  story-telling. 
Visnusarman's  stories  are  organized  as  five  books,  each  with  a  different  theme 
and  framestory.  The  end  of  each  story  either  introduces  a  new  story,  or  returns 
to  the  framestory,  with  the  result  that  many  of  the  stories  are  nested  within  other 
stories,  or  within  one  of  the  levels  of  framestory.  The  fables  are  written  in  prose 
with  verses  intercalated  throughout. 

I  have  also  examined  some  of  the  fables  in  their  HitopadeSa  versions.  The 
HitopadeSa  is  a  later  collection  based  on  the  Pancatantra,  also  written  mainly  in 
prose,  which  contains  versions  of  some  of  the  Pancatantra  fables  as  well  as 
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new  ones.   The  Hitopadesa  has  a  similar  framelike  organization,  with  a  parallel 
situation  of  a  sage  telling  practical  stories  to  young  princes. 

4.0  A  methodology  for  looking  at  single  texts 

A  priori  it  may  appear  difficult  to  say  anything  meaningful  about  the  usage 
of  a  particular  construction  by  looking  at  one  text  only.  However,  there  is  a 
methodology  which  has  been  used  in  other  discourse-oriented  studies  and 
which  makes  possible  an  intratextual  comparison  by  contrasting  stylistically 
different  sections  of  a  single  text,  specifically,  speech  passages  vs.  narrative 
passages.  The  idea  behind  this  method  is  that  if  there  is  a  difference  in  the  use 
of  a  particular  construction  which  corresponds  to  a  difference  in  discourse  type, 
then  there  must  be  a  discourse-level  explanation  for  its  appearance  or  absence. 
That  is,  it  must  have  a  discourse  function  of  some  kind. 

Application  of  the  method  will  consist  of  three  steps.  First,  I  will  establish 
by  independent  evidence  that  there  are  textual  differences  in  the  text.  Next,  I 
will  look  at  the  distribution  of  the  absolutive.  Finally,  I  will  look  for  a  correlation 
between  this  distribution  and  the  textual  differences. 

This  method  of  dividing  a  text  into  subparts  has  been  used  in  several 
studies  of  Sanskrit  texts.  Studies  which  divide  the  text  or  texts  under  consid- 
eration into  their  narrative  and  direct  speech  passages  include  Sohnen  1979 
for  the  Ramayana,  Hock  1982(a)  and  Jamison  (To  appear)  for  Vedic  prose,  and 
Wallace  1984  for  the  Vetalapancavimsati.  While  Sohnen  was  not  primarily 
concerned  with  grammatical  constructions.  Hock,  Wallace,  and  Jamison  all 
present  discourse-level  explanations  for  grammatical  facts.  Their  findings  are 
summarized  below. 

Hock  1982(a)  is  a  study  of  Vedic-Prose  formulae.  In  this  study.  Hock 
argues  that  the  verb  appears  in  second  position  in  phrases  like  sa  hovSca 
gargyah  'then  G.  said  ..."  because  of  discourse-based  fronting,  not  because  the 
verb,  as  a  kind  of  clitic,  occupies  second  position  according  to  clitic  placement 
rules. 

Jamison  (To  appear)  finds  that  the  referential  sa/tad  pronouns  are 
preferred  in  narrative  passages,  while  the  locational  demonstratives  ayam/asau 
are  preferred  in  speech  passages.  To  explain  this  distribution,  she  argues  that 
while  the  deictic  value  of  specifiying  'this  one'  or  'that  one'  can  be  fully  utilized 
in  a  speech  situation,  sa/tad  forms  have  a  discourse  function  of  relating 
sentences  and  referents  to  each  other  and  introducing  or  emphasizing  key 
points.  Her  detailed  discussion  of  the  distribution  of  the  particles  vai  and  eva 
also  offers  a  discourse-level  explanation,  namely  that  through  time,  these 
particles  were  reanalyzed  as  discourse  markers. 

Wallace  (1984)  links  the  choice  of  word  order  in  P(atient)-oriented 
constructions^  to  the  conversational  tone  of  the  expressions  used  in  a  particular 
text,  the  Vetalapancavimsati.  The  unmarked  order  of  such  constructions  is 
Agent-Object-Verb  (AOV);  cf.  Hock  1982(b).    Wallace  finds  that  the  marked 
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Object-Agent-Verb  (OAV)  word  order  occurs  significantly  more  frequently  in  P- 
oriented  constructions  occurring  in  direct  speech  passages  than  in  P-oriented 
constructions  occurring  in  narrative  passages.  He  attributes  this  to  the  use  of 
agent-backing  as  a  politeness  strategy  in  direct  speech,  as  illustrated  in  one  of 
his  examples  of  OAV  order: 

(9)  tvam         maya       adya  khaditavyah 
you-acc.   me-inst.    today  eat-gerundive 
'I  must  eat  you  today.'  (V) 

The  significance  of  Wallace's  investigation  lies  in  his  presentation  of  a 
plausible,  discourse-level  explanation  for  the  occurrence  of  marked  OAV  order. 

5.0  Differences  between  narrative  and  direct  speech  passages 
in  fable  literature 

Narrative  and  direct  speech  passages  differ  significantly  with  respect  to 
sentence  variety,  sentence  combination,  word  order,  pronominal  usage,  and  the 
distribution  of  verbal  forms.  These  differences  are  briefly  summarized  in  the 
following  sections. 

5.1  Sentence   variety 

Both  types  of  passages  contain  short,  simple  sentences,  though  this  type 
of  sentence  is  much  more  common  in  direct  speech  passages.  Direct  speech 
sentences  contain  an  average  of  slightly  more  than  one  verbal  form  per  clause, 
while  narrative  sentences  contain  an  average  of  nearly  two  verbal  forms  per 
clause.  (A  complete  listing  of  the  verbal  forms  is  given  in  section  5.5). 

Simple  sentences  with  an  unspecified  copula,  illustrated  in  (10)  occur 
almost  exclusively  in  direct  speech  passages: 

(10)  na  ayam  sva  |  yajnacchagah     ayam 
not           this      dog     sac-goat  this 

This  (is)  not  a  dog.  This  (is)  a  sacrificial  goat.'    (H  4.10) 

Questions  hardly  ever  occur  in  the  narrative,  but  are  common  in  direct  speech 
where  they  generally  have  a  transitional  function  as  illustrated  in  the  following 
examples: 

(11)  a.   katham     etad 

how         that 
•How's  that? 
b.   adhuna    kim      maya        kartavyam 
now         what    by  me       do-gdv. 
'What  should  I  do  now?'  (H  4.2) 
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5.2     Sentence   combination 

Almost  every  sentence  in  the  fables  begins  with  some  sort  of  introduction. 
One  type  of  sentence  which  is  not  generally  introduced  is  the  imperative,  found, 
of  course,  only  in  direct  speech  passages.  In  the  direct  speech  passages,  the 
introductory  phrase  tends  to  get  the  attention  of  the  listener,  as  illustrated  in  the 
following  sentences: 

(12)  a.   Bho     mitra         evam  karisyavah 

O        friend       so       do 

'O  friend,  so  we  two  will  do  it."  (Pill  1.16) 

b.  tat  sarnpratam    ksamasva  iti 

therefore  now  forgive 

Therefore  now  forgive  ..."  (PI  3.6) 

Occasionally,  reference  is  made  to  the  situation  described  in  a  preceding 
sentence,  using  a  pronoun  as  in  (13): 

(13)  ...    maya  etavantam     kalam       na       jnatam      yad     tvam 

me      this  much      time  not      know       that     you 

atra     vasasi  |    tena         puja         na        krta  | 
here    live  because  honor       no        do 

'...all  this  time  I  did  not  know  that  you  live  here.  Because  of  this  I 
gave  you  no  honor.'  (PI  3.6) 

On  the  other  hand,  most  of  the  introductory  phrases  in  the  narrative 
portions  either  move  the  discourse  along  or  summarize  the  previous  sentence. 
Those  phrases  which  move  the  discourse  along  included  atha  'so'  or  tatah 
'then';  or  an  NP  followed  by  ca  or  api  with  the  sense  of  'and  so';  illustrated  in 

(14). 

(14)  a.  tatah   bakah       cintayam  asa... 

then    heron       reflect       be 
'Then  the  heron  reflected  ...'  (PI  1.6) 
b.  sa       ca       sukhena  sasyam    caritah 
he       and     comfort     grain        go 
•And  so  he  went  (and  fed)  on  the  grain  in  comfort."    (H  3.3) 

Summarizing  expressions  often  use  an  absolutive  construction  (typically  tad 
srutva  'having  heard  this'  or  iti  uktva  'having  said  this*)  or  an  absolute  participle 
construction  (such  as  tatha  anusthite  'with  this  being  done'  or  evam  sati  'it 
being  thus').  These  beginnings  are  illustrated  in  (15). 

(15)  a.   ...   vyakulatvarn       nah     hrdi     vartate  | 

perplexity  our      heart  be 

tad  snjtva     kambugrfvah       aha  ... 
this  hear        K.  say 

'...perplexity  is  in  our  heart.  Having  heard  this,  Kambugrfva 
said  ...'(Pill  1.13) 


171 

b.   maunavratena    sthatavyam ...      tatha  anusthite        gacchata 
silence-vow        keep  this      do  go 

kambugnvena ... 
K. 

'...a  silence-vow  is  to  be  kept...  This  being  done,  by  the  going 
Kambugnva  ...'  (PI  1.16) 
Referential  pronouns  are  sometimes  the  first  element  in  the  sentence,  as  in  (16). 
(16)       tau      ucatuh 
two      say 
The  two  said  ...'       (Pill  1.13) 

5.3  Word  order 

Following  Wallace,  I  compared  the  word  order  for  P-oriented  con- 
structions in  the  direct  speech  portions  and  the  narrative  passages.  I  too  found 
evidence  of  agent-backing  in  the  direct  speech  portions.  For  example: 

(17)     atra     na       asukham  tvaya        karyam 

here    not      unfortunate         you  do 

'Now  you  should  not  do  something  unfortunate.'  (PI  1 .5) 

My  results  for  the  14  constructions  present  in  the  corpus  are  given  in  Table  I.'* 

Direct  Speech         Narrative 
OAV  5  2 

AOV  2  5 

Table  I:    Word  order  in  P-oriented  constructions 

5.4  Pronominals 

Naturally,  1st  and  2nd  person  pronouns  (also  polite  bhavant)  occur  only  in 
the  direct  speech  passages.  Not  including  the  defective  pronominal  type  ena, 
two  types  of  third  person  pronouns  occur,  relative  pronouns  and  deictic 
pronouns.  Among  the  latter,  the  sa/tad  pronouns  refer  to  characters  and  events 
in  the  discourse.  The  esa/etad  pronouns  also  refer  to  characters  or  events  in 
the  discourse,  but  tend  to  have  locational  connotations  of  the  type  'this  one 
here'  or  'that  one  there'.  The  ayam/asau  pronouns  are  locational  only:  'this 
here'  vs.  'that  yonder'.  In  the  direct  speech  portions,  the  referent  or  antecedent 
of  the  pronoun  tends  to  be  in  the  same  clause,  cf.  example  (10).  In  narrative 
portions,  on  the  other  hand,  it  tends  to  be  in  a  preceding  clause;  cf.  (15a).  The 
distribution  of  each  type  of  pronominal  is  given  in  Table  II. 

Rel.  pronouns^  Deictic  pronouns 

discourse  discourse/locational    locational 
(yah)                       (sa/tad)  (esa/etad)        (ayam/asau) 

%  in  Dir.  Sp.     62  20  80  82 

%inNarr.         38  80  20  18 

Table  II:    Distribution  of  pronominals 
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The  higher  incidence  of  esa/etad  and  ayam/asau  fornns  in  direct  speech 
and  their  lower  rate  of  occurrence  in  narrative  passages  is  consistent  with 
Jamison's  findings  for  Vedic  prose.  The  extremely  frequent  occurrence  of  sa 
/tad  pronouns  in  the  narrative  portions  (where  they  account  for  85%  of  all 
pronouns  used  in  narrative)  likewise  is  consistent  with  Jamison's  findings. 

Note  that  relative  clause/main  clause  (RCf^C)  constructions  occur  more 
often  in  the  direct  speech  portions.  Examples  of  this  type  of  construction  are 
given  below: 

(18)     a.   ...    yadi     tvam    uttararin     dadasi      tada    tava     maranam 
if         you     answer    give         then    your    death 
bhavisyati 
will  be 
'...  if  you  answer,  then  you  will  die.'  (H  4.2) 

b.  adya  punah      nagarasamFpe    yah  hradah     tasmin 

today  again       near  the  city       which       lake         to  it 
eva     gacchamah 
so       we  go 
'But  today  let  us  go  to  a  lake  which  is  near  the  city.'  (PI  1 .6) 

5.5     Verbal  forms 

The  complete  distribution  of  verbal  forms  is  given  in  Table  III.  In  the  table, 
'Past'  includes  the  perfect,  imperfect,  and  past  active  and  fa-participle  (when 
used  as  the  main  verb  of  the  sentence);  'Present'  includes  the  present 
indicative  active  and  passive  constructions;  'Absolute  participle'  includes  the 
genitive  and  the  locative  absolute  participle  construction  (cf.  note  3e  and 
example  (15b)  above  for  this  type).  The  figures  listed  show  the  perecentage  of 
occurrence  of  each  verbal  form  out  of  all  verbal  forms  occurring  in  direct  speech 
passages,  and  out  of  all  verbal  forms  occurring  in  narrative  passages. 


%  In  direct  speech 

%  in  narrative 

NON-SUBORDINATING 

past  (finite) 

10 

47 

present  (finite) 

38 

12 

future  (finite) 

14 

- 

imperative  (finite) 

14 

- 

gerundive 

6 

- 

SUBORDINATING 

absolutive 

14 

35 

present  participle 

- 

3 

infinitive 

2 

2 

absolute  participle 

2 

1 

Table  III:  Distribution  of  verbal  forms 
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Not  surprisingly,  the  past  tenso  dominates  in  narrative,  the  present  in 
direct  speech.  Moreover,  inherently  non-narrative  formations  such  as  the  future 
and  the  modal  forms  of  the  imperative  and  (non-finite)  gerundive  are  exclusively 
found  in  direct  speech.  Comparing  the  subordinating  verbal  forms  with  the 
distribution  of  RCMC  constructions  (see  Table  II),  we  can  again  observe  a  clear 
difference  between  narrative  and  direct  speech.  There  is  a  dominance  of 
relative  clauses  in  direct  speech,  while  absolutives  are  heavily  attested  in 
narrative. 

5.6  Summary  of  results 

These  results  are  generally  consistent  with  the  findings  of  Wallace  and 
Jamison,  and  provide  clear  evidence  that  the  narrative  and  direct  speech 
portions  represent  two  distinctly  different  types  of  discourse.  There  is  also  clear 
evidence  that  the  number  of  occurrences  of  the  absolutive  differs  significantly 
in  the  two  types  of  discourse.  As  can  be  seen  from  Table  III,  the  absolutive 
accounts  for  a  much  greater  percentage  of  all  verbs  in  the  narrative  portions 
(35%),  than  in  the  direct  speech  portions  (14%). 

If  only  this  evidence  were  taken  into  consideration  it  might  be  possible  to 
propose  the  following  hypothesis.  Absolutives  occur  more  frequently  in 
narratives  because  narratives  are  more  complex  and  therefore  have  more 
subordination  than  direct  speech.  However,  as  noted  earlier.  Table  II  shows  a 
much  greater  percentage  of  subordination  by  means  of  RCMC  structures  in 
direct  speech.  Moreover,  Table  III  indicates  that  subordination  by  means  of 
absolute  participle  structures  is  about  equally  frequent  in  narrative  and  direct 
speech  (though  both  percentages  are  low).  This  evidence  suggests  that  the 
notion  'subordination'  is  not  sufficient  to  account  for  the  greater  frequency  of 
absolutives  in  direct  speech. 

6.0  The  discourse  function  of  the  absolutive 

6.1  Rhetorical  structure 

A  more  promising  avenue  for  explaining  the  dominance  of  absolutives  in 
narrative  consists  in  considering  rhetorical  structure.  Previous  studies,  such  as 
Kaplan  1966,  Clyne  1982,  as  well  as  Pandharipande  1982  (on  Indian  langu- 
ages), have  shown  that  many  texts  show  a  clearly  discernable  rhetorical 
organization.  TsiangA/Vatanabe  1987  characterizes  the  rhetorical  structure  of 
the  Pancatantra  as  ascending,  in  the  sense  that  important  information  is 
gradually  led  up  to,  as  compared  to  the  rhetorical  structure  of  Aesop's  fables 
which  is  descending,  since  important  events  and  characters  are  first  introduced, 
and  then  elaborated  on  later.6 

The  Pancatantra  is  organized  as  a  series  of  embeddings.  The  outermost 
level  is  the  context  of  Visnusarman  telling  the  fables  to  the  young  princes. 
These  fables  are  organized  into  five  books,  each  having  its  own  framestory. 
Within  this  framestory,  the  individual  fables  are  told  as  illustrations  of  various 
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points.   The  'point'  is  expressed  as  a  verse,  and  this  verse  is  the   moral  of  the 
illustrative  fable,  and  therefore  repeated  at  its  conclusion. 

For  example,  Hitopadesa  4:5  is  embedded  in  Hitopadesa  4:2  in  the 
following  way.  In  4:2,  we  have  the  beginning  of  a  story  about  two  geese 
and  a  tortoise  who  learn  that  fishermen  are  planning  to  kill  them.  As  they 
worry  about  what  to  do,  the  tortoise  quotes  the  following  verse: 
'Forethought  and  Readywit,  these  two  thrive  well.  Whatever  Will  Be, 
perishes'.  Asked  to  explain  the  verse,  the  tortoise  tells  story  4:3  which  is 
about  three  fish  named  Forethought  (anagatavidhata),  Readywit 
(pratyutpannamati),  and  Whatever  Will  Be  (yadbhavisya).  Naturally, 
Forethought  and  Readywit  are  saved,  while  Whatever  Will  Be  perishes. 
(Story  4.4  is  told  by  Readywit  about  a  merchant  and  his  clever  wife,  but  this 
further  complication  is  not  discussed  here.)  The  tortoise  concludes  his 
story  by  repeating  the  first  part  of  the  verse:  'And  so  I  say:  Forethought, 
and  so  forth.'  The  tortoise  and  geese  then  continue  to  deliberate  about 
what  they  should  do,  and  decide  on  a  plan.  The  geese  then  warn  of 
possible  drawbacks  to  the  plan,  and  quote  the  verse:  'A  Wiseman  should 
think  of  a  strategy  and  he  should  think  of  the  danger.  While  the  foolish 
heron  watched,  the  herons  were  eaten  by  the  mongooses.'  Next  4:5  tells 
the  story  of  how  the  herons  were  eaten  by  the  mongooses  (given  below  in 
example  20).  And  so  forth. 

In  addition  to  the  nesting  of  stories  within  stories,  direct  speech  portions 
are  nested  within  the  narrative,  each  speech  passage  being  introduced  by 
phrases  like  'He  said  ...'  and  followed  by  phrases  like  'Having  said/heard  ...' 
This  kind  of  nesting  is  illustrated  in  (19). 

(19)  tena    uktam       aham  tavat         jalasayantaram  gacchami  | 
he       say  I  at  once     another  lake       go 

it!         uktva        sa        hradantaram       gatah 

this      say  he       another  lake       go 

'He  said:  I  will  go  at  once  to  another  lake.  Having  said  this,  he 

went  to  another  lake.'  (H  4.3) 

The  organization  of  the  fables  is  cumulative  and  ascending.  At  the 
sentence  level,  the  introduction  of  significant  characters  and  actions  generally 
comes  toward  the  end  of  the  sentence,  and  at  the  discourse  level,  the  stories 
unfold  gradually  to  their  climax,  the  moral.  The  ascending  features  can  be  seen 
in  Hitopadesa  4:5,  whose  introduction  has  been  described  above. 

(20)  (1 )  asti  uttarapathe  grdhrakutah  nama  parvatah  (2)  tatra  eva  revatTre 
nyagrodhapadape  bakah  hivasanti  (3)  tasya  vatasya  adhastat 
vivare  sarpah  tisthati  (4)  sa  ca  bakanam  balapatyani  khadati 
(5)  tatah  sokartlnam  bakanam  pralapam  _srutva  kena  cid 
vrddhabakena  uktam  (6)  bhoh  (7)  evam  kuruta  yuyam  (8)  matsyan 
anfya  nakulavivarat  arabhya  sarpavivaram  yavad  patiktikramena 
ekaikasah  matsyan  dhatta  (9)  tatah  tad  aharavartmana  nakulaih 
agatya  sarpah  drastavyah  svabhavadvesat  vyapadayitavyah  ca 
(10)  tatha  anusthite'sati  tad  vrttam  (11)  atha  nakulaih  vrksopari 
paksisavakanim  ravah  srutah  (12)  pascat  taih  vrksam  aruhya 
savakah  sarve  eva  khaditah     (H  4.5) 
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(1)  There  is  in  the  north  country  a  mountain  called  Grdhrakuta. 

(2)  There,  in  a  banyan  tree  on  the  bank  of  the  Reva  some  herons 
dwell.  (3),  (4)  In  a  hole  under  the  banyan  tree  is  a  snake  who  eats 
the  young  offspring  of  the  herons.  (5)  Having  heard  the  chatter 
among  the  sorrow-stricken  herons,  an  old  heron  said:  (6),  (7)  Hallo! 
So  ye  should  act!  (8)  Take  some  fish  and  lay  the  fish  out  one  by  one 
in  a  series  of  steps  from  the  mongoose  hole  to  the  serpent  hole. 
(9)  The  snake  will  be  seen  by  the  mongooses  in  search  of  food,  and 
will  be  destroyed  because  of  the  natural  hatred  (between  snakes  and 
mongooses).  (10)  This  having  been  accomplished,  that  came  about. 
(11),  (12)  Then  the  mongooses  heard  the  cries  from  the  young  birds. 
Thereafter,  they  climbed  the  tree  and  ate  all  the  young  birds. 

Notice  how  important  subjects  are  introduced  at  or  near  the  end  of  the 
sentences  as  in  {^)  parvatah  'mountain'  (the  site  of  the  story),  (2)  bakah 
'herons'  (the  first  protagonists),  and  (3)  sarpah  'snake'  (a  second,  but  as  it  turns 
out,  minor  character).  The  major  adversaries  of  the  herons,  the  murderous 
mongooses  are  not  introduced  until  (9),  where  they  appear  in  an  agentive 
phrase.  But,  we  are  given  a  hint  in  (8)  that  there  are  mongooses  by  the  fact  that 
there  is  a  mongoose-hole. 

The  ascending  and  framelike  organization  of  the  fables  can  be 
characterized  as  a  series  of  terraces.  At  the  end  of  each  terrace  is  a  turning- 
point  which  moves  the  discourse  up  to  the  next  level;  from  one  frame  to  another. 
At  the  story-level,  the  turning  point  is  the  verse  which  introduces  the  next  story. 
Within  a  story,  the  turning-point  is  the  main  verb  of  the  narrative  sentences.  A 
survey  of  the  fables  indicates  that  this  verb  is  commonly  a  verb  of  speaking,  or  a 
verb  of  going,  which  moves  the  discourse  to  a  new  type  of  discourse,  or  to  a 
new  setting.  This  structure  can  be  illustrated  with  the  text  of  Hitopadesa  4.10 
given  in  example  (21b).  The  translation  (21a)  is  given  first,  with  the  turning- 
points  marked  in  capitals. 
(21)  a. 

(1)  Once  upon  a  time  in  the  Gautama  wilderness  there  was  a  certain 
brahmin  famous  for  his  sacrifices.  (2)  Having  bought  a  goat  from 
another  village  for  sacrifice,  having  put  it  on  his  shoulder,  going,  he 
WAS  SEEN  BY  A  TRIO  OF  ROGUES.  (3)  Then  these  rogues  thought: 
If  by  some  means  this  goat  is  obtained  and  eaten,  this  will  be  a  good 
plan.  And,  on  the  road  under  three  trees,  in  the  brahmin's  path,  THEY 
SAT  DOWN  AND  STAYED.  (4)  Then  by  one  rogue,  the  going 
brahmin  WAS  ADDRESSED:  (5)  O  brahmin!  (6)  Why  are  you 
carrying  a  dog  on  your  shoulder?  (7)  The  brahmin  SAID:  (8)  This  is 
not  a  dog.  (9)  This  is  a  sacrifice  goat.  (10)  Immediately  afterwards,  by 
the  second  rogue  stationed  at  the  distance  of  a  kos.  The  same 
question  WAS  ASKED.  (11)  Having  heard  this,  laying  the  goat  on 
the  ground,  reflecting  repeatedly,  putting  the  goat  on  his  shoulder 
again,  having  a  wavering  mind,  the  brahmin  STARTED  OFF.  (12) 
Then  continuing  on  his  way,  by  the  third  rogue,  the  brahmin  WAS 
ADDRESSED.  (13)  O  brahmin!  (14)  Why  are  you  carrying  a  dog  on 
your  shoulder?  (15)  Hearing  this,  and  thinking, 'Surely  this  is  a  dog,' 
having  abandoned  the  goat,  having  bathed,  he  WENT  TO  HIS 
HOUSE.  (16)  Having  carried  that  goat  away,  by  the  three  rogues,  it 
WAS  EATEN. 
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(21)  b. 


(1)  asti  gaulamaranye  prastutayajnah         kascid  brahmanah 
be-3  Sg.         in  the  forest                 sacrifice-famous          a  certain         brahmin 

(2)  sa   ca   yajftartham  gramantarat       chagam       upakn'ya      skandhe      krtva 

&  he        for  sacrifice  another  village      goat  buy-abs.        shoulder        do-abs. 

gacchan      DHURTATRAYENA        AVALOKITAH     (3)        tatah     le 
go-pr.  ppl.      by  3  rogues  see-ta-ppl.  then        those 

dhurtah       yadi       esa       chagah        kena  api      upayena      prapya 
rogues  if  this         goat  some  means  obtain-abs. 

khadyate         tada  matiprakarsah    bhavati        iti  alocya         prantare 

eat-3  Sg.-pass.  then  wit-superiority      be-3  Sg.        "quote"  think-abs.      road 

vrksatrayatale     brahmanasya  vartamani  UPAVI6YA  STHITAH   (4)    talra      ekena 
under  3  trees        brahmin"s  path  sit-abs.  remain-ta-ppl.       there      one 

dhurtena     sa         brahmanah        gacchan      ABHIHITAH    (5)  bhoh  brahmana 
by  a  rogue     that        brahmin  go-pr.  ppl.      address-ta-ppl.      O  brahmin 

(6)  kirn  iti     Ivaya     kukkurah     skandhena        uhyate    (7)  brahmanah    BRUTE 
why         by  you    dog  on  shoulder  carry-3  Sg.     brahmin  say-3  Sg. 

(8)  na         ayam     sva        (9)  yajftacchagah    ayam      (10)      ananlaram   punah 
not  this         dog  sacrifice-goat       this  after  again 

dvitTyena     krosamatravasthitena     tad        eva       UKTAM       (11)   lad    akaniya 
by  the  2nd     positioned  1  kos  farther     the  same  thing      say-ta-ppl.  this     listen-abs. 

brahmanah        tarn       chagam       bhumau      nidhaya       muhurmuhuh 
brahmin  that        goat  on  ground      lay-abs.         repeatedly 

nirlksya       punah  skandhe      krtva     dolayamanamatih     CALITAH 
reflect-abs.    again      on  shoulder   do-abs.  w/ wavering  mind  start-ta-ppl. 

(12)  tadanantaram   punah  gacchan      sa         brahmanah  trtTyena        dhurtena 
thereupon  again      go-pr.  ppl.      that         brahmin  by  the  third    rogue 

UKTAH    (13)     bhoh     brahmana     (14)      kim  iti    kukkuram    skandhena 
say-ta-ppl.  O  brahmin  why        dog  on  shoulder 

bhavan        vahati     (15)      tad        akarnya       niscitameva       ayam     kukkurah 
thou  carry-3  Sg.  this         hear-abs.       surely  this         dog 

iti  matva  chagam       tyaktvS  snitva       SVAGRHAM      YAYAU 

"quote"    think-abs.      goat  abandon-abs.  bathe-abs.    own  home  go-3  Sg. 

(16)       sa         chagah        tair  dhurtaih       nUva  BHAKSITAH     (H4.10) 

that        goat  by  the  rogues       carry-abs.     eat-ta-ppl. 


6.2  The  function  of  the  absolutive  within  the  rhetorical  structure  of 
the  fables 

In  this  sample  story,  as  in  the  other  fables,  the  turning  points  from  narrative 
to  speech,  or  from  location  to  location  are  marked  by  finite  verbs  (or  the  ta  - 
participle  used  as  a  finite  verb).  In  the  narrative,  besides  the  initial  asti  which 
functions  as  the  equivalent  of  'Once  upon  a  time'  and  begins  most  of  the  fables, 
the  finite  verbs  mainly  express  the  activities  of  'speaking',  and  'going'  (or 
'remaining').  The  only  exceptions  seem  to  be  avalokitah  '(was)  seen'  and 
bhaksitah  '(was)  eaten';  which  focus  on  more  specific  activities  or  events. 

But  note  that  the  rogues'  seeing  the  brahmin  in  effect  constitutes  the 
starting  point  of  the  story,  since  it  gives  rise  to  all  the  later  events.  And  the 
eating  of  the  goat  by  the  rogues  represents  the  story's  successful  conclusion  or 
endpoint.  The  two  verbs  thus  mark  the  two  most  important  turning  points  and  in 
that  sense  are  not  true  exceptions. 

On  the  other  hand,  absolutives  are  used  to  express  the  different  events  of 
the  story  that  lead  to  the  turning  points:  'buying',  'putting',  'laying',  'abandoning', 
'bathing',  'carrying';  as  well  as  the  less  'active'  activities  of  'thinking',  'hearing', 
and  'sitting'.   (The  present  participle  is  used  to  express  'going'.) 

Thus,  with  the  exception  of  the  starting  and  concluding  actions,  main  verbs 
and  absolutives  are  in  complementary  distribution:  The  major  activities 
described  in  the  narrative  are  expressed  as  absolutives,  while  finite  verbs  are 
used  to  signal  turning  points.  Absolutives  can  be  seen  as  a  means  to  pace  the 
story  in  its  cumulative  ascent  by  grouping  together  actions  on  a  specific  level  via 
their  common  subordination  to  the  finite  verb,  so  that  the  story  progresses  level 
by  level. 

It  is  this  key  role  of  the  absolutive  in  the  rhetorical  structure  of  the  fables 
that  accounts  for  their  great  frequency,  especially  in  sentences  like  (11)  and 
(15)  in  (21b)  where  as  many  as  four  actions  are  'subordinate'  or  lead  up  to  the 
relevant  turning  points.  Now,  the  relation  between  absolutives  and  finite  verbs 
agrees  very  well  with  the  often  observed  tendency  of  Classical  Sanskrit  to 
permit  only  one  finite  verb  at  the  end  of  a  sentence  and  to  require  the  other 
actions  to  be  expressed  by  non-finite  verbs.  But  first,  note  that  this  is  a  tendency 
only.  We  do  find  relative-correlative  structures  with  finite  verbs  in  both  clauses; 
of.  example  (18a)  above.  Secondly,  coordination  with  two  finite  (or  main)  verbs 
is  a  viable  alternative  to  the  absolutive  construction.  Thus  the  above  story  H 
4.10  ends  with  the  absolutive  nftva  and  the  main  verb  bhaksitah  'eaV. 
Hitopadesa  1.2  instead  ends  with  two  coordinated  main  verbs:  vyapaditah 
khaditah  ca  'was  killed  and  eaten'.  And,  in  fact,  sentences  like  11  of  example 
(21b)  could  be  alternatively  rendered  by  coordination  as  in  (22): 
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(22)  brahmanena     tad  akamitam  |        tena  ca     chagah     bhumau 

brahmin  this  heard  and  he      goat  ground 

nihitah      tad    ca  sarvam  niriksitam  |        tatah         chagah 

lay  and  this  all  reflect  then  goat 

punah       skandhe  krtah  |  atah  brahnnanah 

again        shoulder  put  then  brahnnin 

dolayamanamatih  calitah  | 

w/  wavering  mind  start 

The  brahmin  heard  this.  And  he  laid  the  goat  on  the  ground  and  he 
reflected  about  all  this.  Then  he  put  the  goat  on  his  shoulder  again. 
Then,  the  brahmin,  with  a  wavering  mind,  started  off.' 

It  is  thus  not  grammar,  but  considerations  of  discourse  and  rhetorical  structure 
which  crucially  determine  the  use  to  which  the  absolutive  is  put  in  fable 
narrative. 

7.0  Conclusion  and  Implications 

This  study  has  shown  that  the  predominance  of  absolutives  in  one  type  of 
classical  Sanskrit  text,  namely  the  narrative  passages  of  fable  literature,  has 
very  specific  discourse  and  rhetorical  foundations:  The  (non-finite)  absolutive 
serves  to  characterize  actions  that  lead  up  to  (and  in  that  sense  are  subordinate 
to)  the  major  turning  points  in  the  narrative,  turning  points  which  are 
characterized  by  finite  verbs  (including  the  ta-participle  used  as  a  main  verb). 

This  finding  is  convergent  with  the  earlier  observations  of  Renou  and 
Gonda  on  the  use  of  absolutives  in  Vedic  ritual  texts.  However,  it  shows  that  the 
specific  reasons  for  the  use  of  absolutives  may  differ  consistently  between 
different  texts. 

Moreover,  spot  checks  of  other  classical  texts  that  can  be  broadly 
described  as  narrative  (such  as  the  story  of  Nala  in  the  Mahabharata)  reveal 
that  in  those  texts  the  absolutive  is  used  more  sparingly.  Notice  also  the  relative 
rarity  of  absolutives  in  the  direct  speech  passages  of  the  fables.  This  suggests 
that  far  from  there  being  an  across  the  board  increase  of  major  proportions  in 
the  use  of  the  absolutive,  the  frequency  of  absolutives  depends  to  a  significant 
degree  on  the  nature  of  discourse  and  literary  genre  and  is  appropriate  for 
certain  types  of  text.  These  findings  suggest  caution  vis-a-vis  claims  like 
Whitney's  that  the  absolutive  is  used  to  excess  in  later  literature  or  Tikkanen's 
hypothesis  for  a  general  increase  in  use  brought  about  by  Dravidian  influence. 


Notes 

*l  would  like  to  thank  Hans  Henrich  Hock  for  his  extremely  useful  com- 
ments and  discussion  of  various  stages  of  this  paper.  This  paper  also  benefited 
from  helpful  comments  received  when  an  earlier  version  was  read  at  the  South 
Asian  Languages  Analysis  Roundtable  X,  1988,  at  the  University  of  Washing- 
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ton.     Any  misunderstandings  are  my  own,  of  course.   The  examples  are  given 
with  their  sandhi  undone. 

iCf.  Gonda  1971:136,  note  1  for  a  criticism  of  Whitney's  claim  that  the 
character  of  the  absolutive  did  not  change. 

2The  ta  -participle,  often  called  the  'past  passive  participle'  indicates 
general  past  time  and  can  substitute  for  a  finite  verb  as  in  the  following 
example: 

devadattena     katah  krtah 

D.-ins.  mat-nom.  do-ta-ppl. 

'Devadatta  made  a  mat,' 

3|n  addition  to  consecutive  sentences,  or  sequential  coordinate  clauses, 
there  are  nine  means  of  indicating  subordination  in  Sanskrit,  listed  and 
illustrated  below: 

a.  the  absolutive  (cf.  examples  1  and  2) 

b.  quasi-clausal  participial  constructions  (cf.  example  3b) 

c.  relative  clause/main  clause  constructions  (cf.  example  3c  and  5) 

d.  participial  adnominal  constructions 
krtah  katah 

do-ta-ppl.       mat-nom. 
'the  finished  mat' 

e.  genitive/locative  absolute  participles  (for  locative  absolute  cf.  15b) 
pasyato    yajhadattasya         devadattah    katam  karoti 
see-gen.  Y.-gen.                    D.-nom.         mat-acc.        do-3  sg. 
'Even  though  Yajfiadatta  is  watching,  Devadatta  makes  a  mat.' 

f.  'raising'  constructions 

yajnadattah  devadattam   katam  kurvantam  pasyati 

Y.-nom.  D.-acc.  mat-acc.        do-pres.  part,      see-3  sg. 

'Yajnadatta  watches  Devadatta  making  a  mat.' 

g.  quotative  constructions 

devadattah    katam  karomi  iti         vadati 

D.-nom.         mat-acc.        do-1  sg.         quot.   say-3  sg. 

'Devadatta  said  he  would  make  a  mat.' 

(lit.  'Devadatta  said:  I  will  make  a  mat.') 
h.   'subordinating  conjunctions' 

devadattah    ced     etad         kuryat 

D.-nom.         if         it-acc.       do-opt. 

(tarhi/tada)     katah        sundarah  syat/bhavet 

then  mat-nom.  beautiful-nom.    be-opt. 

'If  Devadatta  makes  it,  (then)  the  mat  will  be  beautiful.' 
i.     infinitive 

devadattah    katam  kartum      icchati 

D.-nom.         mat-acc.  do-inf.      want-3  sg. 

'Devadatta  wants  to  make  a  mat.' 
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'^P-oriented  constructions  refer  to  te -participle,  gerundive,  and  passive 
constructions  which  all  have  promoted  transitive  patients  and  dennoted  agents. 
Their  unmarked  word  order  is  AOV.  These  three  constructions  are  illustrated 
below: 

a.  ta-participle:  devadattena       katah  krtah 

D.-ins.  mat-nom.       do-ta-ppl. 

'Devadatta  made  a  mat.' 

b.  gerundive:     devadattena       katah  kartavyah 

D-ins.  mat-nom.       do-gdv. 

'Devadatta  should  make  a  mat.' 

c.  passive:        devadattena       katah  kriyate 

D-ins.  mat-nom.       do-pass. 

'A  mat  is  made  by  Devadatta.' 

5The  sample  is  small  because  many  of  the  attested  P-oriented 
constructions  lacked  a  specified  agent  or  object. 

^including  relative  adverbs  such  as  yatha  'as',  yada  'when',  and  yadi  'if. 

''in  that  paper,  we  also  described  the  Pancatantra  as  having  a  nominal 
style  whereas  Aesop's  fables  showed  a  more  verbal  style.  However,  our  notion 
of  'nominal'  and  'verbal'  was  based  on  an  overly  broad  distinction,  'noun-like'  or 
'verb-like',  where  all  non-finite  verbs  were  considered  noun-like. 

Abbreviations  of  Textual  References 

The  abbreviations  of  textual  references  are  as  follows: 

PI  =  Pancatantra  (HerteM  908) 

Pll  =  Pancatantra  (Hertel  1915) 

PIN  =  Pancatantra  (Kale  1 982) 

H  =  Hitopadesa  (Lanman  1884) 
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ON   THE   HEAD    MOVEMENT   CONSTRAINT 
Xinping   Zhou 


The  universal  nature  of  the  Head  Movement  Constraint 
(HMC)  (Chomsky  1986)  is  here  examined  by  looking  closely 
at  two  X°  movement  phenomena  in  Mandarin  Chinese:  qu- 
raislng  and  l-lowerlng.  It  is  argued  that,  even  without  the 
prototypical  properties  of  X°  movement  such  as  verb 
inflection,  syntactic  agreement,  and  derivational  morphology 
commonly  found  in  Indo-European  languages,  Chinese 
provides  positive  evidence  for  the  HMC  as  a  universal 
principle  on  syntactic  representation.  A  number  of  otherwise 
unrelated  and  unexplained  facts  in  Chinese  syntax  are 
shown  to  fall  nicely  under  the  general  condition  of  the  HMC. 
A  unified  theory  of  verb  raising,  negation,  and  aspect 
marking  in  Chinese  is  proposed  and  discussed. 


1.  Introduction 


In  the  recent  literature  of  Government  and  Binding  theory,  it  has  been  proposed 
that  the  movement  of  an  X°  (word-level)  category  mirrors  that  of  an  X""^  (phrasal 
level)  category  (Baker  1988.  Chomsky  1988.  1986.  Pollock  1987.  Koopman 
1984,  Travis  1984).1  X°  movement  is  a  head-to-head  movement  of  an  X° 
element  to  another  X°  element  via  syntactic  adjunction;  X""^*  movement,  on  the 
other  hand,  is  a  movement  of  a  maximal  projection  to  another  maximal 
projection  by  either  substitution  or  adjunction.  It  has  been  assumed  that  the 
movement  constraint  on  phrasal  categories,  namely,  the  ECP,  applies  equally 
well  to  word  level  elements  in  the  X°  movement. 

Chomsky  (1986)  defines  the  head  movement  constraint  (hereafter  HMC) 
by  two  interrelated  conditions  on  the  syntactic  representation:  the  government 
condition  that  requires  a  head  to  be  in  a  c-commanding  relation  to  its  residue, 
and  a  strict  locality  condition  which  requires  the  head  movement  chain  [X°, ...  e,] 
not  to  be  intervened  by  a  barrier  which  is  either  a  maximal  projection  or  another 
c-commanding  head,  hence  the  minimality  barrier. 


184 


(1)  Head  Movement  Constraint  (HMC)  (Chomsky  1986:71) 

The  movement  of  a  zero  level  category  p  is  restricted 
to  thehead  position  of  a  head  a  that  governs  the  maximal 
projection  yof  p,  where  a  e-governs  or  L-marks  yif  a^^C. 

Chomsky  defines  e-government  and  L-marking  as  follows. 

(2)  a  0-governs  p  iff  a  is  a  zero-level  category  that  e-marks  p,  and  a 
and  p  are  sisters. 

a  L-marks  p  iff  a  is  a  lexical  category  that  e-governs  p. 

According  to  this  HMC,  both  stnjctures  in  (3),  are  ill-formed. 

(3) 

a.  A  b. 


In  (3a),  the  head  a  doesn't  c-command  the  head  p.  Thus  movement  of  p  into  the 
head  position  of  a  violates  the  HMC.  In  (3b),  although  the  head  a  does  c- 
command  the  head  p,  movement  of  P  into  the  head  position  of  a  at  one  swoop 
is  also  banned,  for  (p  is  a  barrier  by  the  minimality  condition  (4). 

(4)       Minimality  Condition  (Paraphrased  from  Chomsky  1986:10) 

Given  three  heads  a,  (p,  p,  where 
...a...[...(p...[...  p...]] 

a  does  not  govern  p  if  there  is  a  closer  governor  cp  that 
governs  p  even  though  a  othenwise  satisfies  all  conditions 
on  government. 
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(3b)  falls  afoul  of  ECP  since  the  head  a  doesn't  govern  its  trace  because  of 
the  minimality  barrier  (p. 

In  contrast  to  (3),  both  structures  in  (5)  are  well-formed,  conforming  to  the 
HMC. 


In  {5a)  the  head  a  L-marks  and  thus  properly  governs  the  maximal  projection  X. 
So  the  movement  of  (3  to  the  head  position  of  a  is  valid.  In  (5b),  the  head 
movement  of  p  to  a  via  the  head  position  of  cp  is  equally  legitimate  since  the 
head  movement  in  each  stage  is  strictly  local  with  the  trace  of  (3  being  either 
antecedent-governed  or  head-governed. 2 

It  remains  an  open  question,  however,  whether  the  HMC  is  a  valid 
principle  on  syntactic  representation  universally,  thought  it  has  been  shown  to 
be  descriptively  correct  in  several  languages  such  as  English  (Chomsky  1988), 
German  (Travis  1984,  1987),  and  French  (Pollock  1987).  It  is  extremely 
worthwhile  to  test  the  HMC,  and  ultimately  ECP,  for  the  movement  of  X° 
elements  in  a  language  that  differs  from  most  Indo-European  languages,  a 
language  that  lacks  the  prototypical  head  movement  properties  such  as  verb 
inflection,  syntactic  agreement,  and  subject-aux(iliary)  inversion.  This  is 
therefore  the  major  goal  of  this  study. 

In  this  paper,  I  explore  the  universal  nature  of  the  HMC  by  examining 
some  X°  movement  phenomena  in  Chinese.  I  argue  that  positive  evidence  can 
be  found  for  HMC.  This  argument  is  supported  by  the  fact  that  a  number  of 
otherwise  unexplained  and  unrelated  syntactic  phenomena  in  Chinese  can  be 
nicely  accounted  for  if  we  assume  HMC  as  a  universally  viable  principle  on 
syntactic  representation. 

In  what  follows  I  will  discuss  two  syntactic  phenomena  in  Mandarin 
Chinese  in  relation  to  the  Head  Movement  Constraint-the  verb  movement  of  qu 
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in  qu-yp  permutation,  and  the  1°  movement  in  relation  to  the  well-known 
Neg-Aspect  Marker  alternation  in  this  language. 

2.  qu  Permutation 

2.1.  The  facts 

Mandarin  is  known  to  be  an  isolating  language.  One  piece  of  evidence  for  such 
typological  classification  is  that  it  completely  lacks  verbal  inflection  for  tense 
marking.  Consider  (6). 

(6)  a.      English:     I  am  go/ng  to  the  movie, 
b.      Chinese:   wo  kan  dianying  qu 

I    see      movie     go 

'I  am  going  to  see  the  movie.' 

Here,  a  bare  verb  qu  (to  go)  is  contrasted  to  the  inflected  verb  form  of 
English  'be  going'  in  denoting  a  future  event.  However,  though  it  has  no  verb 
inflection  for  tense  marking,  Chinese  arguably  has  verb  movement.  Let's  again 
consider  the  same  word  qu.  in  structures  like  (6b)  in  Mandarin,  the  verb  qu  is 
generally  permutable  with  a  preceding  VP.  Thus,  the  sentence  in  (6b)  has  a 
syntactic  variant  (7)  with  the  same  semantic  interpretation. 

(7)  wo  qu  kan  dianying 

In  (7),  what  we  find  is  a  pre-VP  qu,  parallel  to  the  English  word  order  in 
(6a).  Both  (6b)  and  (7)  denote  the  same  future  event.  This  permutation  pattern 
can  be  summarized  in  (8). 

(8)  a.       VP  +  qu 
b.       qu  +  VP 

Such  permutation  proves  to  be  free  in  all  of  the  following  sentences  noted 
by  Lu  (1988). 

(9)  a.       wo  [kan  dianying]  qu 

I     see     movie     go 
b.       woqtVi         [  kan  dianying]    ei 
'I  am  going  to  see  the  movie.' 

(10)  a.       ni       [shui        huir]      qu 

you     sleep  a  while     go 
b.       ni    qu,  [shui  huir]    e; 

'You  go  to  sleep  for  a  while.' 
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(11)  a.      ni  [   gan       ma]     qu? 

you  do        what    go 
b.      ni      qu\  [gan  ma]     ei 
you  go     do  what 
'What  are  you  going  (there)  to  do  ?  ' 

(12)  a.      wo  [kankan]  qu 

I    see         go 
b.      wo  qu  I    [kankan]    ei 

'I  will  go  and  take  a  look  (at  it).' 

In  all  these  instances,  the  post-VP  qu  permutes  with  the  preceding  VP. 
However,  as  Lu  observes,  qu  permutation  is  not  always  free. 

First,  when  in  an  embedded  clause,  qu  cannot  move  to  the  pre-VP  position 
out  of  that  clause. 


(13)  a.      ta    bi     [  woqu] 

he  force  me  go 
b.      *ta  qu,  bi  [wo    ej  ] 
'He  forced  me  to  go.' 

(14)  a.      women  pai  [taqu] 

we        send  he  go 
b.      *women  qu\  pai  [ta    ej  ] 
'We  sent  him  to  go  (there).' 

(15)  a.      wo  qing  [ta  qu  ] 

I      ask  him  go 
'I  asked  him  to  go.' 
b.      *wo  qu  i  qing  [ta   ei    ] 

(13b)  and  (14b)  are  ill-formed.  (15b)  is  a  well-acceptable  sentence,  but  it 
has  a  different  semantic  interpretation  than  (15a),  meaning  'I  will  go  to  ask  him." 

Secondly,  when  the  preceding  VP  is  an  adjunct,  the  raising  of  qu  is  also 
prohibited. 


(16)         a.      wo  [zoulu    ]  qu 
I      walk      go 
b.       *wo  qu,  zoulu  ei 

'I  will  go  there  by  walking. 
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(17)  a.      wo  [chile  fan]  qu 

I     eat  meal     go 
b.      *wo  qu  i   [chile  fan]  e, 

'I  will  go  after  eating  the  meal.' 

The  contrast  here  is  rather  obvious.  In  the  ill-formed  examples  of  (13)-{15) 
and  (16)-(17),  the  verb  qu\s  itself  the  single  primary  predicate,  while  in  the  well- 
formed  free  permutation  examples  of  (9)-(12),  it  is  not.  Why  should  this  be  the 
case?  In  what  follows,  I  will  try  to  provide  a  principled  account  for  this  contrast. 

2.2.  Head  Movement  Constraint  and  qu-raising 
2.2.1.  The  Status  of  qu  and   predication 

Let's  first  consider  some  alternative  approaches  towards  this  qu/VP 
permutation.  To  begin  with,  a  crucial  generalization  for  the  data  in  consideration 
is  that  qu  raising  is  possible  only  if  the  preceding  VP  is  a  unitary  predicate  of  an 
argument,  which  qu  is  also  predicated  to,  following  the  predication  theory  of 
Williams  (1980,1982,1987).  In  the  well-formed  sentences  of  (9)-(12),  both  qu 
and  the  preceding  VP  are  linked  to  the  same  external  argument,  i.e.  the  subject 
of  the  whole  clause. 3  However,  in  the  ill-formed  permutation  cases  of  (13)-(17), 
such  unitary  predication  no  longer  holds.  When  qu  is  the  primary  predicate  of 
an  embedded  clause  as  in  (13)-(15),  it  is  linked  to  the  external  argument  of  that 
embedding  clause.  Descriptively,  a  generalization  can  be  formulated  as  (18). 

(18)       A  post-VP  qu  can  raise  to  a  pre-VP  position  if  it  is  linked 

in  predication  to  the  external  argument  of  that  VP. 

Thus,  a  permutable  case  like  wo  kan  dianying  qu  (=(8a))  'I  am  going  to  a 
movie'  with  its  permutation  variant  of  wo  qui  kan  dianying  t ,  (=8b)  will  have  the 
following  predication  form. 

(19)  a.  qu'(wOi)  ^  kan'(wOi,  dianying) 
b.  *qu  '(dianying) 

It  seems  plausible  then  to  assume  that  the  permutable  stmctures  all  have  a 
complex  VP  which  is  made  up  of  [VI  (XP)  V2],  whose  argument  is  the  subject 
(S),  that  is,  the  external  argument  of  the  complex  VP. 

(20)  [    IP     NP  Pred  P  ] 

(X)  (Y) 

I 
(vp    VI    (XP)    V2    ] 
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(20)  explains  the  predication  relation  of  the  complex  VP  (Y)  to  the  external 
argument  (X)  within  the  same  clause  by  the  form  of  predicate-linking,  borrowing 
a  term  from  Rothstein  (1983). 

Consequently,  for  sentences  like  (9)-(12),  permutation  is  predictably 
allowed,  since  the  complex  VP  ([V  NP  qu  ])  is  linked  to  one  unique  argument  as 
in  (21).  Consequently  Qtv-raising  is  sanctioned  in  (22). 


(21)    [^^wo    [^p[^.    kan     dianying  ]     [^q    Qu 

I                   see        movie             go 
I  I I 


(22)  qu  -raising 

[ip  wo  [  [vo  qu,  ]  [vp  [vo  kan  ]  dianying  ]  t,    ]] 
I  go  see      movie 

This  predication  account  correctly  predicts  that  sentences  like  (13)-(15)  will 
disallow  qu  permutation  since  the  word  qu  and  the  preceding  VP  are  not  linked 
to  the  same  argument  by  predication  due  to  the  fact  that  they  are  not 
clausemates.  For  sentences  (16)-(17),  qu  permutation  is  equally  illicit,  for  the 
subject  of  the  adjunct  VP  is  a  PRO.  Therefore,  the  qu  and  the  adjunct  VP  are 
not  linked  to  the  same  argument  even  though  the  two  arguments,  namely  the 
PRO  in  the  adjunct  clause  and  the  subject  of  the  matrix  clause,  happen  to  be 
co-indexed.  The  ill-formed  derivation  of  (17),  for  example,  is  shown  in  (23)-(24) 

(23)  [  ,p  wo.  [,p  PRO       [^^  Chi     le       fan   ]]  [^^  [  ^^qu  ]]] 

I  I  eat  PERF  meal  go 


(24)    qu-raising:       *  wo    [   qu  ,      chile    fan  tj  ] 

The  predication  account  makes  correct  predictions  for  the  problems  in 
consideration.  However,  it  is  still  not  clear  why  the  Qumovement  constraint 
should  be  as  such.  In  other  words,  we  are  still  left  without  a  principled 
explanation  as  to  why  the  clause  relation  and  predicate  linking  should  have 
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such  impact  on  verb  movement  in  Mandarin.  We  obviously  need  to  ask  what 
underlying  principle  is  responsible  for  the  surface  variations  of  qu  placement 
discussed  above.  Let's  consider  a  second  alternative  approach. 


2.2.2.  The  Head  Movement  Constraint 

As  I  have  demonstrated  earlier,  free  permutations  of  qu  all  involve  a  structure 
that  can  be  construed  as  having  a  complex  VP.  This  syntactic  structure  is 
fundamentally  different  form  those  of  the  non-permutable  sentences,  namely, 
those  that  have  the  verb  qu  as  the  main  predicate  of  an  embedded  sentence  or 
those  that  have  an  embedded  adjunct  clause.  In  what  follows  I  will  argue  that 
the  difference  in  this  qu  movement  is  what  we  would  exactly  expect  from  the 
principle  of  HMC. 

Following  Larson  (1988),  we  can  construe  the  complex  VP  in  question  as 
consisting  of  an  embedded  VP. 


(26) 


Spec 


The  structure  of  (26)  is  compositionally  determined  by  the  semantic 
relation  between  the  verb  qu  and  the  preceding  VP2,  both  of  which  form  a 
unitary  predicate  for  the  external  argument  of  the  complex  VP.  We  can  assume 
that  an  optional  Move-alpha,  probably  stylistic  inversion,  applies  to  (26).  By 
verb-raising,  qu  raises  to  the  minimally  c-commanding  empty  head  position  [e]. 
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Consequently  the  surface  qu  permutation  follows.  What  we  get  is  a  free  pre-VP 
and  post-VP  qu  permutation  within  the  VP  compound.  Is  the  HMC  obeyed  here? 
As  shown  in  (27),  the  residue  of  the  X°  element  qu  is  minimally  c-commanded 
by  its  antecedent,  the  raised  verb  qu.  After  Qu-raising,  the  upper  V°  in  (27) 
becomes  lexical,  thus  it  L-marks  VPi.  Hence  it  governs  VPi  and  every 
constituent  included  in  VPi.  Therefore,  HMC  is  observed,  since  there  is  no 
barrier  intervening  between  the  upper  V°  and  the  lower  V°in  (27).  Thus,  qu- 
raising  is  legitimate. 


(27) 


We  now  turn  to  the  non-permutable  cases  of  (13)-(15)  and  (16)-(17).  First, 
consider  the  sentences  in  (13)-(15).  All  these  sentences  have  an  embedded 
clause  similar  to  the  ECM  structure  in  English.  As  Li  (1985)  argues,  the  parallel 
ECM  stnjcture  in  Chinese  is  a  V  +  S'  (=CP).  If  this  line  is  correct,  we  then  would 
expect  the  structure  of  (13)-(15)  to  be  that  of  (28). 
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qu 

In  (28),  for  the  V°  qu  to  be  moved  to  the  upstairs  head  position  of  V°  bi,  it 
has  to  move  through  the  intervening  head  positions  of  1°  and  C°  as  required  by 
the  Minimality  Condition  (4).  Then,  the  question  boils  down  to  whether  Chinese 
allows  a  V°->r-^C°  movement.  If  this  kind  of  head-to-head  movement  is 
possible,  we  then  would  expect  qu  permutation  to  be  well-formed  in  (13)-(15), 
and  ill-formed  otherwise.  It  turns  out  that  Chinese  prohibits  such  movement. 
This  point  is  made  clear  by  the  following  contrast. 


(29)     a.   ta  xihuan  Mali 
he  like       Mary 
'He  likes  Mary.' 
b.  *  Xihuan  ta  Mary? 
'Does  he  like  Mary?' 

The  fact  that  Chinese  lacks  a  y°-^\°^C°  movement  explains  why  verb 
inversion  with  the  subject  in  interrogative  sentences  like  (29b)  is  never  possible. 

This  reflects  a  similar  test  in  Pollock  (1987)  who  distinguishes  tense/aspect 
marking  patterns  in  English  and  French.  In  English,  movement  of  the  main  verb 
to  the  COMP  position  is  not  allowed  as  in  'Likes  he  Mary?  French,  on  the  other 
hand,  permits  such  verb  raising  as  in  Aime-t-il  Marie? 
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Pollock  argues  that  this  verb  movement  constraint  is  attributable  to  the 
parametric  setting  for  the  node  Agr  P.  English  Agr  P  is  opaque,  i.e.  suppressing 
the  0-role  assignment  property  of  the  main  verb,  thus  V°-^I°-^C°  movement  is 
not  allowed  in  English.  French,  on  the  other  hand,  has  a  transparent  Agr  node, 
thus,  the  0-role  assignment  property  of  the  main  verb  is  maintained  even  if  the 
main  verb  undergoes  V°->I°->C°  movement. 

Since  \/°-^l°->C°  is  not  allowed  in  Chinese,  the  verb  qu  in  (28)  cannot 
raise  to  the  pre-VP  position  of  the  matrix  IP.  If  it  were  to  raise,  it  would  violate 
HMC  since  both  intervening  1°  and  C°  are  minimality  barriers. 

(30)     (  ,p...  V°  (^  C  [,p     |0  qu   111 

I . 1 


Similarly,  the  illicit  (16b)  and  (17b)  should  also  fall  afoul  of  HMC.  Let's 
assume  that  in  (16)  and  (17),  the  adjunct  VP  is  adjoined  to  I'  inside  an  adjunct 
clause  IPaOf  time  and  manner.'* 


(31) 


Spec 


In  (31),  for  qu  to  be  raised  into  IPais  a  violation  of  HMC  since  none  of  the 
constituents  contained  in  IP2  stands  in  a  c-commanding  position  to  the  verb  qu. 
Therefore,  the  qu  permutation  with  a  preceding  VP  in  (16)-(17)  is  illicit. 

Summarizing,  we  have  shown  that  the  array  of  qu  raising  possibilities  in 
Mandarin  Chinese  falls  under  one  general  principle  of  HMC.  Both  the 
government  condition,  i.e.  the  c-commanding  requirement  (Aoun  &  Sportiche 
1982)  between  an  X°  category  and  its  movement  trace,  and  the  minimality 
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requirement  (Travis  1984,  Chomsky  1986,  Rizzi  1987),  i.e.,  the  strict  locality 
condition  between  that  X°  category  and  its  movement  trace,  play  a  vital  role  in 
determining  the  permutation  of  verb  qu  with  the  preceding  VP. 


3.  l-movement  in  Mandarin 

In  this  part,  I  try  to  identify  certain  properties  of  1°  movement  existing  in  Mandarin 
Chinese.  In  particular,  I  will  examine  a  possible  process  of  syntactic  affixing  of 
aspect  marking,  and  furthermore.  I  will  show  that  this  1°  movement  must  obey  the 
Head  Movement  Constraint,  parallel  to  many  other  forms  of  head  movement  in 
other  languages. 

3.1.  le,  zhe,  guo  and  Chinese  clause  structure 

Chinese  has  three  particles  for  aspect  marking,  which  can  be  viewed  as  suffixes 
to  the  main  verb  predicate  --  V+  le  (perfective),  V+zhe  (progressive),  and  V+guo 
(perfective).  Aspect  marking  by  le,  zhe,  and  guo  is  shown  in  (32)  a.b.c 
respectively. 

(32)     a.    wo  du    le        zheben  shu 
I  read  PERF  this         book 
'I  have  read  this  book.' 

b.  wo  du  zhe       zheben  shu 

I  read  PROG  this  book 

'I  am  reading  this  book." 

c.  wo  du     guo      zheben  shu 

I    read  PERF    this        book 
"I  have  read  this  book.' 

In  traditional  Chinese  grammar,  these  three  particles  are  grouped  in 
one  category  called  'xuci'  (form  words  or  verb  auxiliaries),  and  are  given  a 
unitary  treatment.  A  legitimate  question  about  this  approach  is  that  while 
both  le  and  guo  denote  some  action  that  has  been  completed  in  time,  do 
they  differ  syntactically?  Evidence  can  be  found  that  they  do.  The  crucial 
difference  between  the  aspect  markers  le  and  guo  is  that  while  guo  can 
only  be  attached  to  a  verb,  le  can  be  attached  either  to  a  verb  or, 
most  importantly,  to  a  V+guo.  Thus,  (32c)  can  be  further  stated  as  (33a) 
equally  well,  but  (32a)  cannot  be  further  stated  as  (33b). 

(33)    a.      wo  duguo/e  zheben  shu 

I  read+guo+le  this  book 

'I  have  read  this  book.' 
b.      *wo  duleguo         zheben    shu 
I    read+le+guo  this  book 
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The  cx>ntrast  here  is;  le  can  be  an  affix  to  be  attached  to  a  V+guo,  but  guo, 

on  the  other  hand,  can  never  be  attached  to  a  V+/e.  Empirically,  it  is  perhaps 
plausible  for  us  to  construe  le  as  a  syntactic  morpheme,  and  guo  as  a  lexical 
morpheme.  I  will  show  in  the  later  discussion  that  this  assumption  is  correct,  as 
it  correlates  with  some  1°  movement  phenomenon  in  this  language.  This 
distinction  has  some  vital  consequences:  Le  and  guo,  being  syntactic  affixes, 
should  be  dominated  by  Infl.  Moreover,  guo,  being  a  lexical  morpheme,  cannot 
be  an  1°  element,  but  is  only  part  of  the  lexical  item  whose  derivation  is  then 
restricted  in  the  lexicon,  under  the  old  lexicalist  hypothesis. 

Zhou  (I988a,b),  following  the  X'  schema  of  Chomsky  (1986),  proposes 
that  the  canonical  clause  stnjcture  for  Mandarin  is  somewhat  like  (34),  with  a 
unified  right-branching  tree  in  which  all  complement  maximal  projections  follow 
the  head  X°.5 

(34) 


Mandarin  is  known  to  have  mixed  word  order  parameters  of  SVO.  SOV, 
and  OSV,  as  shown  in  (35). 


(35)    a.       Lihua  chi      le  fan  (NP  V  NP) 

Lihua  eat     PERF       meal 
'Lihua  has  eaten  his  meal.' 
b.       Lihua  henkuaide  chi    le  fan  (NP  AdvP  V  NP) 

Lihua  quickly        eat    PERF  meal 
'Lihua  quickly  ate  his  meal.' 
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c.  fan  Lihua      chi        le  (NPobj  NP  subj  V ) 
meal      Lihua       eat      PERF 

'Lihua  ate  his  meal.' 

d.  Lihua     fan      chi    le  (NPsubj  NPobj  V ) 
Lihua    meal    eat    PERF 

'Lihua  ate  his  meal.' 

Zhou  (1988a)  argues  that  the  surface  word  order  variations  (35c-d)  are 
derived  from  the  canonical  SVO  order  in  (35a-b).  Both  the  non-gap  topic 
structure  and  the  topic  structure  with  gapping  in  (35),  i.e.  SOV  and  OSV  word 
orders,  involve  an  adjunction  of  the  nominative  maximal  projection  to  IP. 

fviost  crucial  to  this  study  is  the  internal  structure  of  IP  in  Mandarin. 
Following  Pollock  (1987),  I  propose  further  that  the  clause  structure  in  (34)  be  a 
partitive  one.  As  Pollock  shows  convincingly,  clause  structure  in  a  language  like 
English  or  French  may  be  leveled  into  several  subcomponents,  each  of  which  is 
an  independent  maximal  projection  --  TP,  Neg  P,  AgrP  VP,  NP,  etc,  where  T 
stands  for  tense,  Neg  for  negation,  and  Agr  for  agreement.  The  syntactic 
derivations  of  verb  morphology  in  a  certain  language  are  then  accurately 
captured  by  this  postulation  of  multi-leveled  IP  stnjcture.  This  position  was  also 
adopted  in  Chomsky  (1988)  in  his  treatment  of  English  verb  inflection. 

It  has  been  accepted  in  generative  syntax  that  the  Infl  node  in  various 
natural  languages  has  certain  Infl  features,  which  are  [+/-  Tense]  and 
[-K/-Agreement]  (e.g.  Raposo  1987).  Given  these  two  syntactic  features,  we  can 
propose  that  languages  fall  into  the  following  four  categories  with  regard  to  the 
specification  of  Infl. 


(36) 


Under  this  feature  system,  Infl  in  Chinese  finite  clauses  ought  to  be  marked 
as  [+tense]  [-agr]  for  impoverished  tense  marking.  By  impoverished  tense 
marking,  it  is  meant  that  the  language  has  only  aspect  marking  for  the  main 
verb,  while  lacking  morphological  marking  of  tense.  Therefore,  a  finite  clause  for 
Chinese  can  perhaps  be  (37).  To  make  the  distinction  clearer,  the  feature 
[+/-Aspect]  is  used  to  replace  the  feature  [+/-Tense]  here. 


a.  [-t-tense] 

[+agr] 

b.  [-tense] 

[+agr] 

c.  i-t-tense] 

[-agr] 

d.  [-tense] 

[-agr] 
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(37) 


(ADV  P)  VP 


Importantly,  once  a  negation  word  meiyou  is  used,  it  should  project  to  a 
maximal  projection  Neg  P,  mediating  between  I  and  VP  as  in  (38). 


(38) 


CO 

IP 

NP 

""^r 

|0 

[+/-  Aspect] 

(Nm 

P) 
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VP 

le 
zhe 

A 
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m€ 
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ji(you) 

/ 

(ADV  P) 

\ 

VP 
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With  these  preliminary  considerations,  we  now  turn  to  the  Neg-Aspect 
marker  alternation  in  Mandarin,  I  will  propose  an  alternative  solution  to  this 
problem  against  two  previous  proposals  by  Wang  (1965)  and  Huang  (1988). 


3.2.   Neg-Aspect  marker  alternation 

Wang  (1965)  observes  that  the  Mandarin  aspect  le  must  be  in  complementary 
distribution  with  the  negation  word  mei(you).  To  put  it  more  simply,  in  Mandarin, 
a  verb  with  an  aspect  marker/e  in  the  form  of  V+le  cannot  be  negated. ^ 
However,  in  contrast,  a  verb  followed  by  the  suffix  guo  in  the  form  of  V+guo 
does  not  respect  such  a  constraint. 

(39)  a.      ta  mai     shu 

he  buy    books 
'He  bought  books.' 

b.  ta  bu/meiyou   mai   shu 
he  not  buy  books 
"He  did  not  buy  books.' 

c.  ta     mai  le         shu 
he  buy    PERF    books 
'He  has  bought  books.' 

d.  *  ta    bu/meiyou  mai  le  shu 

he  not  buy  PERF   book 

'He  has  not  bought  books." 

e.  ta  bu/meiyou    mai  shu 
he       not  buy  books 
'He  has  not  bought  books.' 

(40)  a.      ta  mai  guo     shu 

PERF 
'He  has  bought  books.' 

b.       ta  mei(you)  mai  guo  shu 

'He  has  not  bought  books.' 

In  (39).  the  perfective  aspect  marker  le  cannot  co-occur  with  a  Neg 
mei(you)  or  bu,  though  the  verb  here  clearly  denotes  a  past  action.  In  (40),  the 
aspect  marker  guo  is  shown  to  be  free  of  this  constraint.'^ 


{ 
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Beyond  Wang's  observation  with  regard  to  le  in  Mandarin,  we  find  that  the 
progressive  aspect  marker  zhe  displays  the  same  syntactic  property  as  le  does. 
It  must  also  be  in  complementary  distribution  with  a  Neg. 

(41)  a.      wo  Zheng    zai      kan  zhe       shu 

I    just  at       read  PROG  book 

'I  was  just  reading  the  book.' 

b.       *?wo   mei/bu  zheng  zai  kan  zhe     shu 

not 
'I  was  not  reading  the  book." 

(42)  a.      wo  bu  zai  kan    shu 

I      not    at  read  book 

'I  was  not  reading  the  book.' 

b.      wo    mei  (zai)  kan    shu 
I        not     at      read    book 
'I  was  not  reading  the  book  .' 

In  (41b),  the  Neg  me/' co-occurs  with  the  progressive  aspect  zhe,  making 
the  sentence  illicit.  In  contrast,  (42)  is  well-formed  when  the  aspect  marker  zhe 
is  non-existent. 

Wang  (1965)  proposes  an  affix  hopping  analysis  for  the  Neg-/e  alternation, 
under  which  an  auxiliary  you  affixes  to  a  main  verb  only  in  affirmative 
sentences.  When  you  is  affixed  to  the  main  verb,  it  is  realized  as  le  perhaps  by  a 
morphological  rule.  In  negation,  this  affix  hopping  is  inapplicable.  Then  you 
stays  in  the  pre-verbal  position,  that  is,  the  position  dominated  by  the  Infl  node  if 
we  reinterpret  Wang's  account  in  a  more  recent  framework. 

Though  Wang's  analysis  is  observation  ally  correct,  it  fails  to  explain  why 
the  Neg-/e  alternation  should  be  so  constrained.  It  remains  unexplained  what  in 
principle  blocks  the  affix  hopping  of  the  syntactic  suffix  you/leXo  the  main  verb 
when  the  main  verb  is  being  negated. 

More  recently,  Huang  (1988)  proposes  an  alternative  solution  to  this  Neg- 
le  alternation.  Huang  argues  that  the  fact  that  le  has  to  be  in  complementary 
distribution  with  a  Neg  as  shown  in  (43)  follows  from  an  independent  semantic 
constraint. 

(43)  a.      *tamen  bu    pian-le  Lisi 

they      not    cheat-i-PERF  Lisi 

b.        *tamen  pian-bu-  pian-le  Lisi 

they        cheat-not-cheat+PERF  Lisi  (Huang  1988:283) 
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According  to  Huang,  the  negative  morpheme  bu  forms  an  immediate 
construction  with  the  first  V°  element  following  it.  Therefore  the  relevant  structure 
for  (43)  is  (44). 

(44)  [[  vo  bu  [vo  plan  ]]  le  ] 

The  Neg  bu  is  first  locally  Chomsky-adjoined  to  N/o,  and  then  the  perfective  le  is 
further  attached  to  this  negative  verb.  Huang  argues  that  such  a  structure  must 
be  ruled  out  for  'it  is  absurd  to  assign  the  perfective  to  a  negated  verb,  just  as  it 
is  contradictory  to  assert  the  completion  of  some  event  that  one  says  does  not 
happen'  (Huang:284). 

Though  Huang's  analysis  works  well  for  the  observed  Neg-/e  alternation 
constraint  exhibited  in  (43),  it  could  not  explain  why  the  negation  of  M+guo 
discussed  in  (40b)  is  well-formed.  Recall  that  guo  and  le  here  are  semantically 
similar.®  Both  denote  a  completion  of  action.  (40b)  is  repeated  in  (45). 

(45)  ta  [  Neg  nnei(you)  [  vo  mai  guo  shu]] 

As  is  generally  true,  (45)  has  a  negated  verb  with  a  perfective  suffix  guo,  yet  it  is 
perfectly  well-formed.  According  to  Huang,  sentences  like  (45)  should  be 
incorrectly  ruled  out  on  the  same  semantic  ground  as  (44). 

Furthermore,  it  has  not  been  spelt  out  explicitly  in  Huang's  analysis 
whether  such  a  semantic  constraint  is  cross-linguistically  viable,  for  in  many 
languages,  a  negated  verb  can  certainly  take  an  aspect  marker  to  mean 
'something  not  yet  completed'.  This  being  tme,  the  semantic  explanation  offered 
by  Huang  for  this  Neg-te  alternation  may  be  explanatorily  undesirable. 

Unlike  both  authors,  in  what  follows  I  will  argue  that  the  syntactic 
properties  of  aspect  marking  in  fy/landarin  involve  a  syntactic  affixation  by  l°- 
movement  which  is  constrained  by  the  general  principle(s)  governing  X° 
movement  in  Universal  Grammar. 

3.3.  l-lowering  and  the  Minimality  Barrier 

The  Neg-Aspect  Marker  alternation  observed  in  the  previous  section,  as  I  will 
argue,  is  exactly  what  we  would  expect  from  the  Head  Movement  Constraint  on 
the  movement  of  X°  categories.  Before  we  justify  this  assumption,  we  need  to 
consider  a  few  related  questions.  First,  we  need  to  ask  what  exact  mapping 
mechanism  between  P-makers  Mandarin  probably  has.  Second,  we  need  to 
know  the  possible  derivational  patterns  in  syntactic  affixation  that  exist  in  this 
language.  Let's  turn  to  the  first  question. 

It  has  been  widely  claimed  in  generative  syntax  that  inflectional  derivations 
are  realized  by  syntactic  affixation  of  Infl  elements,  i.e.  tense/aspect  and 
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agreement  features,  to  verbs.  In  such  derivations,  presumably  two  types  of 
movement  are  possible:  an  l°-element  first  attaches  to  the  main  verb,  followed 
by  LF  raising  of  the  amalgamated  verb,  or  alternatively,  the  head  verb  of  the 
main  predicate  directly  raises  to  the  Infl  node  in  the  syntactic  component  to  take 
up  tense/aspect  features.  Let's  call  the  first  type  of  mapping  l-lowering  and  the 
second  type  of  mapping  V-raising.  Pollock  (1987)  and  Chomsky  (1988)  both 
argue  that  this  difference  in  structural  mapping  can  be  a  parameter  setting  with 
regard  to  a  specific  language.  For  example,  French  very  likely  has  a  V-raising  to 
Infl  (and  later  to  Comp)  in  its  finite  clause,  while  English,  on  the  other  hand,  has 
a  lowering  for  |0-elements  to  be  affixed  to  the  main  verb,  later  followed  by  a 
verb-raising  at  LF.  This  proposal  is  supported  by  the  following  contrast  between 
English  and  French  from  Pollock  (1987:3). 

(46)  a.  *John  likes  not  Mary 

b.  Jean  (n')  aime  pas  Marie 

(47)  a.  *Likes  he  Marie? 
b.  Aime-t-il  Marie? 

(48)  a.  *John  kisses  often  Mary 

b.  Jean  embrasse  souvent  Marie 
0.  John  often  kisses  Mary 

(49)  a.*  My  friends  love  all  Mary 

b.  Mes  amis  aiment  tous  Marie 

c.  My  friends  all  love  Marie 

d.  *Mes  amis  tous  aiment  Marie 

Assume  that  the  clause  structure  for  English  and  French  is  (50). 
(5°)       (,pNP  I  ([^3g  not/pas])  [^p(Adv)V...]] 

Then,  since  verb  raising  to  I  is  prohibited  in  English,  the  English  1° element  must 
lower  to  V  in  the  syntactic  component. ^  Thus,  a  pre-Neg  verb  in  (46a)  is  illicit 
because  for  the  verb  'like'  to  end  up  derivationally  in  such  a  position,  it  has  to 
undergo  raising  to  Infl.  As  for  French,  since  it  allows  V-*  I  raising  in  finite 
clause,  the  opposite  effect  should  happen.  This  is  exactly  what  we  see  here. 
(46b)  is  well-formed.  Similarly,  (47a)  is  ruled  out  since  a  V-NP  inversion  always 
requires  verb  raising  to  COMP  via  Infl.  Again,  for  French,  such  V-raising  is 
legitimate  in  (47b).  The  English  sentences  (48a)  and  (49a)  are  ill-formed 
because  in  both  sentences  the  main  verb  raises  to  Infl  across  an  adverb  or  a 
quantifier  that  is  adjoined  to  VP,  assuming  the  quantifier  takes  the  scope 
position  of  an  adverb  here.  For  the  French  sentences  (48b)  and  (49b),  such 
verb  raising  to  Infl  is  again  well-formed  and  is  indeed  obligatory  as  shown  by 
the  ungrammaticality  of  (49d). 

I  want  to  argue  that  in  Chinese,  the  aspect  marking  by  the  syntactic 
morphemes  le  and  zhe,  though  not  guo,  pairs  with  their  English  counterparts. 
There  is  an  l-lowering  movement  in  the  syntactic  component  by  which  the 
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syntactic  affixes  le  and  zhe  attach  to  the  nnain  verb,  followed  by  an 
amalgamated  verb  raising  to  Infl  in  the  LF  component,  required  by  the  ECP,  as 
will  be  shown  in  the  later  discussion.  This  argument  is  supported  by  empirical 
data  from  this  language.  The  relevant  clause  structure  is  (37)  or  (38).  First,  let's 
consider  the  following  contrasts.  The  tests  of  negation,  V-NP  inversion,  adverb 
placement,  and  qualification  are  given  in  (51)-(54)  respectively. 

(51)  Negation 

a.  *Zhangmin  da     le        mei  Lihua 
Zhangmin     beat  PERF  NOT  Lihua 

b.  Zhangmin  mei  da  Lihua 
Zhangmin  NOT  beat  Lihua 
'Zhangmin  didn't  beat  Lihua.' 

(52)  Inversion 

a.  *da  le  Zhangmin  Lihua 
beat  PERF  Zhangmin    Lihua 

b.  Zhangmin  da  le  Lihua 
Zhangmin  beat  PERF  Lihua 
'Zhangmin  beat  Lihua.' 

(53)  Adverb  Placement 

a.  'Zhangmin  da     le         henhende  Lihua 
Zhangmin    beat  PERF  bitterly        Lihua 

b.  Zhangmin  henhende  da  le  Lihua 
Zhangmin  bitterly  beat  PERF  Lihua 
'Zhangmin  beat  Lihua  bitterly.' 

(54)  Quantification 

a.      *tamen  chang  zhe         dou    neizhi  ge 
they        sing   PROG     all      that   song 
tamen  dou  chang  zhe     neizhi    ge 


lamen  aou  cnang  zne      neizni    ge 
they     all    sing     PROG  that       song 
'They  are  all  singing  that  song.' 


A  priori,  with  a  clause  structure  of  (37)  or  (38),  verb  raising  across  the  Neg  in 
(51a)  proves  impossible,  parallel  to  the  situation  of  English  Illustrated  In  (46), 
with  the  Neg  P  mediating  between  the  Infl  and  the  predicate  VP.  Furthermore, 
since  verb  raising  to  Infl  Is  not  possible  in  Mandarin,  the  movement  of 
V->lnfl-»Comp  should  be  equally  disallowed.  The  ill-formed  V-NP  inversion  in 
(52a)  Is  explained  accordingly.  Similarly,  verb  raising  across  an  adverb  in  (53a) 
and  a  quantifier  in  (54a)  results  in  ungrammaticallty.  In  both  (53)  and  (54),  the 
aspect  marker  le  and  zhe  must  lower  to  the  main  verb  to  derive  the  correct  (53b) 
and  (54b).  Such  head  movement  of  l-lowering  is  shown  to  be  legitimate  in 
Mandarin. 


203 

Having  established  a  theoretiCr-J  basis  for  discussion,  let's  turn  to  the 
problem  of  Neg  -  Aspect  Marker  alternation.  Recall  that  for  the  main  verb  to  take 
aspect  marking,  the  1°  element  le  or  zhe  has  to  be  lowered  to  the  main  verb  in 
the  syntactic  component  and  the  amalgamated  verb  later  raises  in  the  LF 
component  to  cover  up  the  traces  left  by  the  lowering  of  an  1°  element  in  each 
derivational  stage,  thus  satisfying  the  ECP  which  requires  that  every  trace  left  by 
any  syntactic  movement  be  properly  governed.  Let's  first  consider  the  case 
when  a  Neg  is  not  present  in  the  derivation.  Consider  (55). 

(55)     a.  Zhangmin  da      le  Lihua 

Zhangmin  beat  PERF     Lihua 
'Zhangmin  has  beaten  Lihua.' 
b.  Zhangmin  du      zhe        neiben  shu 
Zhangmin  read  PROG    that       book 
'Zhangmin  is  reading  that  book.' 

Derivations  of  both  (55a)  and  (55b)  are  illustrated  in  (56)  with  1° -lowering 
to  V  in  the  syntactic  component  and  V-raising  in  the  LF  component. 


(56)     a.    S-structure  verb  amalgamation 

r 


|0 

[+/-  Aspect] 


VP 

I 


Lihua 
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|0 
[+/-  Aspect] 


VP 

I 

V 


zhe; 


du 


NP 


Shu 


b.        LP  component:  V-raising 

r 


|0 


VP 


NP 


le/zhe 


In  the  syntactic  component,  l°-lowering  in  (56a)  is  required  to  derive  (55a)  and 
(55b).  In  (56a),  the  1° element  lowers  to  V,  and  this  amalgamated  verb  G-marks 
and  Case-marks  the  complement  NP,  satisfying  the  projection  principle  and 
Case  theory.  In  the  LP  component  as  shown  in  (56b),  the  amalgamated  verb 
raises  to  Infl  as  required  by  the  ECP;  the  trace  left  by  the  l-lowering  in  syntax  is 
covered  up,  following  a  proposal  by  Chomsky  (1988).  The  amalgamated  verb  L- 
marks  the  VP  in  (56b)  since  it  m-commands  the  VP.  Thus,  VP  is  not  a  barrier. 
The  trace  left  by  the  head  movement  of  this  amalgamated  verb  is  properly 
governed.  The  ECP  is  satisfied  accordingly.  Since  this  is  an  LP  movement. 
Case  is  irrelevant,  and  the  trace  left  by  the  raised  V  retains  the  ability  to  e-mark 
the  complement.  This  explains  why  syntactic  affixation  for  aspect  marking  in  (55 
a,b)  is  legitimate  when  there  is  no  Neg  present. 

What  will  happen  when  a  Neg  is  indeed  present?  At  the  first  stage  of  the 
derivation,  the  l°-element  le  or  guo  will  have  to  move  across  the  head  Neg 
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mei(you)  to  be  amalgamated  with  the  vo  as  shown  in  (57),  which  causes  no 
violation  of  any  known  principle  since  the  head  of  NegP  doesn't  c-command  the 
trace  left  by  the  1°  element  le  or  zhe.  Also  recall  that  this  l-lowering  is  followed 
by  the  LF  raising  of  the  amalgamated  verb. 

(57) 


'©i/zhe: 


However,  it  is  in  the  second  derivational  stage  of  the  LF  component  that  an 
obvious  violation  of  the  Head  Movement  Constraint  is  incurred  because  the 
amalgamated  verb  has  to  move  across  an  intervening  head  Neg,  which  does  c- 
command  the  trace  of  the  amalgamated  verb. 


In  (58)  the  Neg  head  is  a  minimality  barrier  by  the  definition  of  (4)  above. 
Therefore  the  LF  raising  of  the  amalgamated  verb  causes  an  ECP  violation. 


(59)  a.      *  Zhangmin         mei(you)/bu       da         le         Lihua 
Zhangmin  not  beat    PERF  Lihua 

'Zhangmin  has  not   beaten  Lihua.' 
b.       *?Zhangmin        mei(you)  /bu       du     zhe      shu 
Zhangmin  not  read  PROG  book 

'Zhangmin  is  not  reading  (a)  book.' 
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As  illustrated,  the  derivation  of  1° -lowering  of  Infl  element  to  the  main  verb  must 
be  prohibited  when  an  overt  Neg  is  present  because  of  the  HMC,  and  ultimately 
of  the  ECP.  So,  the  verb  can  only  appear  in  its  bare  form  due  to  the  lack  of  a 
verb  amalgamation  process  discussed  above.  This  explains  exactly  why  Neg 
and  aspect  markers  must  be  in  complementary  distribution  in  Mandarin 
Chinese. 

The  fact  that  le  and  guo,  the  two  aspect  markers  in  Chinese,  behave 
differently  with  regard  to  the  Neg-Aspect  marker  alternation  confirms  our 
hypothesis  that  there  must  be  a  basic  syntactic  difference  between  these  two 
morphemes.  That  is,  while  le  is  a  syntactic  affix  whose  derivation  is  subject  to 
HMC,  guo  is  a  lexical  morpheme  whose  derivation  is  restricted  in  the  lexicon,  as 
I  have  claimed  earlier.  This  le  /guo  distinction  is  partially  supported  by  their 
distributional  properties  demonstrated  in  (33).  Under  the  present  hypothesis, 
the  le/guo  difference  is  exactly  what  we  will  expect.  As  the  theory  predicts,  guo 
does  not  undergo  l°-lowering  in  the  syntactic  component  as  le  and  zhe  do,  nor 
does  the  V+guo  undergo  LF  raising,  since  it  is  not  an  1°  element  under  the  Infl 
node,  and  the  ECP  requirement  is  irrelevant.  Therefore,  in  contrast  to  le  and 
zhe,  guo  can  co-occur  with  a  Neg.  if,  on  the  other  hand,  guo  is  not  considered 
lexical,  in  contrast  to  the  syntactic  le  ,  then  there  would  be  no  way  to  account  for 
the  apparent  difference  in  their  behavior,  except  by  ad  hoc  assumptions. 

The  Neg-Aspect  marker  alternation  in  Mandarin  construed  here  as  X° 
movement  mirrors  the  X""^"  movement  constraint  of  the  that-Xxace  effect  in 
English.  Just  as  an  intervening  COMP  element  thatb\ocWs  an  A'  movement  of  a 
phrasal  category  out  of  a  CP,  Neg  in  Chinese  blocks  the  LF  raising  of  an 
amalgamated  verb.  Consider  the  similarities  in  the  English  fftaMrace  effect  in 
(60a)  and  the  Neg-Aspect  marker  alternation  in  (60b). 


(60) 


*CP 


Spec 


f         C°  IP  V°+le  Neg  VP 


A 


that  t 

S-Structure  component 


mei(you)         t 
LF  component 
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This  comparison  provides  a  strong  argument  against  treating  the  Neg- 
Aspect  marl<er  alternation  as  a  language-specific  phenomenon,  whether  in  a 
syntactic  approach  as  in  Wang  (1965)  or  in  a  semantic  approach  as  in  Huang 
(1988).  Clearly,  as  has  been  shown  in  this  study,  this  phenomenon  falls  under 
the  HMC  and  ultimately  the  ECP  in  universal  grammar  as  we  would  expect. 

4.  Conclusion 

The  two  head  movement  phenomena  of  qu  permutation  and  Neg-Aspect 
fvlarker  alternation  provide  additional  evidence  for  the  universality  of  the 
principle  of  the  Head  Movement  Constraint,  and  ultimately  the  ECP.  We  have 
seen  that  a  number  of  syntactic  phenomena  in  Chinese  are  nicely 
accommodated  in  one  unified  account  under  this  general  condition.  Theory- 
internally,  this  study  argues  for  dual  conditions  of  HMC:  a  locality  requirement 
and  a  structural  c-command  relation  between  an  X°  head  and  its  trace,  and  it 
provides  further  evidence  for  a  two  stage  derivation  for  aspect  marking,  namely, 
the  lowering  of  Infl  element,  in  the  syntactic  component  and  verb  raising  in  the 
LF  component,  in  Mandarin  Chinese. 


NOTES 

^  Portions  of  this  paper  were  presented  at  the  annual  conference  of  the 
Linguistics  Society  of  America  in  December  1988  in  New  Orleans.  Thanks  are 
due  to  C.C.Cheng,  P.  Cole,  T.  Ernst,  G.  Hermon,  H.  H.  Hock,  J.  McCawley,  D. 
Wible,  and  J.  Yoon  for  helpful  comments  and  conversations  on  various  issues 
related  to  this  paper.  All  errors  are,  of  course,  mine. 

^  There  have  been  various  theories  as  to  how  the  ECP  is  satisfied.  Aoun  et 
al.  (1987)  argue  that  head  government  is  the  necessary  condition  of  the  ECP, 
abandoning  a  more  traditional  approach  of  a  disjunctive  ECP.  Chomsky  (1988) 
seems  to  propose  a  similar  solution.  Here  I  am  taking  the  more  traditional  view 
by  assuming  a  disjunctive  ECP  as  is  done  in  the  Barriers  framework. 

^  There  are  cases  when  the  verb  qu  is  seemingly  permutable,  yet  qu  is 
linked  to  the  subject  of  the  embedded  clause,  not  the  subject  of  the  matrix 
clause. 

(i)  a.  wo  jiao    ta     shangban  qu 

I     ask  him  work  go 

b.  wo  qu     jiao  ta       shangban 

I      go    ask  him    work 
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The  fact  is  that  such  structures  are  not  really  free  permutations  since  these 
two  sentences  differ  semantically.  Thus  they  should  be  considered  as  having 
two  separate  syntactic  structures.  The  semantic  interpretations  for  a.)  and  b.)  in 
(i)  are  indicated  by  their  translations  in  (ii). 

(ii)       a.  I  (am  going  to)  ask  him  to  go  to  work.  (I  won't  go) 

b.  I  (am  going  to)  go  to  ask  him  to  work.  (I  will  go) 

'^  Huang  (1982)  assumes  that  such  adjunct  clauses  in  Mandarin  are 
adjoined  to  VP.  Since  both  proposals  here  make  the  same  prediction  for  the 
problem  under  discussion,  I  will  ignore  the  details  concerning  the  potential 
differences  between  these  two  proposals. 

In  Zhou  (1988a),  I  give  evidence  to  show  that  the  Chinese  NP  is  also 
head-initial.  The  surface  head-final  nominative  structures  are  derived  from  the 
D-structure  head-initial  configuration  by  an  obligatory  movement  of  the 
complement  XP  to  the  spec  position  of  a  DP. 

The  syntactic  function  and  the  semantic  interpretation  of  the  particle  word 
le  vary  according  to  different  contexts,  it  has  been  generally  observed  that  le, 
besides  being  an  aspect  marker,  can  also  be  used  as  a  sentence  particle  that 
appears  in  sentence-final  position,  expressing  an  exclamation  or  indicating  a 
special  mood  on  the  part  of  the  speaker.  Many  authors  even  regard  these  two 
/es  as  two  separate  morphemes  which  happen  to  be  phonologically  identical. 
Naturally,  the  Neg-Aspect  marker  {le )  alternation  constraint  will  not  apply  to  this 
second  type  of  le  as  shown  in  (i)  and  (ii). 

(i)  che  bu      lai         le 

bus  not       come    EXC. 
The  bus  will  not  come!' 

(ii)         ta  bu      mai       le 

he  not     buy       PART. 

'He  doesn't  want  to  buy  (it)  any  more.' 

As  pointed  out  correctly  by  C.C.  Cheng  (personal  communication), 
though  y+guo  compounds  are  free  of  the  Neg-Aspect  marker  alternation 
constraint  as  shown  in  (40),  only  the  Neg  meiyou,  not  bu,  can  be  used  to  negate 
a  y+guo  compound.  Consider  the  following  example. 

ta        meiyou  /*bu      mai    guo    shu 
he       not  buy    PERF  books 

Thus,  it  seems  that  there  is  a  further  selectional  restriction  on  the  use  of  a 
specific  negation  morpheme  together  with  a  y+guo  compound.  It  is  therefore 
plausible  to  assume  that  in  Mandarin,  meiyou  is  used  only  in  the  negation  of  a 
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past  or  perfective  action,  while  bu  is  used  exclusively  in  negating  a  present  or 
future  action. 

®  There  is  a  slight  difference  between  le  and  guo,  though  both  denote  a 
completion  of  a  certain  action,  le  usually  carries  a  connotation  of  completing  an 
action,  while  guo  usually  stresses  the  experience  of  having  done  something. 

^  The  only  exceptional  V-raisers  are  the  semi-auxiliaries  have  and  be, 
which,  according  to  Pollock,  lack  the  property  of  0-marking  their  complement. 
So  they  can  raise  to  Infl  and  then  to  Comp,  like  the  main  verb  in  the  French  finite 
clause. 


REFERENCES 


AOUN,  J.,  N.  Hornstein,  D.  Lightfoot,  and  A.  Weinberg.  1987.  Two  types  of 

locality.  Linguistic  Inquiry  18.537-578. 
AOUN,  J.,  and  D.  Sportiche.   1982.  On  the  formal  theory  of  government. 

Linguistic  Review  2.  211-236 
BAKER,  M.  1988.  Incorporation:  A  theory  of  grammatical  function  changing. 

Chicago:  The  University  of  Chicago  Press. 
CHOMSKY,  N.  1986.  Barriers.  Cambridge,  Massachusetts:  MIT  Press. 
CHOMSKY,  N.  1988.    Some  problems  in  the  theory  of  government.  [Lecture 

given  at  the  University  of  Illinois  at  Urbana-Champaign] 
HUANG,  C.-T.  J.  1982.  Logical  relations  in  Chinese  and  the  theory  of  grammar. 

MIT  dissertation  in  Linguistics. 
HUANG,  C.-T.  J.   1988.   Wo  pao  de  kuai  and  Chinese  phrase  structure. 

Language  64:2.274-311. 
KOOPMAN,  H.  1984.  The  syntax  of  verbs:  From  verb  movement  rules  in  the  Kru 

languages  to  universal  grammar.  Dordrecht:  Foris. 
LARSON,  R.K.  1988.  On  the  double  object  construction.  Linguistic  Inquiry 

19.335-392. 
LI,  A.  1985.  Abstract  Case  in  Chinese.  USC  Dissertation  in  Linguistics. 
POLLOCK,  J.K.  1987.  Verb  movement,  UG,  and  the  structure  of  IP.  Universite  de 

Haute  Bretagne.  MS. 
LU,  Jianming.   1988.  guanyu  'qu+VP'  he  'VP+qu'  [On  qu+VP  and  VP+qu]. 

Proceedings  of  the  First  International  Symposium  on  Chinese  Language 

Teaching.  20-29.  Beijing:  Beijing  Language  Institute  Press. 
RAPOSO,  E.  1987.  Case  theory  and  Infl-to-Comp:  The  inflected  infinitive  in 

European  Portuguese.  Linguistic  Inquiry  18:85-110. 
RIZZI,  L.  1987.  Relativized  minimality.  Universite  de  Geneve.  MS. 
ROTHSTEIN,  S.  1983.  The  syntactic  forms  of  predication.  MIT  dissertation  in 

Linguistics. 
TRAVIS,   L.   1984.   Parameters  and  effects  of  word  order  variation.   MIT 

dissertation  in  Linguistics. 


210 

1987  Parameters  of  phrase  structure,  and  V2  phenomena.  McGill  University. 

MS. 
WANG,  William,  S.-Y.  1965    Two  aspect  markers  in  Mandarin.    Language 

41.457-471. 
WILLIAMS,  E.  1980   Predication.  Linguistic  Inquiry  11.208-38. 

1982.  The  NP  cycle.  Linguistic  Inquiry  13.227-96. 

1987.  Implicit  arguments,  the  binding  theory,  and  control.  NLLT  5.152-196. 

ZHOU,  Xinping.  1988a.     Is  Chinese  a  head-final  language?  [To  appear  in 

Proceedings  of  the  third  OSU  conference  on  Chinese  linguistics,  eds.  by 

M.Chan  and  T.  Ernst.  Bloomington:  Indiana  University  Linguistics  Club.] 

1988b.  X°  movement,  parametric  AGR  and  Chinese  auxiliaries. 

University  of  Illinois  at  Urbana-Champaign.  MS. 


studies  in  the  Linguistic  Sciences 
Volume  18,  Number  2.  Fall  1988 


FINITENESS    IN    DRAVIDIAN* 

REVIEW   ARTICLE 

Sanford  B.  Steever.  The  serial  verb  formation  in  the  Dravidian  lan- 
guages. (MLBD  Series  in  Linguistics,  IV.)  Delhi:  Motilal  Banarsi- 
dass,  1988.     (ISBN:  81-208-0378-7;   Rs.  100/--.)     pp.  xv,  135. 

Hans  Henrich  Hock 


O.  Although  its  title  may  suggest  something  quite  different,  the  monograph 
under  review  presents  the  most  systematic  investigation  to  date  into  the  ques- 
tion of  finiteness  in  Dravidian.  Steever  (S)  presents  a  wealth  of  data  from  a 
large  sampling  of  Dravidian  languages,  and  his  conclusions  provide  significant 
challenges  not  only  for  comparative  Dravidian  linguistics,  but  for  general 
syntactic  typology  and  for  the  issue  of  South  Asian  convergence.  For  reasons 
of  space,  the  following  discussion  will  concentrate  on  S's  arguments  and  their 
empirical  foundation  in  Dravidian.  I  will  only  briefly  address  the  question  of 
general  linguistic  typology  by  proposing  an  alternative  analysis,  within  a  larger, 
crosslinguistic  framework.  Discussion  of  the  implications  for  South  Asian  con- 
vergence will  be  reserved  for  another  occasion. 

1.1.  According  to  conventional  wisdom,  the  structure  of  the  Dravidian 
languages  is  of  the  'strict  OV  type:  Sentences  contain  just  one  finite  verb  which 
is  invariably  placed  at  their  right  margin.  Embedded  structures  cannot  contain 
finite  verbs,  but  must  be  construed  with  non-finite  forms,  such  as  verbal  nouns, 
infinitives,  relative  participles,  or  absolutives  (the  latter  also  known  as  conjunc- 
tive participles,  verbal  participles,  or  gerunds).  This  view  of  Dravidian  sentence 
structure  is  captured  particularly  well  in  Caldwell's  remarks  on  the  absolutive: 

In  the  Dravidian  languages,  though  nouns  and  pronouns  are 
united  by  means  of  conjunctions,  finite  verbs  are  never  so  united.  In 
every  sentence  there  is  but  one  finite  verb,  which  is  the  last  word  in 
the  sentence,  and  the  seat  of  government;  and  all  the  verbs  which 
express  subordinate  actions  or  circumstances  ...  assume  an  inde- 
terminate continuative  character,  as  verbal  participles  or  gerundials, 
without  the  need  of  conjunctions  or  copulatives  of  any  kind;  so  that 
the  sense  ...  waits  in  suspense  for  the  authoritative  decision  of  the 
final  governing  verb.    (1875(1 91 3):488) 

For  similar  sentiments  see  for  instance  Spencer  1950  and  Winfield  1928.   The 
major  concession  traditionally  made  is  that  the  geographically  more  northern 
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Dravidian  languages  permit  sentences  with  more  than  one  finite  verb,  a  type  of 
structure  attributed  to  Indo-Aryan  (or  in  the  case  of  Brahui,  Iranian)  influence. 

Let  us  characterize  as  the  Strict  OV  Constraint  the  (claimed)  gener- 
alization that  Dravidian  permits  only  one  finite  verb,  at  the  right  margin  of  the 
sentence,  and  encodes  all  embedded  structures  by  non-finite  means. 

1.2.  Now,  if  taken  seriously,  the  claim  that  Dravidian  syntax  is  governed 
by  the  Strict  OV  Constraint  mns  into  a  number  of  difficulties. 

The  most  striking  of  these  is  found  in  quotative  structures  whose  existence 
is  acknowledged  by  all  Dravidianists  and  which  are  considered  a  hall-mark  of 
Dravidian  syntax.  In  fact,  similar  structures  in  Sanskrit  and  many  modern  Indo- 
Aryan  and  fvlunda  languages  are  commonly  attributed  to  Dravidian  influence. 
(Cf.  e.g.  Kuiper  1967,  but  see  also  Hock  1975,  1982,  1984.)  For  an  example 
see  (1).  Here,  in  addition  to  the  finite  verb  ninaikkiren  'I  think',  which  occurs  at 
the  right  margin  of  the  sentence  as  predicted,  we  have  a  second  finite  verb, 
vantan  'he  came',  occurring  at  the  end  of  the  cited  discourse. 

(1)  nan  avan  ihke         vantan  en-ru     ninaikkiren  (Tamil) 
'I'      'he'  'here'        past  3m. s      Q          pres.ls 

'I  think  (that)  he  has  come  here.' 

Now,  when  linguists  consider  structures  of  this  type  to  be  eminently 
Dravidian,  they  focus  on  the  quotative  marker  en-ru  lit.  'saying/having  said' 
which,  being  an  absolutive  of  the  verbal  root  en-  'say  (so)',  conforms  perfectly  to 
the  Strict  OV  Constraint.  What  is  overlooked  is  the  finite  form  which  precedes 
en-ru  and  which  because  of  its  occurrence  in  the  same  sentence  as  finite 
ninaikkiren  must  be  a  violation  of  the  Constraint. 

1.3.  Another  difficulty  lies  in  complex  sentences  of  \^<-  'ype  (2).  Biclausal 
structures  of  this  type,  commonly  referred  to  as  relativ^^  correlative  clauses, 
consist  of  a  relative  clause  containing  an  interrogative  pronoun  which  serves  as 
a  relative  pronoun,  followed  by  a  main  clause  introduced  by  a  'correlative' 
demonstrative  pronoun.  What  is  important  is  that  as  illustrated  in  (2),  both  of 
these  clauses  contain  a  finite  verb,  thus  constituting  a  second  violation  of  the 
Strict  OV  Constraint. 

(2)  evan  nanraka  u.raikkiran  -o        avan  varkkaiyil  munneruvan  (Tamil) 
'who'  pres.Sm.s  Clit.    'that'  fut.3m.s 

'Who  works  hard  will  progress  in  life.' 

Relative-correlative  structures  have  generally  been  considered  an  embar- 
rassment (e.g.  Caldwell  1875(1 91 3):521),  and  there  have  been  repeated  at- 
tempts (e.g.  Sastri  1934,  Narasimhia  1941,  Burrow  &  Bhattacarya  1970,  Pillai 
1973,  Nadkarni  1975,  Sridhar  1981)  to  attribute  them  to  the  influence  of  Indo- 
Aryan  which  offers  similar  constructions;  cf.  e.g.  (2'). 
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(2')    yo  yatnena  karyam    karoti        sa    sarhsare    pracalisyati    (Sanskrit) 
'who'  pres.Ss     that  fut.3s 

'Who  works  hard,  will  progress  in  life.' 

However,  as  Ramasamy  (1981)  has  shown,  structures  of  the  type  (2)  are 
attested  in  the  earliest  Dravidian  (Old  Tamil)  texts,  and  Lakshmi  Bai  (1985),  ap- 
parently without  being  aware  of  Ramasamy's  paper,  has  demonstrated  that  the 
type  is  attested  throughout  the  Dravidian  family.  Both  scholars  conclude  that 
the  structures  are  indigenous  to  Dravidian,  not  borrowings  from  Indo-Aryan. 

1.4.  A  third  problem,  perhaps  the  most  striking  and  egregious  violation  of 
the  Strict  OV  constraint,  comes  from  structures  of  the  type  (3)  with  two  finite 
verbs,  where  the  Strict  OV  Constraint  would  require  the  first  verb  to  be  an  ab- 
solutive.''  This  type  was  known  to  the  ancient  Tamil  grammarians  who  referred 
to  the  first  verb  of  such  structures  as  murreccam  'a  non-finite  verb  (expressed 
as)  a  finite  verb';  cf.  Natarajan  1977:182.  Strangely,  before  S's  monograph,  the 
significance  of  this  type  does  not  seem  to  have  been  generally  appreciated. 
(But  note  Hock  1984:98.) 

(3)  (a)     celvem      allem  (Old  Tamil) 

fut.lp        neg.lp 
'We  will  not  go.' 
(b)     iraincinem    varttinem  (Old  Tamil) 

past  1  p         past  1  p 
'We  worshiped  (and)  lived.  =  We  lived  worshiping.' 

1.5.  Finally,  difficulties  arise  since  due  to  extraposition,  the  finite  verb 
does  not  always  occur  at  the  right  margin,  contrary  to  the  predictions  of  the  Strict 
OV  Constraint.  Some  publications  downplay  the  extent  of  extraposition  in  Dra- 
vidian; cf.  e.g.  Steever  1987b:741  who  admits  nominal  extraposition  only  for 
subjects.  (But  see  3.1  below  for  extraposition  of  embedded  structures.).  Others 
frankly  acknowledge  it;  cf.  Beythan  1943:177  for  Tamil,  Subbarao  1972,  1974 
(also  1984:119)  for  Telugu,  Emeneau  1 955(1 961  ):1 18  for  Kolami,  and  especi- 
ally Sridhar  1989:passim  for  Kannada.  The  examples  in  (4)  below  include 
instances  of  direct  object  extraposition  (4b)  and  even  of  multiple  extraposition 
(last  example).  See  sections  3.1  and  3.3  below  for  extraposition  of  embedded 
structures  with  absolutives  and  quotatives. 

(4)  (a)    akkal    varuvar     en-k  katal6r-e    (Kuruntokai  277)      (Old  Tamil) 

'then'    fut.3m.h  S 

'My  lover  will  come  at  that  time  ..." 

(b)  kanten  antiyai  (Tamil) 
past  1  s      DO 

'I  have  seen  the  mendicant.' 

(c)  eto  ...etuttukontu  poka  citrapatatte  (Old  Malayalam) 
'this'  'painted'  imp.  DO  (from  Pillai  1973:164) 
Take  away  this  painted  picture  ...  !' 
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(d)     avanu  pustaka  kotta  nanage  (Colloquial  Kannada) 

S         DO         V       10 
'He  gave  a  book  to  me."  (emphatic) 
?avanu  nanage  kotta  pustaka 

S  10  V       DO 

'He  gave  me  the  book."  (emphatic) 
kotta  avanu  avanige  eradu  ettu 
V       S         10  DO 

'She  gave  him  two  beatings!'  (dramatic) 

The  following  specific  observations  hold  for  Kannada.    (Comparably  complete 
observations  for  other  Dravidian  languages  are  not  available  to  me.) 

(a)  Direct  objects  extrapose  least  easily.  (Emeneau's  statistics  for  Ko- 
lami  (1 955(1 961  ):1 18)  likewise  show  more  examples  of  subject  extraposition 
than  for  oblique-NP  extraposition.) 

(b)  Extraposition  does  not  occur  freely  in  formal  and  situationally  neutral 
contexts.  It  is  this  feature  which  probably  is  responsible  for  the  fact  that  struc- 
tures with  extraposition  do  not  occur  easily  in  the  fairly  formal  and  artificial 
setting  of  field-work  elicitations. 

(c)  Extraposition  occurs  more  freely  in  various  types  of  traditional  poetry. 

(d)  Extraposition,  even  of  multiple  constituents,  is  quite  common  in  col- 
loquial situations. 

2.1,  Steever's  monograph  constitutes  the  first  attempt  known  to  me  at  a 
systematic  account  for  structures  of  the  type  (1)  -  (3).  not  as  reflecting  Indo- 
Aryan  influence,  or  as  embarrassments,  but  as  firmly  embedded  in  the  indig- 
enous grammatical  system  of  Dravidian.  Such  an  attempt  naturally  requires 
giving  up  the  conventional  wisdom  on  Dravidian  sentence  structure.  However, 
Steever  attempts  to  maintain  the  spirit  of  that  wisdom  by  replacing  the  Strict  OV 
Constraint  with  a  more  sophisticated  understanding  of  finiteness. 

2.2.  Chapter  1  sets  the  background  by  providing  sketches  of  Dravidian 
morphology  and  syntax  and  by  introducing  a  crucial  distinction  between  mor- 
phological and  syntactic  finiteness:  While  morphologically,  only  verbs  can  be 
finite,  S  extends  the  syntactic  definition  of  finiteness  to  nominal  predicates, 
which  in  much  of  Dravidian  exhibit  agreement  with  their  subjects,  cf.  e.g.  (5).  He 
consequently  states  his  subsequent  syntactic  generalizations  in  terms  of  'pred- 
icates' (both  verbal  and  nominal),  not  'verbs'.  This  enables  him  to  make 
generalizations  about  finiteness  without  having  to  state  each  time  that  the 
rightmost  'head'  of  the  sentence  may  be  a  noun  or  a  verb.  Moreover,  defining 
syntactic  finiteness  on  functional,  rather  than  morphological  grounds  permits 
him  to  extend  his  generalizations  to  Malayalam  whose  verbs  are  not  marked  for 
agreement  and  might  thus  be  considered  morphologically  non-finite. 

(5)     avar  aciriyar  (Tamil) 

'he'  Nom.m.h    Nom.m.h 
'He  (is  a)  teacher.' 
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2.3.  Chapter  2  deals  with  structures  of  the  type  (1)  and  (2).  Atter  review- 
ing the  attestations  and  syntactic  behavior  of  these  structures  in  a  broad  selec- 
tion of  Dravidian  languages,  S  argues  that  their  embedded  finite  predicates  are 
licensed  by  a  limited  set  of  lexical  items  that  include  the  quotative  verb  en-  in  (1) 
and  clitic  particles  like  the  -o  in  (2).  Finally,  he  notes  that  in  addition  to  the 
prototypical  embedding  verbs  *en-  'say,  think'  and  'a-  'become',  other  (verbal 
and  nominal)  predicates  of  perception,  cognition,  communication,  and  compar- 
ison tend  to  license  embedded  structures  with  finite  predicates. 

According  to  S,  verbs  like  *en-  and  *a-  are  permitted  to  act  in  this  manner 
because 

In  Lakoff's  (1970)  terms,  they  are  negative  absolute  exceptions 
to  case-marking.  Since  the  theory  of  case-marking  is  responsible  for 
the  morphological  interpretation  of  direct  objects,  and  not  the  theory  of 
finiteness,  the  verbs  *a-  'become'  and  *en-  'say,  think'  shield  their 
direct  objects  from  the  application  and  effects  of  Rule  (1)  [the  rule 
assigning  finiteness].  And  since  these  two  verbs  are  negative  ab- 
solute exceptions  to  case-marking,  they  need  not  subcategorize  ob- 
jects that  are  subject  to  case-marking,  viz.  nouns,  but  may  take  ex- 
pressions of  any  category  and  form.  This  flexibility  in  subcategoriz- 
ation  permits  these  two  verbs  to  combine  with  an  object  that  contains 
a  formally  [i.e.  syntactically]  finite  predicate.  (20) 

Nominal  and  verbal  predicates  which  have  the  same  properties  as  en-  are 
classed  with  it  as  'Finite  Predicate  Embedding  Predicates'  (FPEPs),  and  sim- 
ilarly-acting clitics  are  labeled  'Finite  Predicate  Embedding  Clitics'  (FPECs). 

S  then  proposes  a  rule  (p.  36)  which  permits  predicates  to  be  finite  (i)  in 
the  'root'  (or  matrix)  clause,  and  (ii)  in  embedded  structures  c-commanded  by 
FPEPs  and  FPECs  (such  as  en-  and  -o).  Elsewhere  predicates  are  non-finite. 
Since  these  structures  occur  in  all  Dravidian  languages  and  in  the  earliest 
Dravidian  texts,  S  argues  that  they  must  be  reconstnjcted  for  Proto-Dravidian. 

2.4.  The  remainder  of  the  monograph,  viz.  Chapters  3  -  7,  takes  up  stnjc- 
tures  of  the  type  (3),  which  S  calls  'Serial  Verb'  (SV)  formations  and  defines 
as  structures  'in  which  two  or  more  finite  verbs  occur  together,  marking  agree- 
ment among  themselves'  (1).  To  understand  S's  monograph,  it  is  important  to 
understand  —  and  accept  —  this  definition  which  as  S  acknowledges,  differs 
from  other  uses  of  the  term  'serial  verb'. 

Chapter  3  sets  the  stage  for  the  discussion  of  SVs  and  examines  in  detail 
the  evidence  of  the  South  Dravidian  languages  Tamil,  Kota  and  Toda,  Kanna- 
da,  and  Kodagu.  Chapter  4  looks  at  the  the  South-Central  languages  Telugu. 
Muria  Gondi,  Pengo,  and  Kuvi.  Chapter  5  investigates  the  Central  languages 
Parji,  Kolami,  and  Ollari.  Chapter  6  considers  the  North  Dravidian  languages, 
Kurukh,  Malto,  and  Brahui.  And  Chapter  7  presents  a  summary,  conclusions. 
and  'prospects  for  the  future'. 
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According  to  S,  the  prototypical  SV  construction  is  the  'compound  type'  in 
(3a)  repeated  below  as  (6a).  In  one  subtype,  illustrated  in  (6a),  the  final  verb 
functions  as  an  auxiliary.  Other  types  are  the  following:  (i)  The  'coordinate' 
type  in  (3b)  repeated  as  (6b),  which  encodes  two  coordinate  or  sequential  ac- 
tions; (ii)  the  'balance'  type  in  (6c),  similar  to  (6b),  but  signalling  an  action 
common  to  the  two  conjoined  actions;  (iii)  the  'iterative'  type  in  (6d)  in  which 
repetition  of  the  verb  signals  repeated  or  continuous  action;  and  (iv)  the  'echo' 
formation  in  (6e).  The  latter  construction,  open  also  to  non-predicates,  is  mor- 
phologically similar  to  the  iterative  one,  except  that  the  second  occurrence  of 
the  verb  has  its  first  syllable  replaced  by  ki-  in  Tamil,  gi-  in  Kannada,  etc.  In 
Tamil,  this  structure  tends  to  have  negative  connotations,  similar  to  the  Yiddish- 
English  type  syntax-shmyntax.  In  other  languages,  it  can  be  glossed  by  some- 
thing like  'He  is  hungry  etc'  =  'He  is  hungry  and  thirsty  (...)'. 

(6)    (a)    celvem      allem  (Old  Tamil) 


(b)     iraincinem    varttinem  (Old  Tamil) 


celvem 

allem 

fut.lp 

neg.lp 

'We  will  not  go.' 

iraincinem    varttinem 

past  1  p 

past.  1  p 

■We  worshiped  (and)  lived.' 

...  vizu 

aya  kulur  poti      1 

'all' 

'that  crane  bird'  i 

'We  lived  worshiping.' 

(c)  ...vizu     aya  kulur  poti      tinad  unad  (Konda) 
past  3n.s         past  3n.s 

'eat'  'drink' 

'  ...  that  crane  consumes  everything.' 

(d)  nan  even         even         ayya  (Medieval  Kannada) 
I        tut.  1s        tut.  1s 

'O  lord,  I  will  keep  entreating  (you).' 

(e)  avanukku-p      pacikkiratu        ki-cikkiratu  (Tamil) 
'to  him'             pres.Sn.s          pres.Sn.s 

'He  is  always  hungry,  or  some  damn  thing!'  (S's  gloss) 

After  careful  examination  of  the  comparative  and  historical  evidence,  S 
argues  that  the  SV  construction  and  all  of  its  subtypes  must  be  postulated  for 
Proto-Dravidian.  Though  there  may  be  difficulties  in  details,  S's  demonstration 
is  highly  convincing.  The  two  most  persuasive  arguments  are  the  following: 

(i)  The  type  (6a/b),  is  found  in  Old  Tamil  but  has  disappeared  in  Modern 
Tamil  and  the  closely  related  Malayalam.  This  historical  development  is  more 
compatible  with  a  hypothesis  that  regards  the  SV  formation  as  ancient  and 
inherited  than  one  which  considers  it  an  innovation. 

(ii)  If  the  'auxiliary'  subtype  is  reconstructed  for  Proto-Dravidian,  then  we 
can  account  for  the  phono-morphological  peculiarities  of  a  number  of  verbal 
structures  in  Kota  and  Toda  (p.  52-3),  Konda  (72-77),  Pengo  (77-83),  and  even 
in  Classical  and  Srilankan  Tamil  (84-6).  (For  a  fuller  discussion,  see  Steever 
1987a.)  Without  such  a  reconstruction,  the  history  of  these  structures,  and  (in 
Pengo)  even  their  synchronic  behavior,  remains  obscure. 
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While  the  SV  formation  shares  with  the  structures  in  (1)  and  (2)  the  fact  that 
in  S's  view  it  must  be  reconstructed  for  Proto-Dravidian,  its  grammatical  account 
is  different:  S  argues  that  in  SV  structures  only  the  rightmost  verb,  the  'head',  is 
syntactically  finite,  while  the  remainder  receives  finite  marking  by  morphological 
agreement  with  the  head  (112). 

3.  The  significance  of  S's  work  for  typological  studies  in  general  and  for 
studies  of  South  Asian  convergence  in  particular  cannot  be  underestimated;  for 
it  presents  a  picture  of  early  and  Proto-Dravidian  finiteness  that  is  markedly 
different  from  the  traditional  Strict  OV  Constraint. 

This  is  not  to  say  that  no  difficulties  remain.  In  the  following  I  will  discuss  a 
few  of  these.  I  will  also  argue  that  an  alternative  account  is  worth  considering, 
an  account  which  attempts  to  place  the  question  of  finiteness  into  a  larger  cross- 
linguistic  context. 

3.1.  Let  me  start  with  a  relatively  minor  problem:  S  offers  no  explicit  ac- 
count of  nominal  constituent  extraposition  of  the  type  (4)  above.  True,  he 
acknowledges  the  existence  of  absolutive  extraposition  of  the  type  (7)  below. 
And  in  the  discussion  of  that  phenomenon  he  raises  the  possibility  of  omitting 
all  references  to  linear  order  and  to  'rely  solely  on  command  relations  to  identify 
the  unique  finite  predicate  in  the  sentence'  (17).  However,  given  the  traditional 
view  of  Dravidian  syntax,  an  explicit  discussion  of  constituent  extraposition 
would  have  been  desirable.  Moreover,  it  would  have  been  helpful  if  S  had 
provided  an  explicit  formulation  of  a  non-linear  account  of  Dravidian  finiteness. 

(7)  civa  pujai  kuta  ceykirar,    titcai  etuttu  kontu  (Tamil) 

pres.3m.s         abs.     abs. 
"He  even  performs  a  Siva  puja,  having  taken  (a  vow).' 

3.2.  A  related  problem  lies  in  the  fact  that  S's  account  of  the  SV  formation 
appears  to  make  the  implicit  assumption  that  the  members  of  the  structure  must 
be  adjacent.  But  as  he  acknowledges  in  Chapter  3,  note  3  (p.  125),  Old  Kan- 
nada  offers  two  examples  of  the  echo-subtype,  but  with  non-adjacent  finite 
verbs;  cf.  e.g.  (8).  Interestingly,  it  is  not  the  verb  that  has  undergone  (p/-)echo- 
formation,  but  its  complement.^  Even  so,  the  verb  is  repeated  —  in  morpho- 
logically finite  form  —  along  with  the  echoed  complement. 

(8)  bomman  embavana  kane       gi-mman  embavana  kane 
'Bomma'  en-VN.       neg.ls    'Gi-mma'  en-VN        neg.ls 

'I  do  not  see  =  I  acknowledge  anyone  named  Bomma,  named 
Shmomma'. 

If  no  other  constructions  of  this  type  were  attested  in  Dravidian,  these 
examples  might  be  dismissed  as  isolated  aberrations.  However,  note  the  sim- 
ilar passages  in  (9),  cited  by  S  on  pages  70,  78,  and  86,  but  without  further 
discussion  of  the  question  of  non-adjacency.   These  suggest  that  the  pattern  Is 
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more  widespread.    Still,  structures  of  this  type  could  be  taken  care  of  by  form- 
ulating the  mies  for  SV  constructions  without  appeal  to  adjacency. 


(9)     (a) 


(b) 


(Konda) 


(Pengo) 


(c) 


mera  atad  mide         atad 

'storeyed  past  3n.s  'terraced  past  3n.s 

building'  building' 

'A  mansion  became  (his).' 

iver  nar  ubjatarna         jarman      atarna 

'these  men'       pres.perf.  'birth'        pres.perf. 

3m. p  3m. p  'become' 

'These  men  have  been  born.' 

pav     patti        cendar         or       patti       cendar  (Parji) 

'path'  'taking'    past  3m. p    'path'  'taking'  past  3m. p 
'They  went  by  the  path.' 

The  difficulties,  however,  seem  to  extend  farther  and  to  bear  on  more 
central  aspects  of  Dravidian  syntax,  including  quotatives  and  coordination. 

3.3.  Let  us  first  examine  the  question  of  quotative  structures.  Here,  the 
commonly  held  view,  based  on  the  evidence  of  the  South  Dravidian  languages, 
is  that  quotative  marking  is  obligatory  for  embedded  (direct)  discourse  and  that 
this  marking  is  accomplished  by  the  verb  *en-  'say  so',  either  in  its  finite  form  or 
in  various  non-finite  formations.  Of  the  non-finite  forms,  the  absolutive  is  per- 
haps the  most  important,  since  it  is  said  to  be  required  if  the  discourse  is  intro- 
duced by  verbs  of  speaking  other  than  *en-;  cf.  (10a).  On  the  other  hand,  the 
occurrence  of  a  finite  form  of  *en-  precludes  the  use  of  the  absolutive  in  the 
same  structure;  cf.  (lOc/d).  Finally,  like  other  constructions  with  absolutives, 
embedded  discourse  marked  by  the  absolutive  of  en-  may  be  extraposed;  cf. 
(lOd).  Beythan  (1943:204),  discussing  Tamil,  labels  such  extraposition  as  rare. 
However,  Sridhar  (1989,  §1.1.1.1)  considers  it  perfectly  natural,  as  long  as  the 
absolutive  form  of  en-  follows  the  extraposed  discourse. 


(10)  (a) 


(b) 


(c) 


(d) 


nan 

avan 

ihke      vantan 

en-ru 

ninaikkiren 

past  3m. s 

0 

pres 

.Is 

'1  think  that  he  has  come  here.' 

nan 

avan 

ihke      vantan 

past  3m.s 

enkiren 
pres'.ls 

'1  say 

that  he  has  come  here.' 

*nan 

avan 

ihke    vantan 

en-ru 

enkiren 

past  3m.s 

o" 

pres 

.Is 

'1  say 

that  he  has  come  here.' 

nan 

appo.rute  connen-e  apatt'aka  mutiyum 

en-ru 

past  Is 

Q. 

'1  said  it  then 

:  "It  will  end  dangerously."' 

(Tamil) 


(Tamil) 


(Tamil) 


(Tamil) 


As  noted  in  Hock  1982:75,  in  a  number  of  geographically  more  northern 
languages  of  the  South-Central  and  Central  Dravidian  subgroups,  we  find  a 
rather  different  situation.    Cf.  the  texts  from  Kolami  in  Emeneau  1955(1961), 
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from  Kuvi  in  Israel  1979,  and  from  Pango  In  Burrow  &  Bhattacarya  1970,  from 
which  the  examples  in  (11)  below  have  been  taken. 

First,  (direct)  discourse  is  more  commonly  extraposed  than  not.  And  in 
such  extraposed  structures,  a  finite  form  of  the  quotative  verb  *en-  commonly 
both  precedes  and  follows  the  quoted  passage;  cf.  e.g.  (Ila/b).  Moreover, 
discourse  introduced  by  verbs  other  than  *en-  likewise  tends  to  be  followed  by 
finite  forms  of  *en-;  cf.  e.g.  (11c).  On  the  other  hand,  structures  in  which  non- 
finite,  absolutive  forms  of  *en-  follow  the  discourse  are  quite  rare  and  seem  to 
occur  generally  only  if  another  finite  verb  follows  *en-,  as  in  (lid). 

(11)  (a)    kolama  enun  itte  ale  epud  setna  sutuk  korekad    enun.  (Kolami) 

'en-  past  3f.s  *en-  past  3f.s 

The  wife  said,  "When  so  much  darkness  as  this  was  gone,  I  will 
wash  and  bring  them,"  (she  said).' 

(b)  ...  raja  ele        icesi  mekn  necu  vestaT     icesi        (Kuvi) 

thus'      *en-  past  3m.s  *en-  past  3m.s 

'The  king  said,  "I  will  tell  you  the  pulling-out  day,"  (he  said).' 

(c)  ...  raja  . . .  vecesi  nmu  imbiti  vati  icesi  (Kuvi) 

'speak'  past  3m. s  *en-  past  3m. s 

"...  The  king  asked,  "Where  did  you  come  from?"  (he  said).' 

(d)  nato  hano  va  pihmu     injr  icesi  burha  (Kuvi) 

*en-  abs.   *en-  past  3m.s 
'"Let  us  go  to  the  village  and  you  stop,"  said  the  old  man.' 

In  addition  to  furnishing  further  evidence  for  structures  with  non-adjacent 
multiple  finite  predicates,  passages  like  (1  la/c)  vs.  (d)  show  that  the  verb  *en-  is 
non-finite  only  under  conditions  of  (near-)adjacency,  i.e.,  when  it  cooccurs  with 
another  verb  to  the  right  of  the  embedded  discourse.  Elsewhere  it  is  finite.  This 
suggests  that  at  least  for  these  structures,  multiple  finite  predicates  preferably 
occur  under  conditions  of  non-adjacency.  (Note  also  that  the  pattern  in  (lid), 
with  both  absolutive  and  finite  forms  of  *en-  cooccurring,  differs  considerably 
from  what  we  find  in  Southern  Dravidian,  where  such  structures  are  not 
permitted;  cf.  (10c)  above.) 

Finally,  Kolami  and  Pengo  present  even  more  serious  difficulties  for  S's 
views  on  quotatives  and  syntactic  finiteness.  In  both  of  these  languages  we  find 
frequent  attestations  of  embedded  discourse  not  marked  either  by  *en-  or  by 
any  other  embedding  predicates.  One  hopes  that  a  revised  edition  will  more 
fully  address  these  issues. 

(12)  pot    O      e    tulatir  0  (Kolami) 

pres.pl.2 
'The  cock  (said),  "Where  are  you  running?"' 

3.4.  A  further  problem,  only  partly  addressed  by  S.  concerns  another, 
broader  set  of  multiple  finite  verbs  not  licensed  by  embedding  predicates  or 
clitics.    Such  stmctures  occur  under  conditions  where  common  wisdom  would 
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lead  one  to  expect  one  of  two  alternatives:  (i)  Two  or  more  propositions  are 
juxtaposed  as  independent  sentences,  each  with  a  finite  predicate,  and  without 
any  marking  of  coordination  ('asyndesis');  or  (ii)  Two  or  more  propositions  are 
embedded  in  a  single  sentence,  such  that  all  but  the  last  are  marked  by  a  non- 
finite  predicate,  usually  the  absolutive. 

Before  entering  the  discussion,  let  us  briefly  review  some  background 
information.  While  the  Dravidian  languages  permit  conjunction  of  nominal  con- 
stituents by  means  of  the  enclitic  particle  *-um  (commonly  attached  to  each  of 
the  coordinated  elements,  as  in  (13)),  such  conjunction  is  not  possible  for  finite 
verbs;  cf.  (14a).  Instead,  one  of  the  two  verbs  has  to  be  converted  into  a  non- 
finite  form  (cf.  the  absolutive  in  (14b)).  or  the  two  constructions  are  juxtaposed 
asyndetically,  as  in  (14c)  and  (14d).  Under  either  of  these  alternatives,  no  con- 
necting clitic  *-um  is  said  to  occur.  (Examples  (14a/b)  are  from  Beythan  1943: 
155,  (14d)  is  from  S's  monograph,  p.  84.)). 

(13)  avan-um  aval-um  (Tamil) 
'he  and  she' 

tottattil-um  kollaiyil-um  vTttil-um  ... 
'in  garden,  yard,  house,  and  ..." 

(14)  (a)     *ennai-p   parttan-um        kupittan-um  (Tamil) 

'me'        past  3m.s-&      past  3m.s-& 
'He  saw  and  called  me.' 

(b)  ennai-p     parttu-k     kupittan  (Tamil) 
•me'          abs.          past  3m.s 

'He  saw  and  called  me.'  or  'Having  seen  (me),  he  called  me.' 

(c)  ennai-p     parttan      ennai-p     kupittan  (Tamil) 
'me'          past  3m. s                  past  3m.s 

'He  saw  me.  He  called  me.' 

(d)  kunr-um     malaiy-um  pala  pin  o.riya  vantanen  (Old  Tamil) 

'come'pres.pert.ls 
paricil        kontanen  celarku 

'bring'pres.perf.ls 
'I  have  come  across  many  hills  and  mountains;  I  have  brought 
gifts  to  distribute.' 

As  S  notes  (34-35  and  87),  in  the  geographically  more  northern  langu- 
ages, structures  containing  finite  predicates  can  be  conjoined,  often  by  means 
of  overt  conjunctions;  cf.  e.g.  the  examples  in  (15).  And  these  conjunctions  in 
many  cases  are  borrowed  from  Indo-Aryan  (such  as  the  aur'in  (15d).  (Here  and 
below,  the  Kurukh  examples  come  from  Hahn  1911,  and  the  Pengo  and  Kuvi 
examples  from  S,  Burrow  &  Bhattacarya  1970,  or  Israel  1979.) 

(15)  (a)     bugabugaicat  umtat      rayla  mate  kuckatat  (Pengo) 

past3n.s    past3n.s  past3n.s 

'It  made  (the  sound)  "bugabuga",  flew  about,  (and)  settled  on  a 
Rayla  tree.' 
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(b)  doriti  himborita  pojites^      tenga  dekitesi  vatesi       (Kuvi) 

'cover'  'carry'  'come' 

past  3m.s  past  3m. s.  past  Sm.s 

injaha       burhini  ele  icesi  ... 

Q  =  &  'speak' 

past  Sm.s 

'He  covered  the  rope  with  cloth,  carried  and  came  =  brought  the 

stick,  and  spoke  thus  to  his  wife  ...' 

(c)  eni-m    xoskan       dara       pagar      lauckan  (Kurukh) 
'I-&'      pastlm.s.  particle                pastlm.s. 

' ...  and  I  dug  and  made  the  embankment.' 

or  '  ...  and  having  dug,  I  made  the  embankment.' 

(d)  as     lauras       aur    kecas  (Kurukh) 
'he'  past  Sm.s  '&'    past  Sm.s 

'He  was  beaten  and  died.' 

Unfortunately,  S  treats  constructions  of  this  type  under  very  different  head- 
ings. The  Northern  Dravidian  (Kurukh)  type  he  deals  with  as  a  replacement  of 
the  inherited  system  of  embedding  clitics  by  conjunctions  and  complementizers 
(35).  The  (South-)Central  Dravidian  cases,  on  the  other  hand,  are  mentioned  in 
the  discussion  of  the  SV  formation:  The  example  in  (15a)  is  explicitly  described 
as  resembling  coordinate  SV  formations  (78),  while  for  (15b),  S  notes  that  the 
form  injaha,  originally  an  absolutive  of  quotative  *en-,  has  acquired  'many  of  the 
properties  of  co-ordinating  conjunctions'  (87). 

Given  the  similarities  between  these  structures,  it  is  unfortunate  that  S 
does  not  treat  them  more  systematically  and  consistently.  In  the  following  I  dis- 
cuss a  few  additional  patterns  which  a  more  systematic  and  consistent  analysis 
should  consider,  and  I  conclude  by  briefly  pointing  out  possible  implications. 

First,  the  Kurukh  examples  in  (15c/d)  must  be  considered  in  the  larger 
context  of  the  sychronically  most  productive  Kurukh  counterpart  of  the  common 
Dravidian  (and  Indo-Aryan)  absolutive  formation.  Hahn  (1911:58)  describes 
this  formation  as  'adding  the  governing  verb  to  the  inflected  form  of  the 
preceding  verb  and  ...  making  the  latter  agree  with  the  former  in  number, 
gender,  person,  and  tense.'  This  description  is  perfectly  consonant  with  S's 
characterization  of  SV  formation.  However,  Hahn  further  specifies  that  the  prior 
member  of  such  a  double-finite  construction  may  optionally  be  followed  by 
either  the  particle  /c/'or  the  particle  dara;  cf.  (16a  vs.  b). 

(16)  (a)     as     es'as  barcas  (Kurukh) 

'he'  past  Sm.s  past  Sm.s 

'Having  broken  he  came.'  or  'He  broke  and  came.' 
(b)     as     es'as  (kF/dara)    barcas  (Kumkh) 

'he'  past  Sm.s    particle      past  Sm.s 
'Having  broken  he  came.'  or  'He  broke  and  came.' 
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Now,  (15c)  above  appears  to  be  an  example  of  just  such  a  construction, 
except  that  the  two  verbs  are  not  adjacent.  There  is,  however,  one  problem:  In 
Hahn's  (fairly  limited)  texts,  all  instances  of  non-adjacent  structures  of  this  type 
seem  to  contain  the  particle  dara;  and  on  p.  83,  Hahn  lists  that  particle  as  a 
conjunction,  meaning  'and'.  The  question,  then,  must  be:  What  is  the  historical 
—  and  synchronic  —  standing  of  dara'?  Is  it  a  conjunction?  Can  it  be  con- 
sidered a  clitic  of  the  type  *-o?  Or  was  it  —  and  the  alternative  marker  kT— 
originally  a  tense-aspect  particle  comparable  to  the  postverbal  ke  that 
commonly  appears  in  Kuvi  narratives? 

Secondly,  conjunctions  or  conjunction-like  elements  that  occur  between 
conjoined  propositions  with  finite  verbs  are  found  also  in  members  of  the 
Southern  subgroup.  In  Kannada  we  find  particles  like  mattu  (Sridhar  1989: 
§1.3.1.1,  Madtha  1981)  which  look  native  Dravidian,  as  well  as  af/7ava  (fvladtha 
1981)  which  comes  from  Sanskrit;  Tulu  offers  sivayi  (Dravidian?)  and  athava 
(Sanskrit);  cf.  Bhat  1967:113;  etc.  Now,  it  is  possible  to  consider  all  of  these  to 
have  originated  under  Indo-Aryan,  specifically  perhaps  Sanskrit,  influence.  But 
note  that  Indo-Aryan  particles  interposed  between  conjoined  sentences,  such 
as  Skt.  athava  'or',  Hindi  aur  'and',  are  themselves  innovations,  going  back  to 
earlier  adverbials  of  the  type  '(and)  then'.  Could  some  of  the  indigenous- 
looking  interposed  sentence  linkers  of  Dravidian  have  similar  sources? 

The  prototypical  conjunctions  of  early  Sanskrit  were  ca  'and',  va  'or'  which, 
like  Dravidian  *-um  'and,  also,  even',  were  enclitics  that  followed  the  coord- 
inated elements,  rather  than  being  interposed  between  them;  cf.  (17a). 

(17)  (a)    sa   ca       sa     ca 
'he  and  she' 

udyane  (ca)      prahgane  (ca)     grhe   ca  (...) 
'in  garden,  yard,  and  house  ...' 

(b)  mam  apasyac  (ca)  ahvayac    ca 
'me'    pastsg.3  &     pastsg.3  & 
'He  saw  and  (he)  called  me.' 

(c)  marh  (ca)  apasyat    marh  ca   ahvayat 
'me'    &     past  sg.3  'me'     &    past  sg.3 
'He  saw  me  and  (he)  called  me.' 

Now,  Sanskrit  enclitic  ca  and  va  can  be  used  to  coordinate  not  only  con- 
stituents, but  also  sentences,  as  in  (17c),  whereas  common  wisdom  has  it  that 
Dravidian  *-um  can  only  coordinate  constituents;  cf.  above.  However,  this  as- 
sessment of  the  behavior  of  Drav.'-am  does  not  appear  justified.  True,  (14a) 
shows  that  unlike  Skt.  ca  in  (17b),  Drav. *-um  cannot  attach  to  finite  verbs. 
However,  it  does  not  thereby  establish  that  Dravidian  has  no  counterpart  to  the 
Sanskrit  construction  (17c),  in  which  the  conjunction  is  enclitic  on  the  non- 
verbal first  constituents  of  conjoined  sentences. 

In  fact,  there  is  a  fair  amount  of  evidence  that  Dravidian  *-um  does  occur  in 
stmctures  comparable  to  (17c),  even  (or  perhaps  especially)  in  the  generally 
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Structures.  And  in  that  case  one  must  ask  whether  the  notion  of  embedding 
predicates  and  clitics  cannot  be  replaced  by  something  like  coordination. 

(ii)  Semantically/pragmatically,  multiple  finite  predicate  structures  con- 
nected by  means  of  *-um  can  be  considered  to  share  the  notion  of  coordination 
with  (most  of)  the  SV  formations  (including  the  special  subtypes  in  (8)  and  (9)), 
as  well  as  with  constructions  of  the  type  (15a)  which  describe  a  sequence  of 
conjoined  actions.  (Coordination  is  quite  obviously  involved  in  the  'coordinate' 
and  'balance'  subtypes  of  the  SV  formations.  The  'iterative'  subtype  involves  a 
sequence  of  coordinated  actions.  And  as  the  'etc'  of  glosses  of  the  'echo' 
subtype  suggest,  this  type  involves  coordination  of  the  base-verb  action  with 
similar  (unspecified)  actions.  Only  the  auxiliary  subtype  is  not  clearly  coord- 
inative,  unless  it  is  argued  that  the  base  verb  and  the  auxiliary  jointly 
contribute  to  the  meaning  or  function  of  the  structure.)  If  this  interpretation  is 
correct,  then  SV  formations  may  have  originated  as  stmctures  with  conjoined 
finite  verbs  comparable  to  the  Sanskrit  type  (17b),  except  that  the  Dravidian 
constraint  against  combining  -urn  with  finite  verbs  would  have  required  these 
structures  to  surface  without  an  overt  marker  of  coordination. 

(iii)  Syntactically,  structures  of  the  type  (18)  differ  from  SV  formations,  as 
well  as  from  the  type  (15a):  While  the  latter  require  the  multiple  finite  predicates 
to  have  identical  subjects,  the  type  (18)  permits  non-identical  subjects.  In  fact, 
here  identical  subjects  appear  to  be  in  the  minority. 

Accounting  for  structures  of  the  type  (18),  therefore,  may  require  more  than 
just  a  minor  adjustment  of  S's  rules  for  syntactic  or  morphological  finiteness. 

3.5.  A  further  set  of  problems  concerns  relative-correlative  structures  of 
the  type  (2)  above.  S  here  accepts  Lakshmi  Bai's  (1985)  argument  that  the  clitic 
*-o  is  an  ancient  and  necessary  element  of  these  stmctures^  and  thus  ap- 
parently feels  justified  in  postulating  for  Proto-Dravidian  a  clitic  embedder  that 
licenses  the  finite  predicates  of  relative  clauses. 

However,  Lakshmi  Bai's  Old  Tamil  data  do  not  support  such  an  assump- 
tion. The  only  two  Old  Tamil  examples  of  relative-correlative  stmctures  that  she 
cites  are  of  the  type  (19)  below,  with  a  'conditional',  i.e.  morphologically  non- 
finite  verbal  form  in  the  relative  clause.  (The  example  comes  from  Tolkappiam; 
its  transcription  follows  Ramasamy.) 

(19)  epporul  ayinum  allatu  ill  enin     apporul  alla-p  pirjtuporul  kural 

'which  thing*  cond.  'that  thing' 

'When  the  phrase  allatu  ill-  is  used  to  denote  the  scarcity  of 
anything,  it  is  to  be  used  along  with  the  other  things  available  of 
the  same  category.' 
or:  The  scarcity  of  which  thing  allu  ill-  is  used  to  denote,  ..." 

From  Lakshmi  Bai's  perspective  of  trying  to  establish  that  relative-correlative 
structures  are  old  and  indigenous  in  Dravidian,  there  is  nothing  wrong  with 
citing  such  examples  from  the  oldest  Dravidian  texts.    However,  the  examples 
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tell  us  nothing  about  the  early  Dravician  stmcture  of  relative  clauses  with  finite 
predicates.  In  fact,  they  might  lead  one  to  question  the  very  existence  of  such 
stnjctures. 

It  is  only  by  looking  at  Ramasamy  1981  that  we  can  determine  that  early 
Dravidian  did  have  relative(-correlative)  structures  with  finite  predicates;  cf.  the 
Old  Tamil  example  in  (20).  But  interestingly,  in  none  of  Ramasamy's  Old  Tamil 
examples  with  finite  verb  is  that  verb  followed  by  a  clitic. 

(20)  minnitai  natunkun  kataippeyal  vatai  ekkal     varuvat(u)    enri      0 

'when'  fut.3n.s  pres.2s 

akkal    vamvar     en-k  katal6r-e    (Kumntokai  277) 
'then'    fut.3s.h 

'My  lover  will  come  at  the  time  that  you  say  the  cold  wind  will 
come." 

Comparative  Dravidian  evidence  suggests  that  both  types  of  relative 
structures  are  ancient  in  Dravidian,  those  with  non-finite  conditionals  and  those 
with  finite  predicates  not  followed  by  clitics.  Examples  of  the  former  type  are 
found  in  Modern  Tamil  (Ramasamy  1981),  Toda  (Emeneau  1984:184),'*  Tulu 
(Bhat  1967:1 00), 5  and  Pengo  (Burrow  &  Bhattacarya  1970:58).  Examples  of 
finite  predicates  without  clitics  are  found  for  instance  in  Old  Malayalam  (Pillai 
1973:165),  Pengo  (Burrow  &  Bhattacarya  1970:  text  19.8),  Malto  (Mahapatra 
1979:223  [no.  16]).6 

It  thus  appears  as  if  early  Dravidian  had  two  competing  relative-correlative 
constructions,  one  with  non-finite  conditionals,  the  other  with  finite  predicates 
not  followed  by  clitics.  The  Old  and  Modern  Tamil  data  in  Ramasamy  1981 
suggest  that  structures  with  conditionals  are  of  the  generalizing  type  ("Whoever 
does  this  ...'),  while  the  ones  with  finite  predicates  can  be  either  generalizing  or 
non-generalizing.  In  fact,  the  conditional  relative  clauses  that  he  cites,  both  Old 
and  Modern  Tamil,  could  just  as  well  be  translated  by  'if  or  'when'  clauses  in 
which  the  interrogative  pronoun  functions  as  an  indefinite  or  generalizing  pron- 
oun 'anything,  anyone',  a  function  found  also  in  other  syntactic  contexts.  Cf.  the 
first  gloss  of  (19)  above.  What  needs  to  be  more  fully  investigated  is  whether 
this  distinction  is  a  peculiarity  of  Tamil  or  whether  it  holds  for  other  Dravidian 
languages  as  well. 

What  is  most  important  for  our  purposes  is  that  the  oldest  Dravidian  texts 
exhibit  relative(-correlative)  stnjctures  with  finite  predicates  not  followed  by 
clitics  and  that  such  stnjctures  are  attested  in  many  of  the  modern  languages  as 
well.  Under  these  conditions,  the  fact  that  Modern  Tamil  and  other  geo- 
graphically more  southern  modern  languages  require  the  clitic  -o  appears  to  be 
an  innovation,  geographically  perhaps  limited  to  the  southern  area. 

What  still  needs  to  be  determined  is  the  motivation  for  the  apparent 
replacemtrii  of  finite  relatives  without  clitics  by  struL-iures  with  clitic  *-o.  Lakshmi 
Bai,  who  believes  the  function  of  relative-correlatives  with  finite  verbs  always  to 
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be  generalizing,  considers  the  presence  of  the  clitic  motivated  by  that  function; 
for  *-o  also  elsewhere  is  used  as  generalizer  (as  in  the  Tamil  combination  et-o 
'something'  from  etu  'what'  +  *-o).  However,  Ramasamy  provides  ample  evid- 
ence that  Modern  Tamil  relative-correlatives  with  *-o  are  not  limited  to  gener- 
alizing function  and  that,  moreover,  one  of  their  functions  is  to  serve  as  'ord- 
inary', non-generalizing  relative  clauses  in  syntactic  contexts  where  prenominal 
structures  with  relative  participles  cannot  occur.  Sridhar  (1989)  provides  similar 
rich  evidence  for  Kannada.  Here  again,  more  comparative  and  historical  re- 
search is  needed. 

The  history  of  relative-correlative  clauses  with  finite  predicates,  thus,  is  still 
far  from  settled.  However,  if  as  suggested  above,  the  addition  of  *-o  to  such 
clauses  is  a  (possibly  regional)  innovation,  then  S's  account  for  syntactic  finite- 
ness  will  require  further  reconsideration. 

4.  Given  the  difficulties  observed  so  far,  it  may  not  be  inappropriate  to 
confront  S's  account  of  Dravidian  finiteness  with  a  possible  alternative. 

In  Hock  1984:102  I  noted  a  crosslinguistic  tendency  for  quotatives  and 
absolutives  (or  other  non-finite  verbal  subordinating  devices)  to  be  correlated 
with  SOV  order: 

(W)hile  the  presence  of  one  of  these  syntactic  features  does  not 
necessarily  entail  the  others  (cf.  absolutives  in  SVO  English  and 
quotatives  in  SVO  Ewe,  for  which  cf.  Lord  1976),  the  correlation  is  a 
common  tendency  in  SOV  languages.  Thus  many  of  the  Australian 
SOV  languages  have  absolutives  and  similar  non-finite  subordinating 
devices  (cf.  Yallop  1982:134-40).  The  situtation  is  similar  for  Quechua 
(cf.  e.g.  Cerron-Palomino  1976:251-73,  Coombs  et  al.  1976:171-2, 
Stark  et  al.  1973:24)  and  other  Amerindian  languages  (cf.  Davis 
1973:139,  180,  etc.  for  Luiseno).  Moreover,  at  least  some  varieties  of 
Quechua  (cf.  Coombs  et  al.  1976:176-9,  Stark  et  al.  1973:247),  as 
well  as  North  American  Luiseno  (Davis  1973:257-8)  additionally  have 
quotatival  constructions. 

And  as  noted  elsewhere  in  my  1984  paper,  similar  correlations  of  SOV  with 
absolutives  and  other  non-finite  formations,  as  well  as  with  quotatives,  are 
found  in  the  ancient  Near  Eastern  languages  Sumerian,  Accadian,  and  Elamite, 
as  well  as  in  the  Altaic  languages  (e.g.  Turkish).  I  went  on  to  suspect  that 

(T)his  correlation  is  connected  with  a  tendency  in  SOV  ... 
languages  to  make  the  generalization  that  sentences,  rather  than 
clauses,  have  just  one  finite  verb,  at  the  end  ...  of  the  sentence.  Non- 
finite  absolutive-like  structures  would  help  to  implement  this  gener- 
alization, by  eliminating  possible  violations  in  ...  subordinate  clauses 
...  Quotative  markers  seem  to  solve  the  problem  for  the  finite  verbs  of 
QUOTES,  by  indicating  that,  in  a  sense,  these  quotations  —  and  their 
finite  verbs  —  are  outside  the  scope  of  the  (matrix)  sentence.' 
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My  speculation  concerning  quotatives  finds  further  support  in  Steever's 
(1987c)  language-specific  explanation  of  the  motivation  for  the  use  of  quotatival 
structures  in  Dravidian: 

The  syntactic  restrictions  imposed  by  Rule  A  [the  rule  permitting 
only  one  finite  predicate  per  sentence]  lead  to  a  particularly  thorny 
pragmatic  constraint.  The  grammar  of  complex  sentences,  as  ex- 
pressed in  this  rule,  provides  no  means  of  forming  direct  discourse  ... 
This  consequence  of  Rule  A  goes  against  the  general  belief  that 
direct  discourse  is  both  less  marked  and  more  common  than  indirect 
discourse. 

In  addition,  SOV  languages  exhibit  a  variety  of  strategies  for  relative 
clause  formation.  These  include  of  course  non-finite,  participle-like  structures 
such  as  the  ones  found  in  Dravidian  and  commonly  postulated  as  their  only 
relative-clause  strategy.  Another  non-finite  strategy  is  nominalization,  as  found 
for  instance  in  Yuman  (Langdon  1977).  Beside  (or  instead  of)  these  non-finite 
devices,  however,  a  number  of  SOV  languages  offer  relative-correlative  struc- 
tures; cf.  Downing  1978.  And  as  Ramasamy  and  Lakshmi  Bai  have  shown, 
these  languages  include  the  Dravidian  language  family.  On  the  other  hand, 
relative-correlative  structures  do  not  seem  to  be  common  in  SVO  languages. 
Like  quotatives,  absolutives  (etc.)  they  seem  be  significantly  correlated  with 
SOV. 

This  correlation  can  be  accounted  for  if  we  note  the  following:  In  surface 
structure,  relative-correlative  constructions  differ  from  non-finite  relative  strate- 
gies, as  well  as  from  the  postnominal  clausal  type  of  languages  like  English,  by 
the  fact  that  the  relative  clause  is  not  embedded  into  the  correlative  'main' 
clause.  On  the  surface,  therefore,  they  seem  to  involve  coordination,  not  em- 
bedding or  subordination.  And  that  coordination  is  overtly  marked  by  a  pair  of 
correlated  markers,  the  relative  and  correlative  pronouns,  just  as  the  coord- 
ination of  constituents  (or  clauses)  tends  to  be  marked  by  a  pair  of  correlated 
markers  in  Dravidian  (*-um  ...  *-um)  and  early  Sanskrit  {ca  ...  ca);  cf.  3.4  above. 

Assume  now  that  this  surface  parallelism  between  relative-correlatives 
and  coordinate  structures  is  syntactically  significant  (at  the  level  where  finite- 
ness  is  assigned)  and  that  moreover,  the  Dravidian  pattern  of  apparent  sen- 
tential coordination  with  finite  predicates  in  the  conjoined  sentences  (and  the 
similar  pattern  in  Sanskrit)  is  not  in  some  ways  aberrant,  but  is  a  crosslinguistic 
possibility  in  SOV  languages. 

In  that  case  it  can  be  argued  that  the  constraint  that  sentences  can  contain 
only  one  finite  verb  is  a  property  of  structures  involving  subordination,  but  does 
not  (necessarily)  affect  structures  with  coordination.  Put  differently,  for  the 
purposes  of  finiteness,  structures  with  clausal/sentential  coordination  are  (or 
may  be)  treated  as  containing  conjoined  sentences,  not  (embedded)  clauses. 
Being  construed  as  coordinated  sentences,  relative-correlative  structures 
therefore  can  each  contain  a  finite  verb. 
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In  addition  to  accounting  for  the  early  Dravidian  relative  structures  with 
finite  predicates  not  licensed  by  clitic  particles,  the  alternative  interpretation  just 
proposed  explains  the  widespread  appearance  of  coordinate  structures  with 
multiple  finite  verbs  (cf.  3.4  above).  Like  relative-correlatives,  they  involve  sen- 
tential coordination  and  therefore  are  not  subject  to  the  'one-finite-verb' 
constraint. 

On  the  other  hand,  certain  elements  of  Dravidian  structure  are  more 
difficult  to  explain  under  this  alternative  approach.  One  of  these  is  the  apparent 
introduction  of  clitic  *-o  after  the  finite  verbs  of  relative  clauses  and  the  ex- 
tension of  the  resulting  pattern  in  a  large  number  of  Dravidian  languages,  at  the 
expense  of  structures  without  clitics.  Another  is  the  widespread  tendency  noted 
by  S  to  expand  the  set  of  quotative  embedders  to  verbs  other  than  *en-,  the 
prototypical  quotative  marker  of  early  and  common  Dravidian.  Both  of  these 
developments  are  admirably  accounted  for  by  S's  rules.  But  there  is  nothing  in 
the  alternative  account  that  would  motivate  the  appearance  of  final  clitics  in 
relative  structures;  and  the  modern  Dravidian  quotative  situation  is  markedly 
different  from  the  one  in  most  other  languages  with  quotatives,  where  the 
number  of  quotative  markers  is  highly  restricted. 

It  thus  appears  as  if  the  two  hypotheses  are  complementary;  what  is 
explained  by  one  is  difficult  for  the  other.  Perhaps  this  complementarity  reflects 
a  historical  change  in  Dravidian,  from  an  earlier  system  in  which  the  rules  of 
finiteness  did  not  apply  to  coordinated  structures,  to  a  later  and  perhaps 
regional  system,  in  which  they  did  and  in  which  they  consequently  had  to  be 
governed  by  a  new  set  of  finiteness  rules,  presumably  the  ones  proposed  by  S. 

Such  a  scenario  may  account  not  only  for  the  observed  differences  in 
relative-clause  formation  and  for  the  fact  that  coordinated  structures  of  the  type 
(18)  above  are  relatively  rare  in  geographically  more  southern  Dravidian.  It 
may  also  account  for  the  fact,  noted  by  S  (p.  47-8)  that  except  for  'echo' 
structures,  the  use  of  SV  formations  has  steadily  declined  in  Tamil  and  Malaya- 
lam  'so  that  it  had  virtually  fallen  from  the  language  by  the  dawn  of  the  early 
modern  period."  For  excepting  'echo'  structures  (and  perhaps  a  few  similar 
ones)  which  represent  special  instances  of  a  morphological  type  of  quasi- 
reduplication  not  limited  to  verbs,  SV  formations  with  their  multiple  finite  verbs 
would  constitute  apparent  exceptions  to  the  syntactic  rules  of  finiteness  and 
might  thus  be  expected  to  be  eliminated.  (S  similarly  attributes  the  loss  of  SV 
formations  in  Tamil  and  Dravidian  to  the  'tension  between  the  functional  and 
formal  treatments  of  finiteness'  (47). 7)  The  fact  that  SV  formations  show  no 
signs  of  attrition  in  the  more  northern  languages  may  then  reflect  the  fact  that 
these  languages  did  not  participate  in  the  switch  to  S's  system  of  finiteness,  but 
retained  the  older  system  which  permitted  multiple  finite  predicates  in  conjoined 
sentences  and  thus  provided  a  precedent  (if  not  a  source)  for  the  multiple  finite 
verbs  of  SV  formations. 
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7.  The  preceding  discussion  has  dwelt  mainly  on  difficulties  with  S's 
analysis.  This  should  not,  however,  be  taken  as  diminishing  the  value  of  his 
work.  As  noted  before,  his  monograph  presents  the  most  systematic  study  to 
date  into  the  question  of  finiteness  in  Dravidian,  and  his  conclusions  provide 
significant  challenges  to  the  traditional  view  of  Dravidian  as  obeying  the  'Strict 
OV  Constraint'.  Linguists  interested  in  Dravidian  syntax,  general  linguistic 
typology,  or  South  Asian  convergence  would  ignore  his  findings  at  their  own 
risk. 

The  fact  that  I  have  proposed  a  possible  alternative  to  his  hypothesis  is  in 
fact  a  measure  of  the  significance  of  his  work.  Confronted  with  the  traditional 
view  of  Dravidian  sentence  structure  plus  a  few  stray  examples  (such  as  (3c) 
above)  indicating  that  that  view  does  not  tell  the  whole  story,  it  would  have  been 
next  to  impossible  to  advance  the  hypothesis.  It  is  only  because  of  S's  system- 
atic investigation  and  well  argued  analysis  that  I  could  even  begin  to  develop 
an  alternative  hypothesis.^ 


NOTES 

*This  article  is  an  expanded  version  of  a  review  which  is  to  appear  in  the 
journal  Language.  The  University  of  Illinois  Research  Board  has  granted  partial 
funding  for  research  toward  this  review.  I  also  owe  gratitude  to  Nalini  Rau- 
Murthy  for  consultant  work  on  Kannada  and  to  S.  N.  Sridhar  for  making 
available  to  me  a  prepublication  copy  of  his  1 989  monograph.  The  responsib- 
ility for  any  remaining  errors  and  omissions  is  mine. 

The  following  abbreviations  are  used  in  the  glosses: 


& 

conjunction  'and'  (or  'also,  even') 

abs. 

absolutive 

cut. 

clitic 

cond. 

conditional 

DO 

direct  object 

f. 

feminine  (or  non-masculine,  in  languages  with  a  gender 
contrast  'masculine  :   non-masculine') 

fut. 

future 

h 

honorific  (plural) 

D 

indirect  object 

imp. 

imperative 

m. 

masculine 

N 

nominative 

n. 

neuter 

neg. 

negative  (verb) 

P 

plural 

perf. 

perfect 

pres. 

= 

present 

Q 

= 

quotative  marker 
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S         =  subject 

s         =  singular 

V         =  verb 

VN  =  verbal  noun 

1  Example  (3b)  comes  from  Natarajan  1977  where  it  is  cited  without  a 
gloss.  The  translation  given  here  is  a  conjecture. 

2Strictly  speaking,  the  echoed  constituent  is  not  the  direct  complement  of 
kane  but  of  the  'FPEP'  embavana  (from  the  verbal  root  *en-). 

3S  adds:  'More  compelling  is  the  fact  that  Dravidian  correlative  clauses 
tend  to  use  some  kind  of  quantification  to  join  together  the  two  clauses  of  the 
correlative  construction.  Although  use  of  the  clitic  -o  'or,  whether'  is  the  most 
widespread,  the  use  of  pairs  of  reduplicated  interrogative  and  demonstrative 
pronouns  is  also  attested  ...  To  the  best  of  my  knowledge,  Indo-Aryan  correl- 
ative clauses  do  not  make  much  use  of  quantification  to  form  correlative  con- 
structions.' Actually,  Indo-Aryan  does  employ  structures  with  doubled  relative 
and  correlative  pronouns  for  generalizing  propositions  of  the  type  Skt.  yo  yas 
tad  karoti  sa  (sa)  mOrkhah  'Whoever  does  this  is  a  fool.' 

^The  Toda  form  of  the  verb  functions  as  a  conditional  but  morphologically 
belongs  to  a  special  class  of  'subordinating'  verbs. 

5The  Tulu  structure  contains  the  synchronic  equivalent  of  a  conditional  (a 
combination  of  finite  verb  plus  the  conditional  particle  -ada-)  followed  by  the 
particle  -la  'also'. 

Hoda  has  an  alternative  to  non-finite  relative-clause  formation,  a  structure 
with  finite  verb  followed  by  -o.  Emeneau  (1984:185-6)  observes  that  for  phono- 
logical reasons  this  suffix  cannot  be  inherited  and  therefore  proposes  that  it  is  a 
borrowing  from  Tamil  or  Badaga,  both  of  which  have  -o.  This  might  suggest  that 
the  finite  predicate  construction  preceding  the  -6  is  likewise  a  borrowing;  but  it 
does  not  require  that  assumption. 

7|n  this  context,  S  proposes,  with  some  hesitation,  that  the  well-known  loss 
of  verbal  agreement  markers  in  Malayalam  may  have  been  another,  more 
radical  response  to  the  'tension'  between  the  two  types  of  finiteness.  This  is  an 
intriguing  proposition.  However,  there  is  perhaps  another  motivation  (which 
may  have  interacted  with  the  finiteness  'tension'):  As  Beythan  (1943:179) 
observes,  Tamil  does  not  permit  predicates  to  agree  with  conjoined  human  and 
non-human  subjects.  Instead,  we  find  structures  of  the  type  cited  in  (18c)  above 
and  reproduced  below,  with  two  coordinate  clauses  or  sentences,  each 
containing  its  own  finite  predicate  which  agrees  in  gender  with  its  respective 
subject.  Apparently,  then,  Tamil  third-person  plural  endings  can  signal 
agreement  either  with  humans  or  with  non-humans,  but  not  with  both. 
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manitar  a.rintarkal     kutiraiKat(-um)  a.rintana  (Tamil) 

past  3m.p  &     past3n.p 

'Men  perished  and  horses  perished.' 
=  'Men  and  horses  perished.' 

In  this  regard  it  is  interesting  that  Toda  lacks  the  usual  Dravidian  dis- 
tinctions for  gender  and  number  in  the  third  persons  of  the  verb,  although  like 
most  other  Dravidian  languages  it  has  distinct  endings  for  the  first  and  second 
persons,  singular  and  plural.  Similarly,  Caldwell  (1875(1 91 3):484)  notes  that 
Telugu  preterits  may  lack  tense,  person,  gender,  and  number  markers  for  third 
persons,  while  other  person/number  distinctions  are  always  marked.  Could  it 
be  that  Toda  (and  to  a  more  limited  extent,  Telugu)  gave  up  agreement  in  third 
persons  so  as  to  overcome  the  inability  of  third-person  endings  to  agree  with 
conjoint  human  and  non-human  subjects?  If  so,  could  Malayalam  have  gone 
one  step  farther  by  giving  up  verb  agreement  altogether? 

^Layout  and  printing  of  the  monograph  are  well  produced,  except  for  a 
large  number  of  misprints,  misspellings,  and  missing  words.  Fortunately,  most 
of  these  are  self-correcting.  The  only  passage  where  mistakes  tend  to  affect 
comprehension  is  on  pp.  30-36,  where  there  is  a  recurrent  mismatch  between 
the  numbering  of  examples  (38)  -  (44)  and  the  textual  references  to  them. 
Moreover,  the  same  number,  (44),  is  attached  to  two  different  examples  (pp.  32 
and  36),  the  second  of  which  contains  the  important  syntactic  rule  for  finiteness. 
In  addition,  the  discussion  (pp.  52-3)  of  Kota  and  Toda  auxiliary  developments 
is  condensed  to  the  point  of  being  almost  unintelligible.  Fortunately,  the  later 
discussion  of  similar  developments  in  Konda,  Pengo,  and  Tamil  (71-77,  78-86) 
is  clear  and  retroactively  helps  in  understanding  S's  earlier  remarks. 
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